Configuring the components - 6.5

Regex

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio

Procedure

  1. Select the tFileInputRegex component again to display its Component view, then define its properties:
  2. This scenario uses Built-in properties, so they are stored in this Job only and not in the Repository.
  3. Enter the path to the input file in the File Name field. This field is mandatory.
  4. Define the Row separator, which identifies the end of a row.
  5. Then define the Regular expression that delimits the fields of each row to be passed on to the next component. You can enter the regular expression as Java code, on multiple lines if needed.
    Warning:

    The regular expression must be enclosed in double quotes.

  6. In this expression, make sure you include all subpatterns matching the fields to be extracted.
  7. In this scenario, ignore the Header, Footer and Limit fields.
  8. Select a local (Built-in) Schema to define the data to pass on to the tFileOutputPositional component.
  9. You can load or create the schema through the Edit Schema function.
  10. Then define the properties of the second component, tFileOutputPositional:
  11. Enter the output path of the positional file.
  12. Enter the Encoding standard used for the output file. Note that encoding consistency verification is not currently supported.
  13. Select the Schema type. Click Sync columns to automatically synchronize the schema with the input file schema.
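
The idea behind steps 5 and 6 can be sketched in plain Java, outside the Studio. This is a minimal illustration, not the code the components generate: the class name, the sample rows, the three-field pattern and the column width of 10 are all assumptions made for the example. It shows a regular expression written as a double-quoted Java string with one subpattern (capture group) per field, and the extracted fields padded to a fixed width, as a positional file would store them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper class, used only to illustrate the procedure above.
class RegexFieldSketch {

    // Assumed pattern: three whitespace-separated fields, one capture group each.
    // As a Java string literal it is enclosed in double quotes and every
    // backslash of the regex is doubled.
    static final Pattern ROW = Pattern.compile("^(\\w+)\\s+(\\w+)\\s+(\\d+)$");

    // Returns the captured fields of a row, or an empty list if the row does not match.
    static List<String> extractFields(String row) {
        Matcher m = ROW.matcher(row);
        List<String> fields = new ArrayList<>();
        if (m.matches()) {
            for (int i = 1; i <= m.groupCount(); i++) {
                fields.add(m.group(i));
            }
        }
        return fields;
    }

    // Left-aligns each field and pads it with spaces to the given width.
    // Fields longer than the width are not truncated in this sketch.
    static String toPositional(List<String> fields, int width) {
        StringBuilder sb = new StringBuilder();
        for (String field : fields) {
            sb.append(String.format("%-" + width + "s", field));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed sample input, with "\n" as the row separator.
        String input = "John Smith 42\nJane Doe 37";
        for (String row : input.split("\n")) {
            System.out.println(toPositional(extractFields(row), 10));
        }
    }
}
```

A row that lacks one of the expected fields does not match the pattern at all, which is why the expression needs a subpattern for every field to be extracted.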