Configuring the components - Cloud - 8.0

Delimited

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > File components (Integration) > Delimited components
Data Quality and Preparation > Third-party systems > File components (Integration) > Delimited components
Design and Development > Third-party systems > File components (Integration) > Delimited components
Last publication date
2024-02-29

Procedure

  1. Double-click tFileFetch to display the Basic settings tab in the Component view and set the properties.
  2. From the Protocol list, select the appropriate protocol to access the server on which your data is stored.
  3. In the URI field, enter the URI required to access the server on which your file is stored.
  4. Select the Use cache to save the resource check box to add your file data to the cache memory. This option allows you to use the streaming mode to transfer the data.
  5. In the workspace, click tSleep to display the Basic settings tab in the Component view and set the properties.
    By default, tSleep's Pause field is set to 1 second. Do not change this setting. It pauses the second Job in order to give the first Job, containing tFileFetch, the time to read the file data.
  6. In the workspace, double-click tFileInputDelimited to display its Basic settings tab in the Component view and set the properties.
  7. In the File name/Stream field:
    - Delete the default content.
    - Press Ctrl+Space to view the variables available for this component.
    - Select tFileFetch_1_INPUT_STREAM from the auto-completion list, to add the following variable to the Filename field: ((java.io.InputStream)globalMap.get("tFileFetch_1_INPUT_STREAM")).
  8. From the Schema list, select Built-in and click [...] next to the Edit schema field to describe the structure of the file that you want to fetch. The US_Employees file is composed of six columns: ID, Employee, Age, Address, State, EntryDate.
    Click [+] to add the six columns and set them as indicated in the above screenshot. Click OK.
  9. In the workspace, double-click tLogRow to display its Basic settings in the Component view and click Sync Columns to ensure that the schema structure is properly retrieved from the preceding component.