Configuring tSortRow - 7.3

Orchestration (Integration)

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Orchestration components (Integration)
Data Quality and Preparation > Third-party systems > Orchestration components (Integration)
Design and Development > Third-party systems > Orchestration components (Integration)

Procedure

  1. Double-click tSortRow to open its Component view.
  2. Under the Criteria table, click the button three times to add three rows to the table.
  3. In the Schema column column, select, for each row, the schema column to be used as the sorting criterion. In this example, select ZipCode, City and Address, sequentially.
  4. In the Sort num or alpha? column, select alpha for all the three rows.
  5. In the Order asc or desc column, select asc for all the three rows.
  6. If the schema does not appear, click the Sync columns button to retrieve the schema from the preceding component.
  7. Click Advanced settings to open its view.
  8. Select Sort on disk. Then the Temp data directory path field and the Create temp data directory if not exist check box appear.
  9. In Temp data directory path, enter the path to, or browse to the folder you want to use to store the temporary data processed by tSortRow. In this approach, tSortRow is enabled to sort considerably more data.
    As the threads will overwrite each other if they are written in the same directory, you need to create the folder for each thread to be processed using its thread ID. To do this, you can drop directly the global variable THREAD_ID of tCollector from the Outline view into this field; then the corresponding code is generated automatically, reading:
    
                      ((Integer)globalMap.get("tCollector_1_THREAD_ID"))
                   
    This makes the path read like:
    "E:/Studio/workspace/temp"+((Integer)globalMap.get("tCollector_1_THREAD_ID")).
    If the Outline view does not appear in the Studio, you can display it by selecting it from the Show view dialog box. For further information, see Talend Studio User Guide.
  10. Ensure that the Create temp data directory if not exists check box is selected.