The following Job reads from the cache memory a huge amount of data loaded by two tHashOutput components and pass it to a tFileOutputDelimited. The goal of this scenario is to show the speed at which mass data is read and written. In practice, data feed generated in this way can be used as lookup table input for some use cases where a big amount of data needs to be referenced.
Drag and drop the following components from the Palette to the workspace: tFixedFlowInput (X2), tHashOutput (X2), tHashInput and tFileOutputDelimited.
Connect the first tFixedFlowInput to the first tHashOutput using a Row > Main link.
Connect the second tFixedFlowInput to the second tHashOutput using a Row > Main link.
Connect the first subjob (from tFixedFlowInput_1) to the second subjob (to tFixedFlowInput_2) using an OnSubjobOk link.
Connect tHashInput to tFileOutputDelimited using a Row > Main link.
Connect the second subjob to the last subjob using an OnSubjobOk link.
Double-click the first tFixedFlowInput component to display its Basic settings view.
Select Built-In from the Schema drop-down list.
You can select Repository from the Schema drop-down list to fill in the relevant fields automatically if the relevant metadata has been stored in the Repository. For more information about Metadata, see the Talend Studio User Guide.
Click Edit schema to define the data structure of the input flow. In this case, the input has two columns: ID and ID_Insurance, and then click OK to close the dialog box.
Fill in the Number of rows field to specify the entries to output, e.g. 50000.
Select the Use Single Table check box. In the Values table and in the Value column, assign values to the columns, e.g. 1 for ID and 3 for ID_Insurance.
Perform the same operations for the second tFixedFlowInput component, with the only difference in the values. That is, 2 for ID and 4 for ID_Insurance in this case.
Double-click the first tHashOutput to display its Basic settings view.
Select Built-In from the Schema drop-down list and click Sync columns to retrieve the schema from the previous component. Select Keep all from the Keys management drop-down list and keep the Append check box selected.
Perform the same operations for the second tHashOutput component, and select the Link with a tHashOutput check box.
Double-click tHashInput to display its Basic settings view.
Select Built-In from the Schema drop-down list. Click Edit schema to define the data structure, which is the same as that of tHashOutput.
Select tHashOutput_1 from the Component list drop down list.
Double-click tFileOutputDelimited to display its Basic settings view.
Select Built-In from the Property Type drop-down list. In the File Name field, enter the full path and name of the file, e.g. "E:/Allr70207V5.0/Talend-All-r70207-V5.0.0NB/workspace/out.csv".
Select the Include Header check box and click Sync columns to retrieve the schema from the previous component.