Measuring the processing time of a subjob and part of a subjob

Logs and errors (Integration)

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend MDM Platform
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Talend Data Management Platform
Talend Open Studio for ESB
Talend Data Fabric
Talend Big Data
Talend Data Services Platform
Talend ESB
Talend Data Integration
Talend Open Studio for Data Integration
Talend Big Data Platform
Talend Open Studio for Big Data
task
Data Governance > Third-party systems > Logs and errors components (Integration)
Data Quality and Preparation > Third-party systems > Logs and errors components (Integration)
Design and Development > Third-party systems > Logs and errors components (Integration)
EnrichPlatform
Talend Studio

This scenario is a subjob that does the following in a sequence:

For more technologies supported by Talend, see Talend components.

  • generates 1000 000 rows of first and last names,
  • gathers first names with their corresponding last names,
  • stores the output data in a delimited file,
  • measures the duration of the subjob as a whole,
  • measures the duration of the name replacement operation,
  • displays the gathered information about the processing time on the Run log console.

To measure the processing time of the subjob:

  • Drop the following components from the Palette onto the design workspace: tRowGenerator, tMap, tFileOutputDelimited, and tChronometerStop.
  • Connect the first three components using Main Row links.
Note: When connecting tMap to tFileOutputDelimited, you will be prompted to name the output table. The name used in this example is "new_order".
  • Connect tFileOutputDelimited to tChronometerStop using an OnComponentOk link.
  • Select tRowGenerator and click the Component tab to display the component view.
  • In the component view, click Basic settings. The Component tab opens on the Basic settings view by default.
  • Click Edit schema to define the schema of the tRowGenerator. For this Job, the schema is composed of two columns: First_Name and Last_Name, so click twice the [+] button to add two columns and rename them.
  • Click the RowGenerator Editor three-dot button to open the editor and define the data to be generated.
  • In the RowGenerator Editor, specify the number of rows to be generated in the Number of Rows for RowGenerator field and click OK. The RowGenerator Editor closes.
  • You will be prompted to propagate changes. Click Yes in the popup message.
  • Double-click on the tMap component to open the Map editor. The Map editor opens displaying the input metadata of the tRowGenerator component.
  • In the Schema editor panel of the Map editor, click the plus button of the output table to add two rows and define them.
  • In the Map editor, drag the First_Name row from the input table to the Last_Name row in the output table and drag the Last_Name row from the input table to the First_Name row in the output table.
  • Click Apply to save changes.
  • You will be prompted to propagate changes. Click Yes in the popup message.
  • Click OK to close the editor.
  • Select tFileOutputDelimited and click the Component tab to display the component view.
  • In the Basic settings view, set tFileOutputDelimited properties as needed.
  • Select tChronometerStop and click the Component tab to display the component view.
  • In the Since options panel of the Basic settings view, select Since the beginning option to measure the duration of the subjob as a whole.
  • Select/clear the other check boxes as needed. In this scenario, we want to display the subjob duration on the console preceded by the component name.
  • If needed, enter a text in the Caption field.
  • Save your Job and press F6 to execute it.
Note: You can measure the duration of the subjob the same way by placing tChronometerStop below tRowGenerator, and connecting the latter to tChronometerStop using an OnSubjobOk link.