Scenario: Reading data in the stewardship console database - 6.3

Talend Components Reference Guide

EnrichVersion
6.3
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
task
Data Governance
Data Quality and Preparation
Design and Development
EnrichPlatform
Talend Studio

This scenario describes a two-component Job that reads data sets (data records) from the tasks stored in the database of Talend Data Stewardship Console according to the criteria you define in the Basic settings view of the tStewardshipTaskInput component.

Then you can use any Talend output component to write the data retrieved from the stewardship database into the target application or database.

In this example, the filtered data is fetched and displayed in the log console.

  • Drop the tStewardshipTaskInput and tLogRow components from the Palette onto the design workspace.

  • Connect the two components together using a Row Main link.

  • Double-click tStewardshipTaskInput to open the Basic settings view and define the component properties.

  • In the Schema list, select Built-In and click the three-dot button next to Edit schema to open a dialog box.

    Here you can define the structure of the data you want to read on the Talend Data Stewardship Console database.

Note

The default schema columns in the schema dialog box vary according to whether the the Target record only check box is selected or not.

If the Target record only check box is selected, the by-default schema looks like the following:

If the Target record only check box is not selected, the by-default schema has two extra columns: TARGET and SOURCE.

The TARGET column will indicate which data set is the target record in each of the tasks in the database. The SOURCE column will indicate the name of the source application for each source record in the tasks.

In this scenario, data is collected from the three defined input columns: Firstname, Lastname and DOB and all the by-default columns.

  • Click OK to close the dialog box and proceed to the next step.

  • In the Url field, enter the URL for connecting to the stewardship console database.

  • In the Username and Password fields, enter your login and password to connect to the MDM server.

  • From the Type list, select the type of the tasks from which you want to retrieve data record: Resolution or Data. In this example, you want retrieve data only from resolution tasks.

    For further information on task type, see Talend Data Stewardship Console User Guide.

  • In the Owner field, enter between inverted commas the name of the task owner, the user to whom the task is assigned, Administrator in this example.

Note

Task can be assigned to a specific user either from the Basic settings view of the tStewardshipTaskOutput component, or directly from the stewardship console by an administrator. For further information, see tStewardshipTaskOutput.

  • In the Tag field, enter between inverted commas the name of the tag category associated with the tasks you want to read, not used in this example.

    For further information, see Talend Data Stewardship Console User Guide.

  • In the Start Date and End Date fields, enter between inverted commas a task creation date range within which you want to read the tasks, not used in this example.

  • In the Status field, select the task status to decide from what tasks you want to retrieve data. In this example, you want to retrieve data only from resolved tasks.

  • In the Star ranking field, select from the list the number of stars, 0 through 5, assigned to the tasks in the stewardship console. This will enable you to filter the tasks from which you want to retrieve data by the star rate assigned to them.

    In this example, select 2 from the list. Data will be retrieved from all tasks that have been assigned a star rate up to 2.

Note

If you select All from the list, you will retrieve data from all tasks regardless of the star ranking assigned to each of them.

  • In the Limit field, enter a number to limit the tasks to retrieve from the stewardship database.

  • Leave the Target record only check box selected in order to retrieve from the tasks only the target record.

  • In the design workspace, double-click the tLogRow component to display its Basic settings view and set the component properties.

  • Click Edit Schema to open the schema dialog box and ensure that the schema has been collected from the previous component. If not, click Sync Columns.

  • Save the Job and press F6 to execute it.

The tStewardshipTaskInput component has retrieved from the stewardship console database the target data records from all resolved tasks that have been assigned a star rate from 0 through 2. The output schema is the same for all the retrieved records.

  • Clear the Target record only check box in order to retrieve from the tasks all source and target records.

    This will retrieve more than one row for each task.

  • In the Basic settings view of the tLogRow component, click Sync columns to synchronize the schema between the input link (that have two extra columns now) and the tLogRow component.

  • Save the Job and press F6 to execute it.

    The tStewardshipTaskInput component retrieves from the stewardship console database both the target and source data records from all resolved tasks that have been assigned a star rate of 0 through 2.