Reading data in the stewardship console database - 6.4

Data Stewardship Console

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Data Fabric
Talend MDM Platform
task
Data Governance > Third-party systems > MDM components > Data Stewardship Console components
Data Quality and Preparation > Third-party systems > MDM components > Data Stewardship Console components
Design and Development > Third-party systems > MDM components > Data Stewardship Console components
EnrichPlatform
Talend Studio

This scenario applies only to Talend MDM Platform and Talend Data Fabric.

For more technologies supported by Talend, see Talend components.

This scenario describes a two-component Job that reads data sets (data records) from the tasks stored in the database of Talend Data Stewardship Console according to the criteria you define in the Basic settings view of the tStewardshipTaskInput component.

Then you can use any Talend output component to write the data retrieved from the stewardship database into the target application or database.

In this example, the filtered data is fetched and displayed in the log console.

  • Drop the tStewardshipTaskInput and tLogRow components from the Palette onto the design workspace.

  • Connect the two components together using a Row Main link.

  • Double-click tStewardshipTaskInput to open the Basic settings view and define the component properties.

  • In the Schema list, select Built-In and click the three-dot button next to Edit schema to open a dialog box.

    Here you can define the structure of the data you want to read on the Talend Data Stewardship Console database.

Note:

The default schema columns in the schema dialog box vary according to whether the the Target record only check box is selected or not.

If the Target record only check box is selected, the by-default schema looks like the following:

If the Target record only check box is not selected, the by-default schema has two extra columns: TARGET and SOURCE.

The TARGET column will indicate which data set is the target record in each of the tasks in the database. The SOURCE column will indicate the name of the source application for each source record in the tasks.

In this scenario, data is collected from the three defined input columns: Firstname, Lastname and DOB and all the by-default columns.

  • Click OK to close the dialog box and proceed to the next step.

  • In the Url field, enter the URL for connecting to the stewardship console database.

  • In the Username and Password fields, enter your login and password to connect to the MDM server.

  • From the Type list, select the type of the tasks from which you want to retrieve data record: Resolution or Data. In this example, you want retrieve data only from resolution tasks.

    For further information on task type, see Talend Data Stewardship Console User Guide.

  • In the Owner field, enter between inverted commas the name of the task owner, the user to whom the task is assigned, Administrator in this example.

Note:

Task can be assigned to a specific user either from the Basic settings view of the tStewardshipTaskOutput component, or directly from the stewardship console by an administrator. For further information, see tStewardshipTaskOutput.

  • In the Tag field, enter between inverted commas the name of the tag category associated with the tasks you want to read, not used in this example.

    For further information, see Talend Data Stewardship Console User Guide.

  • In the Start Date and End Date fields, enter between inverted commas a task creation date range within which you want to read the tasks, not used in this example.

  • In the Status field, select the task status to decide from what tasks you want to retrieve data. In this example, you want to retrieve data only from resolved tasks.

  • In the Star ranking field, select from the list the number of stars, 0 through 5, assigned to the tasks in the stewardship console. This will enable you to filter the tasks from which you want to retrieve data by the star rate assigned to them.

    In this example, select 2 from the list. Data will be retrieved from all tasks that have been assigned a star rate up to 2.

Note:

If you select All from the list, you will retrieve data from all tasks regardless of the star ranking assigned to each of them.

  • In the Limit field, enter a number to limit the tasks to retrieve from the stewardship database.

  • Leave the Target record only check box selected in order to retrieve from the tasks only the target record.

  • In the design workspace, double-click the tLogRow component to display its Basic settings view and set the component properties.

  • Click Edit Schema to open the schema dialog box and ensure that the schema has been collected from the previous component. If not, click Sync Columns.

  • Save the Job and press F6 to execute it.

The tStewardshipTaskInput component has retrieved from the stewardship console database the target data records from all resolved tasks that have been assigned a star rate from 0 through 2. The output schema is the same for all the retrieved records.

  • Clear the Target record only check box in order to retrieve from the tasks all source and target records.

    This will retrieve more than one row for each task.

  • In the Basic settings view of the tLogRow component, click Sync columns to synchronize the schema between the input link (that have two extra columns now) and the tLogRow component.

  • Save the Job and press F6 to execute it.

    The tStewardshipTaskInput component retrieves from the stewardship console database both the target and source data records from all resolved tasks that have been assigned a star rate of 0 through 2.

The above capture shows an example of the retrieved data from one of the tasks in the stewardship console database. Three rows have been outputted for this task: the target record, where TARGET = true, and two input records where TARGET = false and SOURCE = CRM.