tDataprepRun Standard properties - 6.4

Data Preparation

author: Talend Documentation Team
EnrichVersion: 6.4
EnrichProdName: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Real-Time Big Data Platform
task: Data Governance > Third-party systems > Data Preparation components; Data Quality and Preparation > Third-party systems > Data Preparation components; Design and Development > Third-party systems > Data Preparation components
EnrichPlatform: Talend Data Preparation, Talend Studio

These properties are used to configure tDataprepRun running in the Standard Job framework.

The Standard tDataprepRun component belongs to the Talend Data Preparation family.

The component in this framework is available in Talend Data Management Platform, Talend Big Data Platform, Talend Real-Time Big Data Platform, Talend Data Services Platform, Talend MDM Platform and Talend Data Fabric.

Basic settings

URL

Type the URL to the Talend Data Preparation web application, between double quotes.

Username

Type the email address that you use to log in to the Talend Data Preparation web application, between double quotes.

Note: If you are using Talend Data Preparation Cloud, you must use your Talend Cloud login instead.

Password

Click the [...] button and type your user password for the Talend Data Preparation web application, between double quotes.
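The double quotes are needed because the values typed in these fields are treated as Java string expressions in the Job. The following is a minimal sketch of that idea, not code generated by Talend Studio: the class name, the `Context` stand-in, the `dataprepUrl` variable, and the example URL are all hypothetical placeholders.

```java
class QuotedFieldValues {

    /** Hypothetical stand-in for the context object of a Job. */
    static class Context {
        // Placeholder value; replace with the address of your own instance.
        String dataprepUrl = "http://dataprep.example.com:9999";
    }

    static final Context context = new Context();

    /** A literal value: typed in the field together with its double quotes. */
    static String urlFromLiteral() {
        return "http://dataprep.example.com:9999";
    }

    /** A variable: typed in the field without quotes, because it already evaluates to a String. */
    static String urlFromContext() {
        return context.dataprepUrl;
    }

    public static void main(String[] args) {
        System.out.println(urlFromLiteral());
        System.out.println(urlFromContext());
    }
}
```

Both methods return the same string. Note that, as listed under Limitations, using a context variable in the URL prevents you from using the button that opens the preparation in Talend Data Preparation.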

Preparation

To complete the Preparation field, click one of the following:
  • Choose an existing preparation: select from a list of the preparations that were previously created in Talend Data Preparation.

  • Or create a new one: create a new preparation based on your input data.

Click the edit button to open, in Talend Data Preparation, the preparation that corresponds to the ID defined in the Preparation field.

Version

If you have created several versions of your preparation, you can choose which one you want to use in the Job. To complete the Version field, click Choose a Version to select from the list of existing versions, including the current version of the preparation.

Fetch Schema

Click this button to retrieve the schema from the preparation defined in the Preparation field.

Schema and Edit Schema

A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. The schema is either Built-in or stored remotely in the Repository.

Click Edit schema to make changes to the schema. If the current schema is of the Repository type, three options are available:

  • View schema: choose this option to view the schema only.

  • Change to built-in property: choose this option to change the schema to Built-in for local changes.

  • Update repository connection: choose this option to change the schema stored in the repository and decide whether to propagate the changes to all the Jobs upon completion. If you just want to propagate the changes to the current Job, you can select No upon completion and choose this schema metadata again in the [Repository Content] window.

Click Sync columns to retrieve the schema from the previous component connected in the Job.
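To make the notion of a schema as a row description more concrete, here is a small sketch that models a schema as an ordered list of typed columns and checks whether a row fits it. It is an illustration only: the `Column` class, the three column names, and the `matchesSchema` helper are hypothetical and do not reflect how Talend Studio stores schemas internally.

```java
import java.util.Arrays;
import java.util.List;

class SchemaSketch {

    /** One column of a schema: a name and the Java type of its values. */
    static final class Column {
        final String name;
        final Class<?> type;

        Column(String name, Class<?> type) {
            this.name = name;
            this.type = type;
        }
    }

    /** Hypothetical three-column schema; the column names are illustrative only. */
    static final List<Column> SCHEMA = Arrays.asList(
            new Column("id", Integer.class),
            new Column("name", String.class),
            new Column("email", String.class));

    /** Returns true if the row has one value per column and every non-null value has the column's type. */
    static boolean matchesSchema(Object[] row) {
        if (row.length != SCHEMA.size()) {
            return false;
        }
        for (int i = 0; i < row.length; i++) {
            if (row[i] != null && !SCHEMA.get(i).type.isInstance(row[i])) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Three values of the expected types: fits the schema.
        System.out.println(matchesSchema(new Object[] {1, "Jane", "jane@example.com"})); // prints true
        // Only two values: the number of fields does not match.
        System.out.println(matchesSchema(new Object[] {1, "Jane"})); // prints false
    }
}
```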

Advanced settings

Limit Preview

Specify the number of rows to which you want to limit the preview.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at the Job level as well as at each component level.

Global Variables

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. If the component has a Die on error check box, this variable functions only when that check box is cleared.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see the Talend Studio User Guide.
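As an illustration of reading the ERROR_MESSAGE variable, the sketch below looks it up in a map keyed by component name and variable name, which is the form the variable takes when selected from the variable list in a Job. The component label `tDataprepRun_1`, the helper methods, and the sample message are assumptions made for this example; in a real Job, `globalMap` is provided by the Job itself rather than created by hand.

```java
import java.util.HashMap;
import java.util.Map;

class ErrorMessageLookup {

    /** Builds the key of a component variable, for example tDataprepRun_1_ERROR_MESSAGE. */
    static String key(String componentName, String variable) {
        return componentName + "_" + variable;
    }

    /** Returns the error message stored for the component, or null if none was recorded. */
    static String errorMessage(Map<String, Object> globalMap, String componentName) {
        return (String) globalMap.get(key(componentName, "ERROR_MESSAGE"));
    }

    public static void main(String[] args) {
        // Stand-in for the Job's globalMap, filled by hand for this example.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tDataprepRun_1_ERROR_MESSAGE", "sample error message");

        // Because ERROR_MESSAGE is an After variable, read it once the component has finished.
        System.out.println(errorMessage(globalMap, "tDataprepRun_1"));
    }
}
```

Returning null when no message was recorded lets the calling code distinguish a successful run from a failed one without extra flags.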

Usage

Usage rule

This component is an intermediary step. It requires an input flow as well as an output flow.

Limitations

  • If the dataset is updated after the tDataprepRun component has been configured, the schema needs to be fetched again.

  • If a context variable is used in the URL of the dataset, you cannot use the button to edit the preparation directly in Talend Data Preparation.