tDatasetInput Standard properties - 7.1

Data Preparation

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Data Preparation
Talend Studio

These properties are used to configure tDatasetInput running in the Standard Job framework.

The Standard tDatasetInput component belongs to the Talend Data Preparation family.

The component in this framework is available in all subscription-based Talend products.

Basic settings

Schema and Edit Schema

A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. When you create a Spark Job, avoid the reserved word "line" when naming the fields.

Click Edit schema to make changes to the schema. If the current schema is of the Repository type, three options are available:

  • View schema: choose this option to view the schema only.

  • Change to built-in property: choose this option to change the schema to Built-in for local changes.

  • Update repository connection: choose this option to change the schema stored in the repository and decide whether to propagate the changes to all the Jobs upon completion. If you just want to propagate the changes to the current Job, you can select No upon completion and choose this schema metadata again in the Repository Content window.

URL

Type the URL of the Talend Data Preparation web application, between double quotes.

Email

Type the email address that you use to log in to the Talend Data Preparation web application, between double quotes.

Password

Click the [...] button and type your user password for the Talend Data Preparation web application, between double quotes.

If you are working with Talend Cloud Data Preparation and if:

  • MFA (Multi-Factor Authentication) is enabled, enter an access token in the field.
  • MFA is not enabled but SSO (Single Sign-On) is configured, enter either an access token or your password in the field.

    It is recommended to use tokens, as passwords will soon become obsolete and be removed.

  • MFA is not enabled and SSO is not configured, enter either an access token or your password in the field.
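The three cases above reduce to a single rule: with MFA enabled only an access token is accepted, and in every other case either a token or the password works. The following is a minimal sketch of that rule, not part of any Talend API; the class, enum, and method names are hypothetical and exist only for illustration.

```java
import java.util.EnumSet;
import java.util.Set;

class PasswordFieldRule {
    // Hypothetical names, used only to illustrate the rule described above.
    enum Credential { ACCESS_TOKEN, PASSWORD }

    // Returns the kinds of credential that may be entered in the Password field.
    static Set<Credential> acceptedCredentials(boolean mfaEnabled, boolean ssoConfigured) {
        if (mfaEnabled) {
            // MFA enabled: only an access token is accepted.
            return EnumSet.of(Credential.ACCESS_TOKEN);
        }
        // MFA not enabled: a token or the password is accepted,
        // whether or not SSO is configured (ssoConfigured does not change the result).
        return EnumSet.allOf(Credential.class);
    }

    public static void main(String[] args) {
        System.out.println("MFA on:          " + acceptedCredentials(true, false));
        System.out.println("MFA off, SSO on: " + acceptedCredentials(false, true));
        System.out.println("MFA off, no SSO: " + acceptedCredentials(false, false));
    }
}
```

Since a token is accepted in all three cases, using a token is the one choice that works regardless of how the account is configured.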

Dataset ID

Type the name of the dataset to be used, between double quotes.
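Assuming these fields are evaluated as Java expressions, as is usual for component fields in Talend Studio, a value typed between double quotes is simply a Java string literal. The sketch below shows what such entries might look like; the host name, port, email address, and dataset name are all hypothetical placeholders, not values taken from any real installation.

```java
class DatasetInputFieldValues {
    // Hypothetical placeholder values; replace them with those of your own environment.
    static final String URL = "https://tdp.example.com:9999"; // URL field
    static final String EMAIL = "jane.doe@example.com";       // Email field
    static final String DATASET = "customers_2019";           // Dataset ID field

    public static void main(String[] args) {
        // Print each value wrapped in double quotes, as it would be typed in the field.
        System.out.println("URL:        \"" + URL + "\"");
        System.out.println("Email:      \"" + EMAIL + "\"");
        System.out.println("Dataset ID: \"" + DATASET + "\"");
    }
}
```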

Fetch Schema

Click this button to retrieve the schema from the dataset defined in the Dataset ID field.

Advanced settings

Platform

Two options are available for this drop-down list:

  • Data Preparation is the default value. Use this option if you want to use the tDatasetInput component in the Talend Data Preparation context.
  • Data Catalog. This value only exists for internal testing purposes and should not be selected unless explicitly requested by your Talend support representative.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at the Job level as well as at each component level.

Usage

Usage rule

This component is used as a start component and requires an output link.

Limitation

  • If the column headers of your dataset in Talend Data Preparation contain whitespace or special characters, the schema fetched in Talend Studio is considered invalid.

  • In Talend Studio, after fetching a schema from a dataset, you can add new columns, but you must then give the equivalent column in Talend Data Preparation exactly the same name (case-sensitive).

  • Variations between the schema of your data in Talend Data Preparation and in Talend Studio may lead to errors.
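
The limitations above can be checked before a Job is run. The following is a minimal sketch, not a Talend utility: it flags column headers that contain whitespace or special characters, and lists Studio columns that have no exact, case-sensitive match in the dataset. The class and method names are hypothetical, and the definition of a "safe" header (letters, digits, and underscores, not starting with a digit) is an assumption, since the documentation does not specify which characters count as special.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

class ColumnHeaderCheck {
    // Assumption: a safe header contains only letters, digits, and underscores
    // and does not start with a digit. The exact rule applied by Talend may differ.
    private static final Pattern SAFE_HEADER = Pattern.compile("[A-Za-z_][A-Za-z0-9_]*");

    // Returns the headers that contain whitespace or special characters.
    static List<String> findProblemHeaders(List<String> headers) {
        List<String> problems = new ArrayList<>();
        for (String header : headers) {
            if (!SAFE_HEADER.matcher(header).matches()) {
                problems.add(header);
            }
        }
        return problems;
    }

    // Returns the Studio columns with no exact (case-sensitive) match in the dataset.
    static List<String> findUnmatchedColumns(List<String> studioColumns, List<String> datasetColumns) {
        List<String> unmatched = new ArrayList<>();
        for (String column : studioColumns) {
            // List.contains relies on String.equals, which is case-sensitive.
            if (!datasetColumns.contains(column)) {
                unmatched.add(column);
            }
        }
        return unmatched;
    }

    public static void main(String[] args) {
        List<String> datasetHeaders = Arrays.asList("customer_id", "first name", "amount%", "City");
        List<String> studioColumns = Arrays.asList("customer_id", "city");

        System.out.println("Problem headers:   " + findProblemHeaders(datasetHeaders));
        System.out.println("Unmatched columns: " + findUnmatchedColumns(studioColumns, datasetHeaders));
    }
}
```

In this example, "first name" and "amount%" are flagged as problem headers, and the Studio column "city" is reported as unmatched because the dataset column is named "City".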