Configuring the input component - 7.1

Text standardization

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Standardization components > Text standardization components
Data Quality and Preparation > Third-party systems > Data Quality components > Standardization components > Text standardization components
Design and Development > Third-party systems > Data Quality components > Standardization components > Text standardization components
EnrichPlatform
Talend Studio

Before you begin

You retrieved the tJapaneseTransliterate_standard_scenario.zip file.

Procedure

  1. Double-click tFixedFlowInput to open its Basic settings view in the Component tab.
  2. Click the Edit schema button to define the columns of the source dataset and their data type.
  3. Click the [+] button to add the schema columns.

    Example

    In this example, the input schema is made of six columns to show the ways to transliterate supported by the tJapaneseTransliterate component.

  4. Click OK to validate these changes and accept the propagation when prompted.
  5. In the Mode area, select Use Inline Content(delimited file).
  6. Define the characters to be used as Row Separator and Field Separator.
  7. In the Content field, enter the input data.
    In this example, the input sentences in Japanese are duplicated in each of the input columns.