Configuring the input component - 7.3

Natural Language Processing

Version
7.3
Language
English
Product
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Data Governance > Third-party systems > Natural Language Processing
Data Quality and Preparation > Third-party systems > Natural Language Processing
Design and Development > Third-party systems > Natural Language Processing
The tokenized text in CoNLL format to be labeled is stored in a MySQL database.

Procedure

  1. Double-click the tMysqlInput component to open its Basic settings view and define its properties.
  2. From the Property Type list, select Repository if you have already stored the connection to database in the Metadata node of the Repository tree view. The property fields that follow are automatically filled in.
  3. Set the Schema as Built-in and click Edit schema to define the desired schema.

    In this example, three columns are added to the schema: text, tokens and labels.

  4. Set the Query Type as Built-In. In the Query box, enter the query required to retrieve the desired columns from the table.