Configuring the input component - 7.0

Natural Language Processing

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Natural Language Processing
Data Quality and Preparation > Third-party systems > Natural Language Processing
Design and Development > Third-party systems > Natural Language Processing
EnrichPlatform
Talend Studio
The tokenized text in CoNLL format to be labeled is stored in a MySQL database.

Procedure

  1. Double click the tMysqlInput component to open its Basic settings view and define its properties.
  2. From the Property Type list, select Repository if you have already stored the connection to database in the Metadata node of the Repository tree view. The property fields that follow are automatically filled in.
  3. Set the Schema as Built-in and click Edit schema to define the desired schema.

    In this example, three columns are added to the schema: text, tokens and labels.

  4. Set the Query Type as Built-In. In the Query box, enter the query required to retrieve the desired columns from the table.