Configuring the tMatchIndexPredict component - 7.0

Continuous matching

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Data Quality components > Matching components > Continuous matching components
Data Quality and Preparation > Third-party systems > Data Quality components > Matching components > Continuous matching components
Design and Development > Third-party systems > Data Quality components > Matching components > Continuous matching components
EnrichPlatform
Talend Data Stewardship
Talend Studio

Procedure

  1. Double-click the tMatchIndexPredict component to open its Basic settings view.
  2. In the ElasticSearch configuration area, enter the location of the cluster hosting the Elasticsearch system to be used in the Nodes field, for example:

    "localhost:9200"

  3. In the ElasticSearch configuration area, enter the name of the Elasticsearch index where the reference data is stored in the Index field, for example:

    "education-agencies-chicago"

  4. In the Models area, set the information about the pairing and matching models:
    1. Set the path to the folder containing the model files generated by the tMatchPairing component in the Pairing model folder field.
    2. Select from the Matching model location list where to get the model file generated by the tMatchModel component.

      In this example, select from file system because the classification Job using the tMatchModel component is not integrated to the current Job.

    3. Set the path to the folder containing the model file generated by the tMatchModel component in the Matching model folder field.
    4. Set the label used for the unique records output in the No-match label field.