Executing the Job

Procedure

Press F6 to save and execute the Job.

A single representative record is created for each group of duplicates and merged with the unique rows into a single file.

The data set is now clean and deduplicated.
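
To make the result concrete, the following is a minimal Python sketch of the idea behind the output: one record is kept per duplicate group and then combined with the rows that had no duplicates. It is an illustration only, not what the Job's components run internally. The function name deduplicate, the group identifier column GID, the sample data, and the rule of keeping the first row of each group are all assumptions made for this example.

```python
def deduplicate(duplicate_rows, unique_rows, group_key="GID"):
    """Keep one representative per duplicate group, then append the unique rows."""
    representatives = {}
    for row in duplicate_rows:
        # Assumption: the first row seen in each group is kept as its representative.
        representatives.setdefault(row[group_key], row)
    # Dicts preserve insertion order, so groups come out in first-seen order.
    return list(representatives.values()) + list(unique_rows)


# Hypothetical sample data: two duplicate groups and one unique row.
duplicates = [
    {"GID": "g1", "name": "Jon Smith"},
    {"GID": "g1", "name": "John Smith"},
    {"GID": "g2", "name": "Ann Lee"},
    {"GID": "g2", "name": "Anne Lee"},
]
uniques = [{"GID": "g3", "name": "Raj Patel"}]

clean = deduplicate(duplicates, uniques)
print([row["name"] for row in clean])
# prints ['Jon Smith', 'Ann Lee', 'Raj Patel']
```

In a real Job the representative is typically chosen by configurable rules rather than by position, so treat the first-row rule here as a placeholder.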

You can use tMatchIndex to index this reference data set in Elasticsearch for continuous matching purposes.

For an example of how to index a reference data set, see Indexing a reference data set in Elasticsearch.
