A single representation for each duplicates group is created and merged with the unique rows in a single file.
The data set is now clean and deduplicated.
What to do next
You can use tMatchIndex to index this reference data set in Elasticsearch for continuous matching purposes.
For an example of how to index a reference data set, see Indexing a reference data set in Elasticsearch.