Unstructured text - 6.5

Talend Open Studio for Data Quality User Guide

Talend Open Studio for Data Quality
Data Quality and Preparation
Talend Studio

This is a new data mining type introduced by the studio. This data mining type is dedicated to handle unstructured textual data.

For example, the data mining type of a column called COMMENT that contains commentary text can not be Nominal, since the text in it is unstructured. Still, we could be interested in seeing the duplicate values of such a column and here comes the need for such a new data mining type.