Finding and grouping similar text can be used to harmonize content with only
small variations.
Note: The Find and group similar text
function does not support Asian
characters.
In the customers.xlsx file, there is information about the
occupation of your clients. Some of the values are closely similar to each other, for
example College/Grad Student and College
Student. A way to improve the readability, and thus the quality of your
data, would be to regroup some of these values together.
To find and group similar content, proceed as follows:
Procedure
-
Click the header of the Occupation column to select its
content.
You can confirm in the statistics box that there are occurrences of job
titles that only slightly differ.
-
In the functions list, select Find and group similar
text....
The Find and group similar text menu opens.
All similar occupations are grouped together in the second column. In this
case, College/Grad Student and College
Student. The third column suggests an occupation title that
could replace the values in the second column. You can choose another value
from the drop-down list, or type a whole new one. Clear the check boxes in
front of the values or groups of values you want to leave unchanged.
-
In the drop-down list of the third column, select College
Student.
-
Click Submit.
Results
All the occurrences of
College/Grad Student and
College Student have been regrouped under
College Student, the new harmonized value.