This column analysis uses out-of-box indicators to provide simple statistics such as row, blank and duplicate counts on the Email and Phone columns.
Before you begin
You have opened the Profiling perspective in the Studio.
You have created a column analysis and defined the connection to the database.
In the Data
Preview section in the analysis editor, click Select indicators to open the
Indicator Selection dialog
Expand Simple Statistics and select Row
Count, Blank Count and
Duplicate Count. Click OK to close the wizard.
You want to see the row, blank and duplicate counts in the Email and Phone columns to see how consistent the data is.
Indicators are added accordingly to the columns in the Analyzed Columns section.
Click the icon next to the Duplicate Count and Blank Count indicator and set
0 in the Upper threshold field.
Defining thresholds on the Email and Phone columns is very helpful as it will write in red the count of the duplicate and blank values in the analysis results.