After profiling the email and zip code columns and getting the detail results about the structure and consistency of the address data, you need to share these results with other business users.
You must first generate a report file on the analysis results from the studio and save the report in a data quality data mart. Business users can then access the report from Talend Data Quality Portal , which is a web-based platform that shares analysis results generated from the studio and saved in the data quality data mart.
In the Profiling perspective of the studio:
In the DQ Repository tree view, right-click the analysis name and select New Report.
The report editor is displayed with the selected analysis listed in the Analysis List.
In the Analysis list view and from the Template type list, select Evolution as the type for the report you want to generate.
In this example, you want to generate an evolution report which provides information showing the evolution through time of the indicators used on the email and postal columns. This report allows you to compare current and historical statistics to determine the improvement or degradation of the address data. Such information is vital to decide to intervene and resolve data at the right time and thus monitor the quality of data on an on-going basis.
Select the Refresh All check box to refresh the listed analysis before generating the report.
In the Generated Report Settings view and from the File Type list, select to generate a pdf report file.
In the Database Connection Settings view, set the connection parameters to the data mart where you want to store the report results.
Click the Check button to verify if your connection is successful.
A message confirms if the database exists and if the connection is successful.
If the database structure does not exist, click OK in the message to let the studio creates it for you.
Click OK to close the confirmation message.
Save the report and click on the editor toolbar to generate the report file.
A report file is generate and listed under the Reports node in the DQ Repository tree view. The report shows the evolution through time of the simple statistics indicators and the patterns used on the email and postal columns.
Below are the results of the email column:
This chart shows that 89.80% of the email addresses are valid right now.
For the simple statistics indicators, there are two charts: the first indicates the change in the statistics and the second indicates the percentage of that change.
Generating this report repeatedly will give a flat line if there is no change in data. The line will start to go upwards if data is fixed and downwards if data gets less accurate and consistent.
For further information on reports, see the Reports chapter in Talend Studio User Guide.
After generating this report in the studio, business users can access it from Talend Data Quality Portal.
To generate the evolution report from the Portal:
Access Talend Data Quality Portal using
tdq_useras username and
Click the User menu and slide the cursor on Reports > Column Report > Column Evolution.
Click the Report explore icon.
A dialog box opens to list all evolution reports generated on column analyses in the Profiling perspective. This list shows first the name of the report and then the name of the column analysis.
Select the check box of the evolution report you want to generate and then click Confirm at the bottom right corner of the dialog box.
Click Execute at the top of the Parameters panel.
A loading indicator is displayed and then the report is open in the page.
You will have in the Portal the same profiling results you generated from the studio:
For further information on the Portal, see the Talend Data Quality Portal User and Administrator Guide.