Generating a report file from Talend Studio - 7.3

Data Quality Job and Analysis Examples

Version: 7.3
Language: English
Product: Talend Big Data Platform, Talend Data Fabric, Talend Data Management Platform, Talend Data Services Platform, Talend MDM Platform, Talend Open Studio for Data Quality, Talend Real-Time Big Data Platform
Module: Talend Studio
Content: Data Quality and Preparation
Last publication date: 2023-03-01

Talend DQ Portal is deprecated from Talend 7.1 onwards.

Procedure

  1. In the DQ Repository tree view, right-click the analysis name and select New Report.

    The report editor is displayed with the selected analysis listed in the Analysis List.

  2. In the Analysis List view, select Evolution from the Template type list as the type of report you want to generate.
    In this example, you generate an evolution report that shows how the indicators used on the email and postal columns change over time. The report lets you compare current and historical statistics to determine whether the address data has improved or degraded. This information helps you decide when to intervene and resolve data issues, and thus monitor data quality on an ongoing basis.
  3. Select the Refresh All check box to refresh the listed analysis before generating the report.
  4. In the Generated Report Settings view, select PDF from the File Type list to generate a PDF report file.
  5. In the Database Connection Settings view, set the connection parameters to the data mart where you want to store the report results.
  6. Click the Check button to verify that the connection is successful.
    A message confirms whether the database exists and whether the connection is successful.
  7. If the database structure does not exist, click OK in the message to let Talend Studio create it for you.
  8. Click OK to close the confirmation message.
  9. Save the report and click the report generation icon on the editor toolbar to generate the report file.

Results

A report file is generated and listed under the Reports node in the DQ Repository tree view. The report shows how the simple statistics indicators and the patterns used on the email and postal columns evolve over time.

Below are the results for the email column:

This chart shows that 89.80% of the email addresses are valid right now.
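
To illustrate what a figure such as 89.80% represents, the following is a minimal Python sketch of a pattern-match percentage. It is not how Talend Studio computes the indicator: the regular expression, the sample values, the choice to ignore null values, and the function name `valid_percentage` are all assumptions made for this illustration.

```python
import re

# Hypothetical, deliberately simple email pattern (an assumption, not Talend's).
EMAIL_PATTERN = re.compile(r"^[\w.+-]+@[\w-]+(\.[\w-]+)+$")


def valid_percentage(values):
    """Return the share of non-null values matching EMAIL_PATTERN, in percent."""
    # Assumption: null values are excluded from the denominator.
    candidates = [v for v in values if v is not None]
    if not candidates:
        return 0.0
    matches = sum(1 for v in candidates if EMAIL_PATTERN.match(v))
    return round(100.0 * matches / len(candidates), 2)


# Made-up sample column: two well-formed addresses, two malformed, one null.
sample = ["a@example.com", "b@example.org", "not-an-email", "c@example", None]
print(valid_percentage(sample))
```

In this sketch, two of the four non-null sample values match the pattern, so the function reports 50.0.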

For the simple statistics indicators, there are two charts: the first indicates the change in the statistics and the second indicates the percentage of that change.

Generating this report repeatedly gives a flat line if the data does not change. The line rises when data is fixed and falls when data becomes less accurate and consistent.
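
The relationship between the two charts can be sketched in Python as follows. This is only an illustration under stated assumptions: the function name `evolution` and the sample counts are hypothetical, and the percentage is assumed to be relative to the previous run, which may differ from how the report computes it.

```python
def evolution(history):
    """For consecutive runs, return (absolute change, percentage change) pairs."""
    changes = []
    for previous, current in zip(history, history[1:]):
        delta = current - previous
        # Assumption: percentage change is relative to the previous run;
        # it is left undefined (None) when the previous value is zero.
        percent = None if previous == 0 else round(100.0 * delta / previous, 2)
        changes.append((delta, percent))
    return changes


# Hypothetical counts of valid email values recorded by four successive runs.
valid_counts = [400, 400, 440, 418]
print(evolution(valid_counts))
```

With these made-up counts, the first pair is (0, 0.0), which corresponds to a flat line; the second is (40, 10.0), an upward move after data is fixed; and the third is (-22, -5.0), a downward move as data quality degrades.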

For more information on reports, see Reports in the Talend Studio User Guide.

After generating this report in Talend Studio, business users can access it from Talend DQ Portal.