Launching basic or evolution reports - 6.3

Talend Data Quality Portal User and Administrator Guide

EnrichVersion
6.3
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance
Data Quality and Preparation
EnrichPlatform
Talend DQ Portal

Basic and evolution reports are report documents generated on the results of the different analysis types created in the Profiling perspective of Talend Studio. These reports include column reports, overview reports, table reports or redundancy reports.

Basic reports provide the statistics collected by given analyses. Evolution reports provides information showing the evolution through time of the statistics collected by the indicators used on given analyses. Evolution reports allow you to compare current and historical statistics to determine the improvement or degradation of the analyzed data.

From the Repors menu in the web user interface, you can:

  • Use the item - subitem combination to access the data quality information and statistics generated by the reports executed in the Studio and which results are stored in the data quality datamart.

  • Launch basic or evolution reports on all analysis types directly from the web user interface.

  • Insert a header logo in any of the launched reports in order to customize these reports to comply with the corporate graphical guidelines.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

The reports you generate from the Portal reuse the statistical information stored in the database and resulted from the reports generated in the Profiling perspective of Talend Studio but they are not an exact copy of these reports. While a report in the Studio can combine the results of different types of analyses in the same document, reports in the Portal are generated individually from different pages according to each analysis type and according to whether the report is a basic report or an evolution one.

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated on an analysis in the Profiling perspective ofTalend Studio.

The following procedure describes an example on how to generate a report in the Portal on a column analysis that analyzes the email, fullname, and total_sales columns. Simple statistics indicators are used to analyze the first two columns; advanced statistics, pattern frequency statistics and soundex frequency statistics are used to analyze the email column and finally the Benford Law Frequency indicator is used to detect fraudulent data in the total_sales column.

To launch a standard report from Talend Data Quality Portal, do the following:

  1. Log in to Talend Data Quality Portal using the user authentication information.

  2. Click the icon, point to Reports > Column Report and click Column Basic.

    A page opens and a form is displayed to the right of the page.

  3. Click in the Header field and select YES if you want to insert a logo in the report to launch.

  4. Click the Report explore icon.

    A dialog box opens to list all the reports generated on column analyses in the Profiling perspective of Talend Studio. This list shows first the name of the report followed by the name of the column analysis.

  5. Select the column analysis on which you want to generate a report and then click Confirm at the bottom right corner of the dialog box.

  6. Click Execute.

    A loading indicator is displayed and then the report opens in the page. Aggregated data quality information and statistics show in the form of charts and matrices.

    The simple statistics results are represented as the following for the fullname column:

    And as the following for the email column:

    The advanced statistics results are represented as the following for the email column:

    The Benford's law frequency statistics are represented as the following for the total_sales column:

    The run date and time information in the basic and evolution reports is displayed in the server zone of the Portal.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters, for further information, see Accessing the list of defined reports.

Proceed similarly to generate reports on any other analysis type including Overview analysis, Table analysis and Redundancy analysis.

Below is one example of the results you can get in the Portal by launching a basic Overview Report:

For detailed information about the analysis types used to profile data, check the data profiling part in the Talend Studio User Guide.