Launching advanced reports - 7.0

Talend Data Quality Portal User and Administrator Guide

Additional and more advanced reports can be launched only from Talend Data Quality Portal and not from Talend Studio. These reports include integrity reports based on the list of orphan tables and lists of tables without primary or foreign keys.

The run date and time shown in the advanced reports are displayed in the time zone of the Portal server.

Potential Primary Keys

These reports analyze the data in the datamart using the following simple statistics indicators: row count, distinct count, unique count and duplicate count. They check which columns could be used as Primary Keys.
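
To make the four indicators concrete, here is a minimal Python sketch. It is not the Portal's implementation. It assumes that the distinct count is the number of different values, the unique count is the number of values that occur exactly once, and the duplicate count is the number of values that occur more than once. It also assumes that a column is a candidate Primary Key when every row holds a different, non-null value. The function names are hypothetical.

```python
from collections import Counter


def simple_statistics(values):
    """Compute the four simple statistics indicators for one column."""
    counts = Counter(values)
    return {
        "row_count": len(values),
        # Number of different values in the column.
        "distinct_count": len(counts),
        # Number of values that occur exactly once (assumed definition).
        "unique_count": sum(1 for c in counts.values() if c == 1),
        # Number of values that occur more than once (assumed definition).
        "duplicate_count": sum(1 for c in counts.values() if c > 1),
    }


def is_potential_primary_key(values):
    """Assumed criterion: no nulls, and every row holds a different value."""
    stats = simple_statistics(values)
    return (
        stats["row_count"] > 0
        and None not in values
        and stats["unique_count"] == stats["row_count"]
    )
```

With these assumptions, a column holding `["a", "b", "b", "c"]` has a row count of 4, a distinct count of 3, a unique count of 2 and a duplicate count of 1, so it is not a candidate.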

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated in the Profiling perspective of Talend Studio on a column analysis that uses the simple statistics indicators.

To launch a report in order to locate the columns that can be used as Primary Keys, do the following:

  1. From the user interface, click the icon, point to Reports > Integrity reports and then click Potential PK.

    The corresponding page opens.

  2. Click in the Header field and select YES if you want to insert a logo in the report to launch.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

  3. Click the Report explore icon to display a dialog box that lists some of the reports generated in the Profiling perspective of Talend Studio.

  4. Select the report you want to analyze in order to check for any columns that can be used as Primary Keys.

  5. With the report selected, click Confirm at the bottom right corner of the dialog box.

    The name of the selected report appears in the Report field.

  6. Click in the top right corner of the page.

    A loading indicator is displayed and then a POTENTIAL PRIMARY KEYS report opens in the page.

    In this example, the POTENTIAL PRIMARY KEYS report is launched on the column_client report already generated in the Profiling perspective of Talend Studio. The results show that one column in the client table can be used as a Primary Key.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.

Orphan tables

These reports analyze the data in the datamart to check for tables that exist in isolation, that is, tables that are not referred to by other tables. They use overview analyses (connection and catalog analyses) to compute the results.
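
The definition above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions, not the way the Portal computes the report: the relationships are given as hypothetical (referencing table, referenced table) pairs, a table that refers only to itself is still treated as isolated, and the function and table names are invented for the example.

```python
def find_orphan_tables(tables, foreign_keys):
    """Return the tables that no other table refers to.

    foreign_keys is an iterable of (referencing_table, referenced_table)
    pairs. Self-references are ignored because the definition only
    considers references coming from other tables.
    """
    referenced = {
        target for source, target in foreign_keys if source != target
    }
    return sorted(table for table in tables if table not in referenced)


def demo_orphans():
    # Hypothetical schema used only for illustration.
    tables = ["customer", "orders", "order_line", "audit_log"]
    foreign_keys = [("orders", "customer"), ("order_line", "orders")]
    return find_orphan_tables(tables, foreign_keys)
```

In this hypothetical schema, `demo_orphans()` returns `audit_log` and `order_line`: the first has no relationships at all, and the second refers to another table but is not itself referred to.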

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated on a connection or catalog analysis in the Profiling perspective of Talend Studio.

To launch a report in order to locate isolated tables, do the following:

  1. From the web user interface, click the icon, point to Reports > Integrity reports and then click Orphan Table.

    The corresponding page opens.

  2. Click in the Header field and select YES if you want to insert a logo in the report to launch.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

  3. Click the Report explore icon to display a dialog box that lists some of the reports generated in the Profiling perspective of Talend Studio.

  4. From the Report list, select the overview report you want to analyze in order to check for any isolated table that is not referred to by any other table.

  5. With the report selected, click Confirm at the bottom right corner of the dialog box.

    The name of the selected report appears in the Report field.

  6. Click Execute at the top of the Parameters panel.

    A loading indicator is displayed and then a LIST OF ORPHAN report opens.

    In this example, the LIST OF ORPHAN report is launched on a database content analysis. The results show how many tables in each schema in the database are not referred to by other tables. For example, in the cif catalog, the country and customer tables are not referred to by other tables.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.

Tables without Primary Keys

These reports analyze the data in the datamart to check for tables that do not have Primary Keys. They use the overview analyses (connection and catalog analyses) to compute the results.
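
The Portal derives this list from the overview analyses stored in the datamart. Purely to illustrate the underlying check, the following Python sketch runs a comparable test directly against a database, using an in-memory SQLite database as a stand-in. The table names and helper functions are hypothetical, and nothing here reflects how the Portal queries the datamart.

```python
import sqlite3


def tables_without_primary_key(conn):
    """List user tables in which no column is part of a primary key."""
    names = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
            "ORDER BY name"
        )
    ]
    result = []
    for name in names:
        # Each row is (cid, name, type, notnull, dflt_value, pk);
        # pk is 0 for columns that are not part of the primary key.
        columns = conn.execute(f'PRAGMA table_info("{name}")').fetchall()
        if not any(column[5] for column in columns):
            result.append(name)
    return result


def build_demo_database():
    # Hypothetical tables used only for illustration.
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE department (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE staging_rows (payload TEXT);
        """
    )
    return conn
```

Calling `tables_without_primary_key(build_demo_database())` reports only `staging_rows`, the one table declared without a key.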

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated on a connection or catalog analysis in the Profiling perspective of Talend Studio.

To launch a report in order to locate the tables that do not have Primary Keys, do the following:

  1. From the web user interface, click the icon, point to Reports > Integrity reports, and then click Tables without PK.

    The corresponding page opens.

  2. Click in the Header field and select YES if you want to insert a logo in the report to launch.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

  3. Click the Report explore icon to display a dialog box that lists some of the reports generated in the Profiling perspective of Talend Studio.

  4. From the Report list, select the overview report you want to analyze in order to check for any tables that do not have Primary Keys.

  5. Click Confirm at the bottom right corner of the dialog box.

    The name of the selected report appears in the Report field.

  6. Click Execute at the top of the Parameters panel.

    A loading indicator is displayed and then a TABLES WITHOUT PK report opens.

    In this example, the TABLES WITHOUT PK report is launched on a database content analysis. The results show how many tables in each schema in the database do not have Primary Keys. For example, in the employee catalog, the employee, petalsemployeeoutput and test tables do not have Primary Keys.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.

Tables without Foreign Keys

These reports analyze the data in the datamart to check for tables that do not have Foreign Keys. They use the overview analyses (connection and catalog analyses) to compute the results.
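
As an illustration of the check itself, and not of the Portal's own logic, this Python sketch lists the tables that declare no foreign key constraint. It uses an in-memory SQLite database as a stand-in for the analyzed database. The table names and helper functions are hypothetical.

```python
import sqlite3


def tables_without_foreign_key(conn):
    """List user tables that declare no foreign key constraint."""
    names = [
        row[0]
        for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
            "ORDER BY name"
        )
    ]
    # PRAGMA foreign_key_list returns one row per declared foreign key
    # column, so an empty result means the table has no foreign keys.
    return [
        name
        for name in names
        if not conn.execute(f'PRAGMA foreign_key_list("{name}")').fetchall()
    ]


def build_demo_database():
    # Hypothetical tables used only for illustration.
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE region (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE store (
            id INTEGER PRIMARY KEY,
            region_id INTEGER REFERENCES region(id)
        );
        """
    )
    return conn
```

Here `tables_without_foreign_key(build_demo_database())` reports only `region`, because `store` declares a reference to it.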

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated on a connection or catalog analysis in the Profiling perspective of Talend Studio.

To launch a report in order to list the tables that do not have Foreign Keys, do the following:

  1. From the web user interface, click the icon, point to Reports > Integrity reports and then click Tables without FK.

    The corresponding page opens.

  2. Click in the Header field and select YES if you want to insert a logo in the report to launch.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

  3. Click the Report explore icon to display a dialog box that lists some of the reports generated in the Profiling perspective of Talend Studio.

  4. From the Report list, select the overview report you want to analyze in order to check for any tables that do not have Foreign Keys.

  5. Click Confirm at the bottom right corner of the dialog box.

    The name of the selected report appears in the Report field.

  6. Click Execute at the top of the Parameters panel.

    A loading indicator is displayed and then a TABLES WITHOUT FK report on the selected report opens in the page.

    In this example, the TABLES WITHOUT FK report is launched on a database content analysis. The results show how many tables in each schema in the database do not have Foreign Keys. For example, in the crm catalog, the contact_tp and cust tables do not have Foreign Keys.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.

Column size analysis

This report analyzes the data in the datamart relative to a given database to check the data storage requirements of specific columns. It highlights the columns that have the largest difference between the defined column size and the actual maximum length of the data. This helps the administrator tune the database server for better performance by making sure that physical storage space is not wasted in any of the analyzed columns.

Warning

This report needs column analyses that use the text statistics indicators, mainly Minimal Length, Maximal Length and Average Length. You can generate this type of report only from Talend Data Quality Portal; you cannot generate it from the Profiling perspective of Talend Studio.

Prerequisite(s):

  • You have accessed Talend Data Quality Portal as a user.

  • At least one report has been generated on a column analysis in the Profiling perspective of Talend Studio. The column analysis must use the text statistics indicators, mainly Minimal Length, Maximal Length and Average Length.

To launch a report in order to analyze column size in a specific database, do the following:

  1. From the user interface, click the icon, point to Reports > Integrity reports and then click Column size analysis.

    The corresponding page opens.

  2. Click in the Header field and select YES if you want to insert a logo in the report to launch.

    The default logo file is a Talend logo, but you can decide to use a logo of your choice. For further information, see Customizing logos in reports.

  3. Click the CONNECTION explore icon to display a dialog box that lists the database connections created in the Profiling perspective of Talend Studio.

  4. From the CONNECTION list, select the database connection used for the column analyses carried out in the Profiling perspective of Talend Studio.

  5. Click Confirm at the bottom right corner of the dialog box.

    The name of the selected connection is displayed in the CONNECTION field.

  6. Click Execute at the top of the Parameters panel.

    A loading indicator is displayed and then a report opens in the page. It covers all the column analyses that use the selected database connection.

    In this example, three reports were initially generated on three column analyses in the Profiling perspective of Talend Studio. The report generated from Talend Data Quality Portal on the selected database connection gives the following information about all the columns analyzed in the three analyses:

    Column label      Description
    Column            Names of the analyzed columns.
    Distance          Result of subtracting the Max Length from the column size. This gives information about the actual storage space used in the column.
    Column size       Data length defined for the column in the database.
    Min Length        Minimal length of the text in the column.
    Average Length    Average length of the text in the column.
    Max Length        Maximal length of the text in the column.

    The results shown in the report help the administrator reduce the physical storage requirements (column size) of certain columns and so save space in these columns. Tuning column storage in this way reduces the physical storage used by the table and the overall database size.

  7. In the top right corner of the page, click to save the report parameters.

    You can run a saved report without redefining its parameters. For further information, see Accessing the list of defined reports.
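
The Distance value described in the report columns above is a simple subtraction, which the following Python sketch reproduces. It is an illustration only: the class, the function names and the sample figures are hypothetical, and sorting by largest distance first is an assumption based on the statement that the report highlights the columns with the largest difference.

```python
from dataclasses import dataclass


@dataclass
class ColumnLengths:
    column: str            # name of the analyzed column
    column_size: int       # data length defined in the database
    min_length: int
    average_length: float
    max_length: int

    @property
    def distance(self):
        # Defined column size minus the longest value actually stored.
        return self.column_size - self.max_length


def rank_by_distance(rows):
    """Order columns so the largest distance comes first."""
    return sorted(rows, key=lambda row: row.distance, reverse=True)


def demo_rows():
    # Hypothetical figures used only for illustration.
    return [
        ColumnLengths("zip", 10, 5, 5.0, 10),
        ColumnLengths("city", 100, 3, 8.5, 20),
        ColumnLengths("comment", 255, 0, 40.2, 250),
    ]
```

With these sample figures, `rank_by_distance(demo_rows())` puts `city` first (distance 80), then `comment` (distance 5), then `zip` (distance 0), so `city` would be the first candidate for a smaller column size.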