Creating a catalog or schema analysis - 6.2

Talend Data Fabric Studio User Guide

You can use the Profiling perspective of the studio to analyze a specific catalog or schema in a database, provided that the database uses this entity in its physical structure. The analysis results give analytical information about the content of the catalog or schema, for example the number of tables, the total number of rows and the number of rows per table.
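To make the kind of information an overview analysis reports more concrete, the following is a minimal sketch that computes similar counts by hand. It is not how the studio computes its results: it assumes an in-memory SQLite database as a stand-in for a real connection, and the function name `schema_overview` and the sample tables are hypothetical.

```python
import sqlite3

def schema_overview(conn):
    """Return (table_count, view_count, rows_per_table) for one SQLite database."""
    cur = conn.cursor()
    # sqlite_master lists every table and view defined in the database.
    cur.execute(
        "SELECT name, type FROM sqlite_master "
        "WHERE type IN ('table', 'view') ORDER BY name"
    )
    # Skip SQLite's own internal tables, whose names start with "sqlite_".
    objects = [(n, t) for n, t in cur.fetchall() if not n.startswith("sqlite_")]
    tables = [n for n, t in objects if t == "table"]
    views = [n for n, t in objects if t == "view"]

    rows_per_table = {}
    for name in tables:
        # Identifiers cannot be bound as parameters, so quote the name instead.
        quoted = '"' + name.replace('"', '""') + '"'
        rows_per_table[name] = cur.execute(
            f"SELECT COUNT(*) FROM {quoted}"
        ).fetchone()[0]
    return len(tables), len(views), rows_per_table

# Hypothetical sample data standing in for a real schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER);
    CREATE VIEW customer_names AS SELECT name FROM customer;
    INSERT INTO customer (name) VALUES ('Ada'), ('Grace');
    INSERT INTO orders (customer_id) VALUES (1), (1), (2);
""")
overview = schema_overview(conn)
print(overview)
```

On a real database, the equivalent counts would come from that database's own catalog views rather than from `sqlite_master`.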

Prerequisite(s): At least one database connection has been created to connect to a database that uses the "catalog" or "schema" entity. For further information, see Connecting to a database.

  1. Under DB connections in the DQ Repository tree view, right-click the catalog or schema whose content you want to analyze and select Overview analysis from the contextual menu.

    This example shows how to create a schema analysis.

  2. In the wizard that opens, enter a name for the current analysis.

    Note

    Avoid using special characters in the item names including:

    "~", "!", "`", "#", "^", "&", "*", "\\", "/", "?", ":", ";", "\"", ".", "(", ")", "'", "¥", "‘", "”", "«", "»", "<", ">".

    These characters are all replaced with "_" in the file system, so item names that differ only by these characters may end up as duplicate items.

  3. If required, set the analysis metadata (purpose, description and author name) in the corresponding fields and click Next.

  4. If needed, use SQL syntax in the corresponding fields to set filters on the tables and/or views you want to analyze.

    By default, the analysis examines all tables and views in the catalog or schema.

  5. Click Finish.

    A folder for the newly created analysis is listed under Analysis in the DQ Repository tree view, and the analysis editor opens with the defined metadata.

  6. Press F6 to execute the analysis.

    A message appears at the bottom of the editor to confirm that the operation is in progress, and the analysis results then open in the Analysis Results view.
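The note in step 2 about special characters can be illustrated with a short sketch. It assumes a simple one-to-one replacement of each listed character with "_"; the exact mechanism the studio uses is not described here, the character set below is only a subset of the list in the note, and the names `stored_name` and `find_collisions` are hypothetical.

```python
from collections import defaultdict

# Subset of the characters listed in the note (assumption: each one is
# replaced individually by an underscore).
SPECIAL_CHARS = set('~!`#^&*\\/?:;".()\'¥«»<>')

def stored_name(item_name):
    """Return the name as it would look after special characters become '_'."""
    return "".join("_" if ch in SPECIAL_CHARS else ch for ch in item_name)

def find_collisions(names):
    """Group item names that map to the same stored name."""
    by_stored = defaultdict(list)
    for name in names:
        by_stored[stored_name(name)].append(name)
    # Keep only stored names shared by more than one original name.
    return {s: originals for s, originals in by_stored.items() if len(originals) > 1}

collisions = find_collisions(["schema:crm", "schema?crm", "schema_crm", "orders"])
print(collisions)
```

In this sketch, three distinct analysis names collapse to the same stored name, which is the duplicate-item situation the note warns about.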

From the Statistical information view, you can:

  • Click the schema to list all tables included in it along with a summary of their content: number of rows, keys and user-defined indexes.

    The selected schema is highlighted in blue. Schemas highlighted in red indicate potential problems in the data.

  • Right-click a schema and select Overview analysis to analyze the content of the selected item.

  • Right-click a table or a view and select Table analysis to create a table analysis on the selected item. You can also view the keys and indexes of a selected table. For further information, see Displaying keys and indexes of database tables.

  • Click any column header in the analytical table to sort the listed data alphabetically.
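The per-table summary described above (rows, keys and user-defined indexes, sortable by column) can be approximated with a small sketch. It again assumes SQLite as a stand-in database and is not the studio's implementation; `table_summary` and the sample tables are hypothetical, and "keys" is counted here as the number of primary key columns, which may differ from how the studio counts keys.

```python
import sqlite3

def table_summary(conn, table):
    """Return row, key-column and user-defined index counts for one table."""
    # Quote the identifier, since table names cannot be bound as parameters.
    quoted = '"' + table.replace('"', '""') + '"'
    rows = conn.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()[0]
    # table_info rows: (cid, name, type, notnull, dflt_value, pk); pk > 0 marks key columns.
    keys = sum(1 for col in conn.execute(f"PRAGMA table_info({quoted})") if col[5] > 0)
    # index_list rows: (seq, name, unique, origin, partial); origin 'c' means CREATE INDEX.
    indexes = sum(1 for idx in conn.execute(f"PRAGMA index_list({quoted})") if idx[3] == "c")
    return {"table": table, "rows": rows, "keys": keys, "indexes": indexes}

# Hypothetical sample data standing in for a real schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE INDEX idx_customer_name ON customer (name);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER);
    INSERT INTO customer (name) VALUES ('Ada'), ('Grace');
    INSERT INTO orders (customer_id) VALUES (1), (1), (2);
""")
summaries = [table_summary(conn, t) for t in ("orders", "customer")]
# Sorting by a chosen column mirrors clicking a column header in the view.
by_name = sorted(summaries, key=lambda s: s["table"])
for row in by_name:
    print(row)
```

Changing the sort key to `"rows"` or `"indexes"` orders the same summary by that column instead.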