Selecting the primary and foreign keys - Cloud - 7.3

Talend Studio User Guide

Version
Cloud
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-13
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

Procedure

  1. Click Analyzed Column Sets to display the corresponding view.
    In this example, you want to match the foreign keys in the customer_id column of the sales_fact_1998 table with the primary keys in the customer_id column of the customer table, and vice versa. This will explore the relationship between the two tables to show us for example if every customer has an order in the year 1998.
  2. From the Connection list, select the database connection relevant to the database to which you want to connect.
    You have in this list all the connections you create and centralize in the Talend Studio repository.
  3. Click A Column Set to open the Column Selection dialog box.
    If you want to check the validity of the foreign keys, select the column holding the foreign keys for the A set and the column holding the primary keys for the B set.
  4. Browse the catalogs/schemas in your database connection to reach the table holding the column you want to match.
    In this example, the column to be analyzed is customer_id that holds the foreign keys.
    You can filter the table or column lists by typing the desired text in the Table filter or Column filter fields respectively. The lists will show only the tables/columns that correspond to the text you type in.
  5. Click the table name to display all its columns in the right-hand panel of the Column Selection dialog box.
  6. In the list to the right, select the check box of the column holding the foreign keys and then click OK to proceed to the next step.
    You can drag the columns to be analyzed directly from the DQ Repository tree view to the editor.
    If you right-click any of the listed columns in the Analyzed Columns view and select Show in DQ Repository view, the selected column will be automatically located under the corresponding connection in the tree view.
  7. Click B Column Set and follow the same steps to select the column holding the primary keys or drag it from the DQ Repository to the right column panel.
    If you select the Compute only number of rows not in B check box, you will look for any missing primary keys in the column in the B set.
  8. Click Data Filter in the analysis editor to display the view where you can set a filter on each of the analyzed columns.
  9. Press F6 to execute this key-matching analysis.
    A confirmation message is displayed.
  10. Click OK in the message if you want to continue the operation.
    The execution of this type of analysis may takes some time. Wait till the Analysis Results view opens automatically showing the analysis results.

Results

In this example, every foreign key in the sales_fact_1998 table is identified with a primary key in the customer table. However, 98.22% of the primary keys in the customer table could not be identified with foreign keys in the sales_fact_1998 table. These primary keys are for the customers who did not order anything in 1998.