Importing data profiling items - 7.3

Talend Open Studio User Guide

Version
7.3
Language
English
Product
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Module
Talend Studio
Content
Design and Development
Last publication date
2023-10-11
Available in...

Open Studio for Data Quality

You can import data profiling items including analyses, database connections, patterns, and indicators, etc. into your current Talend Studio from various projects or different versions of Talend Studio.

You cannot import an item without all its dependencies. When you try to import an analysis for example, all its dependencies such as a metadata connection and the patterns and indicators used in this analysis will be selected by default and imported with the analysis.

Before you begin

You have access to the root directory of another Studio version in which data profiling items have been created.

Procedure

  1. In the Profiling perspective, either:
    • Right-click anywhere in the DQ Repository tree view and select Import Items.
    • Click the icon on the toolbar and select Import Items.
    All editors which are open in the Studio are automatically closed. The Import Item wizard is displayed.
  2. Select the root directory or the archive file option according to whether the data profiling items are in the workspace file within the Studio directory or are already exported into a zip file.
    • If you select the root directory option, click Browse and set the path to the project folder containing the items to be imported within the workspace file of the Talend Studio directory.

      All items and their dependencies that do not exist in your current Talend Studio are selected by default in the dialog box.

    • If you select the archive file option, click Browse and set the path to the archive file that holds the data profiling items you want to import.

      All items and their dependencies that do not exist in your current Talend Studio are selected by default in the dialog box.

  3. Select the Overwrite existing items check box if some error and warning messages are listed in the Error and Warning area.
    This means that items with the same names already exist in the current Studio.
    The imported items will replace the existing ones.
    When you import system indicators that are modified in a Studio version, they will not overwrite the indicators in the current Talend Studio. All modifications from older versions will be integrated with the system indicators in the current Talend Studio.
  4. Select or clear the check boxes of the data profiling items you want or do not want to import according to your needs.
    All dependencies for the selected item are selected by default. When you clear the check box of an item, the check boxes of the dependencies of this item are automatically cleared as well. Also, an error message will display on top of the dialog box if you clear the check box of any of the dependencies of the selected item.
  5. Click Finish to validate the operation.
    The imported items display under the corresponding folders in the DQ Repository tree view.
    You can also import local project folders from the login window of your Talend Studio. For more information, see the Getting Started Guide.
  6. Do the following to have every item working correctly:
    1. Run the analyses that have Java as their execution engine.
      This will compute and store locally the results of the indicators used in the analyses.
      You cannot open a list of the indicator results in the Analysis Results view in the current Talend Studio without running the analyses first as data is not imported with them from the old Talend Studio.
    2. Install missing third-party Java libraries or database drivers.
      When you import database connections for the first time, warning red icons may be docked on the connection names. This is because Talend Studio requires specific third-party Java libraries or database drivers (.jar files) to be installed to connect to sources and targets. Those libraries or drivers, known as external modules, can be required by some connection wizards. Due to license restrictions, Talend may not be able to ship certain external modules within Talend Studio.
    3. Set the path for the drivers of the SQL Servers (2005 or 2008).
      If you import SQL Servers (2005 or 2008) connections into your current Talend Studio, a warning red icon is docked on the connection names in the DB connections folder. This indicates that the driver path for these connections is empty. You must open the connection wizard and redefine the connection manually to set the path to a JDBC driver you can download from the Microsoft download center.
      For more information on editing a database connection, see Opening or editing a database connection.