Retrieving table schemas

Talend Open Studio for Data Integration User Guide

EnrichVersion
6.5
EnrichProdName
Talend Open Studio for Data Integration
task
Design and Development
EnrichPlatform
Talend Studio
  1. To retrieve table schemas from the database connection you have just set up, right-click the connection item from the Repository tree view and select Retrieve schema from the contextual menu.

    A new wizard opens up where you can filter and show different objects (tables, views and synonyms) in your database connection, select tables of interest, and define table schemas.

  2. Define a filter to filter databases objects according to your need. For details, see Filtering database objects.

    Click Next to open a view that lists your filtered database objects. The list offers all the databases with all their tables present on the database connection that meet you filter conditions.

    If no database is visible on the list, click Check connection to verify the database connection.

  3. Select one or more tables on the list to load them onto your repository file system. Your repository schemas will be based on these tables.

  4. Click Next. On the next window, four setting panels help you define the schemas to create. Modify the schemas if needed.

    Make sure the data type in the Type column is correctly defined.

    For more information regarding Java data types, including date pattern, see Java API Specification.

    Below are the commonly used Talend data types:

    • Object: a generic Talend data type that allows processing data without regard to its content, for example, a data file not otherwise supported can be processed with a tFileInputRaw component by specifying that it has a data type of Object.

    • List: a space-separated list of primitive type elements in an XML Schema definition, defined using the xsd:list element.

    • Document: a data type that allows processing an entire XML document without regarding to its content.

    Warning

    If your source database table contains any default value that is a function or an expression rather than a string, be sure to remove the single quotation marks, if any, enclosing the default value in the end schema to avoid unexpected results when creating database tables using this schema.

    For more information, see Verifying default values in a retrieved schema at https://help.talend.com.

    By default, the schema displayed on the Schema panel is based on the first table selected in the list of schemas loaded (left panel). You can change the name of the schema and according to your needs, you can also customize the schema structure in the schema panel.

    The tool bar allows you to add, remove or move columns in your schema. In addition, you can load an XML schema from a file or export the current schema as XML.

    To retrieve a schema based on one of the loaded table schemas, select the database table schema name in the drop-down list and click Retrieve schema. Note that the retrieved schema then overwrites any current schema and does not retain any custom edits.

    When done, click Finish to complete the database schema creation. All the retrieved schemas are displayed in the Table schemas sub-folder under the relevant database connection node.

    Now you can drag and drop any table schema of the database connection from the Repository tree view onto the design workspace as a new database component or onto an existing component to reuse the metadata. For more information, see How to use centralized metadata in a Job and How to set a repository schema in a Job.