Setting up a generic schema from scratch - 6.2

Talend Real-time Big Data Platform Studio User Guide


To create a generic schema from scratch, proceed as follows:

  1. Right-click Generic schemas under the Metadata node in the Repository tree view, and select Create generic schema.

  2. In the schema creation wizard that appears, fill in the generic schema properties, such as Name and Description. The Status field is a custom field; for information about how to define it, see Status settings.

    Click Next to continue.

  3. Name the schema or keep the default name (metadata), and add a comment if needed. Customize the schema structure in the Schema panel according to your needs.

    The toolbar lets you add, remove, or move columns in your schema. You can also export the current schema as an XML file, or replace the current schema by importing one from an XML file. The imported file must be a schema previously exported from the Studio.

    Make sure the data type in the Type column is correctly defined.

    For more information about Java data types, including date patterns, see the Java API Specification.

    Below are the commonly used Talend data types:

    • Object: a generic Talend data type that allows processing data without regard to its content. For example, a data file that is not otherwise supported can be processed with a tFileInputRaw component by specifying a data type of Object.

    • List: a space-separated list of primitive type elements in an XML Schema definition, defined using the xsd:list element.

    • Dynamic: a data type that can be set for a single column at the end of a schema to allow processing fields as VARCHAR(100) columns. The columns are named either 'Column<X>' or, if the input includes a header, after the column names that appear in the header. For more information, see Dynamic schema.

    • Document: a data type that allows processing an entire XML document without regard to its content.

  4. Click Finish to complete the generic schema creation. The created schema is displayed under the relevant Generic schemas node.
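
Step 3 points to the Java API Specification for date patterns. As a rough illustration, the sketch below shows how such a pattern behaves in plain Java, assuming the pattern entered for a Date column follows java.text.SimpleDateFormat syntax. The class name DatePatternSketch, its methods, and the sample values are hypothetical and are not part of the Studio.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;

// Hypothetical helper for trying out a date pattern outside the Studio.
public class DatePatternSketch {

    // Parses text with the given pattern. Strict (non-lenient) mode rejects
    // out-of-range values such as month 13 instead of rolling them over.
    public static Date parse(String text, String pattern) {
        SimpleDateFormat format = new SimpleDateFormat(pattern, Locale.ROOT);
        format.setLenient(false);
        try {
            return format.parse(text);
        } catch (ParseException e) {
            throw new IllegalArgumentException(
                "'" + text + "' does not match pattern " + pattern, e);
        }
    }

    // Renders a date back to text with the given pattern.
    public static String format(Date date, String pattern) {
        return new SimpleDateFormat(pattern, Locale.ROOT).format(date);
    }

    // Returns true when the text can be parsed with the pattern.
    public static boolean matches(String text, String pattern) {
        try {
            parse(text, pattern);
            return true;
        } catch (IllegalArgumentException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Sample value and patterns chosen for illustration only.
        Date date = parse("31-12-2016", "dd-MM-yyyy");
        System.out.println(format(date, "yyyy/MM/dd"));             // prints 2016/12/31
        System.out.println(matches("31-13-2016", "dd-MM-yyyy"));    // prints false
    }
}
```

Checking a pattern this way before entering it in the schema can catch mismatches between the pattern and the incoming data, such as swapped day and month fields.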