Retrieving a file schema - Cloud - 7.3

Talend Studio User Guide

Version
Cloud
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-13
Available in...

Big Data

Big Data Platform

Cloud Big Data

Cloud Big Data Platform

Cloud Data Fabric

Data Fabric

Real-Time Big Data Platform

Procedure

  1. When you click Retrieve Schema, a new wizard opens up where you can filter and display different objects in the HDFS.
    Note: You can retrieve schema from CSV, Avro and Sequence files.
  2. In the Name filter field, you can enter the name of the file(s) you are looking for to filter it/them.
    Otherwise, you can expand the folders listed in this wizard by selecting the check box before them. Then, select the file(s) of which you need to retrieve the schema(s)
    Each time when the schema retrieval is done for a file selected, the Creation status of this file becomes Success.
  3. Click Next to open a new view on the wizard that lists the selected file schema(s). You can select any of them to display its details in the Schema area.
  4. Modify the selected schema if needed. You can change the name of the schema and according to your needs, you can also customize the schema structure in the Schema area.
    Indeed, the tool bar allows you to add, remove or move columns in your schema.
    To overwrite the modifications you made on this selected schema with its default one, click Retrieve schema. Note that this overwriting does not retain any custom edits.
  5. Click Finish to complete the HDFS file schema creation. All the retrieved schemas are displayed under the related HDFS connection node in the Repository view.
    If then you still need to edit a schema, right click this schema under the relevant HDFS connection node in the Repository view and from the contextual menu, select Edit Schema to open this wizard again and then make the modifications.
    Note:

    If you modify the schemas, ensure that the data type in the Type column is correctly defined.