Skip to main content Skip to complementary content
Close announcements banner

Retrieving a file schema

Procedure

  1. When you click Retrieve Schema, a new wizard opens up where you can filter and display different objects in the HDFS.
    Information noteNote: You can retrieve schema from CSV, Avro and Sequence files.
  2. In the Name filter field, you can enter the name of the file(s) you are looking for to filter it/them.
    Otherwise, you can expand the folders listed in this wizard by selecting the check box before them. Then, select the file(s) of which you need to retrieve the schema(s)
    Each time when the schema retrieval is done for a file selected, the Creation status of this file becomes Success.
  3. Click Next to open a new view on the wizard that lists the selected file schema(s). You can select any of them to display its details in the Schema area.
  4. Modify the selected schema if needed. You can change the name of the schema and according to your needs, you can also customize the schema structure in the Schema area.
    Indeed, the tool bar allows you to add, remove or move columns in your schema.
    To overwrite the modifications you made on this selected schema with its default one, click Retrieve schema. Note that this overwriting does not retain any custom edits.
  5. Click Finish to complete the HDFS file schema creation. All the retrieved schemas are displayed under the related HDFS connection node in the Repository view.
    If then you still need to edit a schema, right click this schema under the relevant HDFS connection node in the Repository view and from the contextual menu, select Edit Schema to open this wizard again and then make the modifications.
    Information noteNote:

    If you modify the schemas, ensure that the data type in the Type column is correctly defined.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!