Available in...
Big Data
Big Data Platform
Cloud Big Data
Cloud Big Data Platform
Cloud Data Fabric
Data Fabric
Real-Time Big Data Platform
Procedure
-
When you click Retrieve Schema, a wizard opens in which you can
filter and display the different objects in HDFS.
Note: You can retrieve schemas from CSV, Avro and Sequence files.
-
In the Name filter field, enter the name of the file(s) you are
looking for to filter the list.
Otherwise, you can expand the folders listed in this wizard by selecting the check box before them. Then, select the file(s) whose schema(s) you need to retrieve.
Each time the schema retrieval completes for a selected file, the Creation status of that file becomes Success.
-
Click Next to open a new view in the wizard
that lists the schema(s) of the selected file(s). Select any schema to display
its details in the Schema area.
-
Modify the selected schema if needed. You can change the name of the schema
and, according to your needs, customize the schema structure in the
Schema area.
The toolbar allows you to add, remove or move columns in your schema.
To replace the modifications you made to the selected schema with its default version, click Retrieve schema. Note that this overwrites the schema and does not retain any custom edits.
-
Click Finish to complete the HDFS file schema
creation. All the retrieved schemas are displayed under the related HDFS
connection node in the Repository view.
If you still need to edit a schema afterwards, right-click it under the relevant HDFS connection node in the Repository view and select Edit Schema from the contextual menu to open this wizard again and make the modifications.
Note: If you modify the schemas, ensure that the data type in the Type column is correctly defined.
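To illustrate why the Type column deserves a check after retrieval, the following Python sketch guesses column names and types from a few sample rows of a delimited file. It is not how the wizard retrieves schemas; the function names (infer_type, guess_schema), the semicolon delimiter and the type labels (Integer, Double, String) are assumptions made for this example only.

```python
import csv
import io


def infer_type(values):
    """Guess a type label from sample values (illustrative labels only)."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "String"  # nothing to infer from, fall back to text

    def all_parse(cast):
        # True when every sample value can be converted with `cast`.
        try:
            for v in non_empty:
                cast(v)
            return True
        except ValueError:
            return False

    if all_parse(int):
        return "Integer"
    if all_parse(float):
        return "Double"
    return "String"


def guess_schema(text, delimiter=";"):
    """Return (column name, guessed type) pairs from a header row plus samples."""
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    header, samples = rows[0], rows[1:]
    # Transpose the sample rows so that each tuple holds one column's values.
    columns = list(zip(*samples)) if samples else [() for _ in header]
    return [(name, infer_type(col)) for name, col in zip(header, columns)]


# Hypothetical sample data: the zip column holds codes with leading zeros.
sample = "id;name;price;zip\n1;apple;0.5;01234\n2;pear;1;75011\n"
for name, type_label in guess_schema(sample):
    print(name, type_label)
```

In this sample, the zip column is guessed as Integer because every value parses as a number, although storing it as an integer would drop the leading zero of 01234. This is the kind of case where you would change the type to String in the Type column.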