Defining the file parsing parameters - 7.3

Talend Open Studio User Guide

Version
7.3
Language
English
Product
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Module
Talend Studio
Content
Design and Development
Last publication date
2023-10-11
Available in...

Open Studio for Big Data

Open Studio for Data Integration

Open Studio for ESB

About this task

On this view, you can refine the various settings of your file so that the file schema can be properly retrieved.

Procedure

  1. Set the Encoding type,and the Field and Row separators in the File Settings area.
  2. Depending on your file type (CSV or delimited), set the Escape and Enclosure characters to be used.
  3. If the file preview shows a header message, exclude the header from the parsing. Set the number of header rows to be skipped. Also, if you know that the file contains footer information, set the number of footer lines to be ignored.
  4. The Limit of Rows allows you to restrict the extend of the file being parsed. If needed, select the Limit check box and set or select the desired number of rows.
  5. In the File Preview panel, view the new settings impact.
  6. Check the Set heading row as column names box to transform the first parsed row as labels for schema columns. Note that the number of header rows to be skipped is then incremented by 1.
  7. Click Refresh on the preview panel for the settings to take effect and view the result on the viewer.
  8. Click Next to proceed to the final step to check and customize the generated file schema.