Flat Properties - 6.3

Talend Data Mapper User Guide


The following properties are used only for flat structures.

Initiator

The initiator is the sequence of characters that signals the start of this element. The initiator can be used for an element at any level, or having any group type. For example, you might have a situation where one of several elements can occur, and each element has a certain character sequence that precedes it. In this case, you can define a parent element to these elements with a group type of choice, and then define the initiator characters in each child element.

The special characters described under Initiator and Terminator Special Characters apply here.

Whether the initiator characters are considered part of the text of the element depends on the setting of the Include Initiator property below.
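The choice scenario described above can be sketched in a few lines of Python. This is only an illustration of the idea, not how Talend Data Mapper is implemented: the function name `choose_element`, the element names, and the initiator strings are all hypothetical, and trying longer initiators first is a design choice of this sketch.

```python
def choose_element(data: str, pos: int, initiators: dict) -> tuple:
    """Pick the child of a choice group whose initiator appears at `pos`.

    `initiators` maps a (hypothetical) child element name to its initiator.
    Returns the chosen name and the position just past the initiator.
    """
    # Longer initiators are tried first so that "AB" is not shadowed by "A"
    # (an assumption of this sketch, not documented product behavior).
    candidates = sorted(initiators.items(), key=lambda kv: len(kv[1]), reverse=True)
    for name, initiator in candidates:
        if data.startswith(initiator, pos):
            return name, pos + len(initiator)
    raise ValueError(f"no child initiator matches at position {pos}")


children = {"Header": "H|", "Detail": "D|"}
print(choose_element("D|item-1", 0, children))
```

Each child is identified purely by the characters that precede it, which is why the parent only needs a group type of choice.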

Initiator and Terminator Special Characters

The following are special characters that may be specified in the Initiator or Terminator properties:

  • \n - The newline character or characters, depending on the platform. This is not necessarily the actual newline character (ASCII 10); it is the newline sequence defined in the flat representation properties.

  • \t - A single tab character is expected on input and emitted on output.

  • \uXXXX - A single Unicode character specified in hex.

  • \w - A single whitespace character which can be a newline, space or tab. On output a single space is emitted.

  • \W - One or more consecutive whitespace characters (newline, space or tab). Note that this consumes consecutive whitespace characters until a non-whitespace character is reached. On output all spaces are emitted.

  • \\ - A single backslash character is expected and emitted on output.
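To make the input-side meaning of these escapes concrete, the following Python sketch translates an initiator or terminator specification into a regular expression. It is an approximation under stated assumptions: the function `spec_to_regex` and its `newline` parameter (standing in for the newline sequence from the flat representation properties) are hypothetical, and output-side behavior is not modeled.

```python
import re


def spec_to_regex(spec: str, newline: str = "\n") -> str:
    """Translate an initiator/terminator spec into a regex (input side only)."""
    out = []
    i = 0
    while i < len(spec):
        ch = spec[i]
        # Ordinary characters (and a trailing lone backslash) match literally.
        if ch != "\\" or i + 1 >= len(spec):
            out.append(re.escape(ch))
            i += 1
            continue
        code = spec[i + 1]
        if code == "n":      # configured newline sequence (assumed parameter)
            out.append(re.escape(newline))
            i += 2
        elif code == "t":    # a single tab
            out.append(re.escape("\t"))
            i += 2
        elif code == "u":    # \uXXXX: one Unicode character given in hex
            out.append(re.escape(chr(int(spec[i + 2:i + 6], 16))))
            i += 6
        elif code == "w":    # exactly one whitespace character
            out.append("[ \\t\\n]")
            i += 2
        elif code == "W":    # one or more whitespace characters
            out.append("[ \\t\\n]+")
            i += 2
        elif code == "\\":   # a literal backslash
            out.append(re.escape("\\"))
            i += 2
        else:
            raise ValueError(f"unsupported escape: \\{code}")
    return "".join(out)


pattern = spec_to_regex(r"\W|")
print(bool(re.fullmatch(pattern, " \t \n|")))
```

Note the difference between \w and \W here: the first maps to a character class that matches once, the second to the same class with a `+` quantifier.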

Include Initiator

If set, the value of the initiator is included in the text value of the element. This is meaningful only for elements with a group type of none (that is, non-container elements).

This means that on input, the initiator value is added to the beginning of the text value of the element, and on output the initiator value is not written because it is assumed to be part of the value of the element.
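The input and output behavior described above can be illustrated with a minimal Python sketch. The functions `read_element` and `write_element` are hypothetical names, and the sketch assumes the raw text of one element has already been isolated from the document.

```python
def read_element(raw: str, initiator: str, include_initiator: bool = False) -> str:
    """Return the text value of an element whose raw text starts with `initiator`."""
    if not raw.startswith(initiator):
        raise ValueError("element does not start with its initiator")
    # With Include Initiator set, the initiator stays at the front of the value.
    return raw if include_initiator else raw[len(initiator):]


def write_element(value: str, initiator: str, include_initiator: bool = False) -> str:
    """Return the raw text to emit for an element value."""
    # With Include Initiator set, the value is assumed to already contain the
    # initiator, so it is not written a second time.
    return value if include_initiator else initiator + value


for flag in (False, True):
    value = read_element("ID:42", "ID:", include_initiator=flag)
    print(flag, repr(value), repr(write_element(value, "ID:", include_initiator=flag)))
```

In both settings the raw text round-trips unchanged; only the value seen by the map differs.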

Terminator

The terminator is the sequence of characters that signals the end of this element.

The special characters described under Initiator and Terminator Special Characters apply here.

Whether the terminator characters are considered part of the text of the element depends on the setting of the Include Terminator property below.

Include Terminator

If set, the value of the terminator is included in the text value of the element. This is meaningful only for elements with a group type of none (that is, non-container elements).

This means that on input, the terminator value is added to the end of the text value of the element, and on output the terminator value is not written because it is assumed to be part of the value of the element.
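As a rough illustration of terminator handling on input, the Python sketch below scans forward from a position to the next terminator and returns the element value together with the position where the next element starts. The function `read_until_terminator` is a hypothetical name, and special characters and the release character are deliberately ignored here.

```python
def read_until_terminator(data: str, pos: int, terminator: str,
                          include_terminator: bool = False) -> tuple:
    """Read one element starting at `pos` and ending at the next terminator."""
    end = data.find(terminator, pos)
    if end == -1:
        raise ValueError("terminator not found")
    stop = end + len(terminator)
    # With Include Terminator set, the terminator is kept at the end of the value.
    value = data[pos:stop] if include_terminator else data[pos:end]
    return value, stop


record = "AB;CD;"
first, pos = read_until_terminator(record, 0, ";")
second, pos = read_until_terminator(record, pos, ";", include_terminator=True)
print(first, second, pos)
```

Either way the reader resumes after the terminator; the flag only controls whether those characters appear in the value.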

Start Offset

The start offset is used when working with positional formatted documents where the first character of this element does not immediately follow the last character of the previous element. This is the number of characters to skip before starting this element. If used in conjunction with the Column property, the characters are skipped after going to the specified column.

Column

Column indicates the column where the element starts. This causes the reader to advance to the next occurrence of the specified column, regardless of what row (line) you are on. For example, if the reader has finished the previous element on column 5 and column 20 is specified, it will go to column 20 on the same line. However, if the reader has finished the element and is at column 47, it will go to column 20 on the next line.
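The interaction between Column and Start Offset can be sketched as follows. This is an illustration under assumptions that the text above does not confirm: columns are taken to be 1-based, lines are separated by a single "\n", and a reader already sitting on the requested column stays on the current line. The function name `advance_to_column` is hypothetical.

```python
def advance_to_column(text: str, pos: int, column: int, start_offset: int = 0) -> int:
    """Return the index in `text` where the next element starts.

    `pos` is the index just past the previous element; `column` is 1-based
    (an assumption of this sketch).
    """
    line_start = text.rfind("\n", 0, pos) + 1   # 0 when pos is on the first line
    current_column = pos - line_start + 1
    if current_column > column:
        # Already past the column on this line: use the next line instead.
        newline_at = text.find("\n", pos)
        if newline_at == -1:
            raise ValueError("no following line contains the requested column")
        line_start = newline_at + 1
    # Go to the column first, then skip the start offset.
    return line_start + (column - 1) + start_offset


text = "0123456789\nabcdefghij\n"
print(text[advance_to_column(text, 2, 5)])
print(text[advance_to_column(text, 7, 5)])
print(text[advance_to_column(text, 2, 5, start_offset=2)])
```

The second call mirrors the column 47 case above: the reader is already past the requested column, so it moves to that column on the next line.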

Quote Handling

Quote Handling is used when processing comma-separated values (CSV) files. It provides predefined options for handling quotation marks that may surround elements. Select one of the following:

  • None - No special handling for quotation marks.

  • Optional Quotes - Double quotation marks may be present or absent for this element. If they are present, they will bound the element, and they will not be included in the data. If they are absent, the element is bounded by the normal initiator or terminator.

  • Required Quotes - Same as Optional Quotes, except that the quotation marks must be present for each element. If the quotation marks are missing, an error will occur.
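The three options can be compared with a small Python sketch. It is a simplified model rather than the product's reader: the function `read_csv_field` and the option strings "none", "optional" and "required" are hypothetical, and doubled quotation marks inside a quoted value are not handled.

```python
def read_csv_field(data: str, pos: int, terminator: str = ",",
                   quote_handling: str = "none") -> tuple:
    """Read one field at `pos`; return its value and the next read position."""
    quoted = data.startswith('"', pos)
    if quote_handling == "required" and not quoted:
        raise ValueError("quotation marks are required for this element")
    if quote_handling in ("optional", "required") and quoted:
        close = data.find('"', pos + 1)
        if close == -1:
            raise ValueError("closing quotation mark not found")
        value = data[pos + 1:close]      # the quotes bound the value, not part of it
        pos = close + 1
        if data.startswith(terminator, pos):
            pos += len(terminator)       # consume the terminator after the quote
        return value, pos
    # No quote handling applies: the element is bounded by the terminator.
    end = data.find(terminator, pos)
    if end == -1:
        return data[pos:], len(data)
    return data[pos:end], end + len(terminator)


line = '"a,b",c'
print(read_csv_field(line, 0, quote_handling="optional"))
print(read_csv_field(line, 0, quote_handling="none"))
```

With None, the comma inside the quotation marks ends the field early, which is the situation the other two options are meant to avoid.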

Release

Defines the single character that causes an initiator or terminator not to be recognized. For example, if the terminator character is a double quote, and you specify a backslash as the release character, you can include a double quote within the value of the element by preceding it with a backslash. If the release character is a backslash and you want to include a backslash in the value, simply enter two backslashes.
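The example above can be sketched in Python as a reader that honors a release character while looking for the terminator. The function `read_with_release` is a hypothetical name, and the sketch assumes that the release character is dropped from the value and that it releases exactly one following character.

```python
def read_with_release(data: str, pos: int, terminator: str,
                      release: str = "\\") -> tuple:
    """Read up to an unreleased terminator; return the value and next position."""
    out = []
    i = pos
    while i < len(data):
        if data[i] == release and i + 1 < len(data):
            # The character after the release character is taken literally,
            # so it cannot start a terminator (or act as another release).
            out.append(data[i + 1])
            i += 2
            continue
        if data.startswith(terminator, i):
            return "".join(out), i + len(terminator)
        out.append(data[i])
        i += 1
    raise ValueError("terminator not found")


raw = 'say \\"hi\\" \\\\ ok"rest'
value, pos = read_with_release(raw, 0, '"')
print(value)
print(raw[pos:])
```

Here the two released double quotes and the doubled backslash end up in the value, and reading stops at the first double quote that is not preceded by the release character.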