tNeo4jBatchOutput Standard properties - 6.5

Neo4j

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Database components > Neo4j components
Data Quality and Preparation > Third-party systems > Database components > Neo4j components
Design and Development > Third-party systems > Database components > Neo4j components
EnrichPlatform
Talend Studio

These properties are used to configure tNeo4jBatchOutput running in the Standard Job framework.

The Standard tNeo4jBatchOutput component belongs to the Big Data and the Databases families.

The component in this framework is available in all Talend products with Big Data.

Basic settings

Use an existing connection

Select this check box and in the Component List click the relevant connection component to reuse the connection details you already defined.

This component supports Neo4j version V3.2.X only and does not support the remote mode. Therefore, do not reuse the connection to versions other than V3.2.X defined in a tNeo4jConnection component and do not select the Remote server check box in tNeo4jConnection.

Do not use a 2.X.X version and a 3.X.X version in the same Job. Otherwise, class conflict issues occur.

Database path

Specify the directory to hold your data files. If the specified directory does not exist, it will be created.

This field appears only if you do not select the Use an existing connection check box.

Shutdown after job

Select this check box to shutdown the Neo4j database connection when no more operations on Neo4j are going to be performed after the current component.

Alternatively, you can use tNeo4jClose to shutdown the database.

This avoids errors such as "Id file not properly shutdown" at next execution of Jobs involving Neo4j.

This check box is available only if the Use an existing connection check box is not selected.

Field that contains the label list

Select the column from the input schema you have defined in the preceding components to provide labels for the nodes to be created.

When the input data provides multiple labels for a node, separate those labels by semi-colons (;) in the preceding components.

Index name

Enter, within double quotation marks, the name of the index to be created for the nodes. This index is used to store the identifiers of the nodes and help create relationships.

Import identifier

Select the column from the input schema you have defined in the preceding components to provide identifiers for the nodes to be created. These identifiers are stored in the index to be created and must be unique cross the index.

Therefore, ensure that the column to be used contains only unique values.

Save the identifier

Select this check box to save the technical identifiers (native graph ids).

For further information about the different types of Neo4j entity identifiers, see Neo4j documentation: Entity identifier.

Number of index elements in memory

Enter, without double quotation marks, the cache size used to keep the index elements in memory. This value must be enough to keep all of the index elements in memory.

For further information, see Neo4j documentation: Memory tuning.

Advanced settings

Neo4j configuration

Add parameters to the table to configure the database to be created.

For further information, see Neo4j documentation: Configuration settings.

When entering values, use the syntax demonstrated by the examples given alongside the column names of this Nodes files table.

tStatCatcher Statistics

Select this check box to gather the Job processing metadata at the Job level as well as at each component level.

Global Variables

Global Variables

NB_LINE: the number of rows read by an input component or transferred to an output component. This is an After variable and it returns an integer.

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the Die on error check box is cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill up a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.

Usage

Usage rule

This component is used as an output component and it always needs an incoming link.