Bulk-writing the actors data in Neo4j - 7.0

Neo4j

author
Talend Documentation Team
EnrichVersion
7.0
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Open Studio for Big Data
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Database components > Neo4j components
Data Quality and Preparation > Third-party systems > Database components > Neo4j components
Design and Development > Third-party systems > Database components > Neo4j components
EnrichPlatform
Talend Studio

Procedure

  1. Double-click the first tFileInputDelimited component to open its Component view.
  2. In the File name/Stream field, enter the path or browse to the CSV file that describes the actors' IDs, names and their labels to be used in Neo4j.

    The input CSV file used in this example reads as follows:

    keanu,"Keanu Reeves",Actor
    laurence,"Laurence Fishburne",Actor
    carrieanne,"Carrie-Anne Moss",Actor

    The double quotation marks on the actor names are not mandatory.

  3. Click the [...] button next to Edit schema to open the schema editor, and define the input schema based on the structure of the input file.

    In this example, the columns are id, name and label, all of type String.

  4. Click OK to close this editor and accept the propagation of the schema to the next component.
  5. In the Field separator field, enter a comma (,) to replace the default semicolon (;).
  6. Double-click the first tNeo4jBatchOutput component to open its Component view.
  7. Select the Use an existing connection check box to reuse the Neo4j database connection opened by the tNeo4jConnection component.
  8. Verify that the Shutdown after Job check box is clear.
  9. From the Field that contains the label list drop-down list, select the column that provides labels.
  10. In the Index name field, enter the name of the index to be created for the nodes.
  11. From Import identifier drop-down list, select the column that provides IDs.