Installing Talend Data Preparation manually - 7.0

Talend Data Fabric Installation Guide for Linux

English (United States)
Talend Data Fabric
Talend Activity Monitoring Console
Talend Administration Center
Talend Artifact Repository
Talend CommandLine
Talend Data Preparation
Talend Data Stewardship
Talend DQ Portal
Talend ESB
Talend Identity and Access Management
Talend Installer
Talend JobServer
Talend Log Server
Talend MDM Server
Talend MDM Web UI
Talend Repository Manager
Talend Runtime
Talend SAP RFC Server
Talend Studio
Installation and Upgrade
This procedure contains the steps to manually install Talend Data Preparation on your machine. For the automatic installation procedure using Talend Installer, see Using Talend Installer graphical installation mode.

Before you begin


  1. Download a MongoDB 3.4 instance from and install it.
    For more information on the supported MongoDB databases, see Compatible databases.
    For more information on how to install it, see MongoDB documentation.
    If you want to secure connections with MongoDB using SSL, MongoDB Enterprise Server has to be manually installed on your machine. For more information, see
  2. Unzip the file where you want Talend Data Preparation to be installed.
  3. Unzip the <Data_Preparation_Path>/services/ file where you want Components Catalog to be installed.
  4. To use Talend Data Preparation in a Big Data context, you need to install two additional tools, Streams Runner and Spark Job Server.
    Note that Streams Runner and Spark Job Server must be installed on a Linux machine.
    1. Unpack <Data_Preparation_Path>/services/data-streams-streamsrunner-svc-A.B.C.tgz file where you want Streams Runner to be installed.
    2. Unpack the <Data_Preparation_Path>/services/spark-jobserver-A.B.C.tar.gz file where you want Spark Job Server to be installed. This file contains Spark Job Server plus all the required dependencies.
      Additionally, you must have already installed curl, a command-line tool and library for transferring data with URLs. You can download it from if needed.
  5. Add mongo to the PATH environment variable.
  6. Create the dataprep database in MongoDB using the following command:
    use dataprep
  7. Create the following user for the dataprep database in MongoDB:
    • Username: dataprep-user
    • Password: duser
    To do this, you can use the following command:
    db.createUser( { user: "dataprep-user", pwd: "duser", roles: [{ role: "readWrite", db: "dataprep"}]})
    You can automatically create the user and password by executing the <Data_Preparation_Path>/ file.