Installing Talend Data Preparation manually - 6.3

Talend Big Data Installation Guide for Linux

This procedure contains the steps to manually install Talend Data Preparation on your machine. For the automatic installation procedure using Talend Installer, see Using Talend Installer graphical installation mode.

Prerequisites:

To manually install and configure Talend Data Preparation, follow this procedure:

  1. Download MongoDB 3.2 from https://www.mongodb.com/download-center and install it. For more information on how to install it, see the MongoDB documentation.

    If you want to secure connections with MongoDB using SSL, MongoDB Enterprise Server has to be manually installed on your machine. For more information, see https://docs.mongodb.com/v3.2/security/.

  2. Unzip the Talend-DataPreparation-Server-VA.B.C.zip file where you want Talend Data Preparation to be installed.

  3. Unzip the <Data_Preparation_Path>/services/components-api-service-rest-all-components-VA.B.C.zip file where you want Components Catalog to be installed.

  4. To use Talend Data Preparation in a big data context, you need to install two additional tools, Flow Runner and Spark Job Server.

    1. Unpack the <Data_Preparation_Path>/services/datastreams-flowrunner-A.B.C.tgz file where you want Flow Runner to be installed.

    2. Unpack the <Data_Preparation_Path>/services/spark-jobserver-A.B.C.tar.gz file where you want Spark Job Server to be installed. This file contains Spark Job Server plus all the required dependencies.

      Note that Spark Job Server must be installed on a Linux machine.

      Additionally, curl, a command-line tool and library for transferring data with URLs, must already be installed. If needed, you can download it from https://curl.haxx.se/.

  5. Add mongo to the PATH environment variable.

  6. Create the dataprep database in MongoDB.

  7. Create the following user for the dataprep database in MongoDB:

    • Username: dataprep-user

    • Password: duser

    You can automatically create the user and password by executing the <Data_Preparation_Path>/create_mongo_user.sh file.

    Before you use Talend Data Preparation for the first time, you must also perform certain configuration steps. For more information, see Configuring Talend Data Preparation, Configuring the Components Catalog server and, if appropriate, Configuring Flow Runner and Configuring Spark Job Server.
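
The unpacking steps and steps 5 to 7 above can be sketched as a small shell script. This is an illustration only, not the content of create_mongo_user.sh: the MONGO_BIN_DIR location, the APPLY switch, the helper function names and the readWrite role are all assumptions that do not come from this guide, and the script presumes a local mongod on the default port that accepts unauthenticated connections.

```shell
#!/bin/sh
# Illustrative sketch only. MONGO_BIN_DIR, APPLY, the function names and the
# readWrite role are assumptions, not values taken from the Talend guide.

# Hypothetical location of the MongoDB binaries; adjust to your installation.
MONGO_BIN_DIR="${MONGO_BIN_DIR:-/opt/mongodb/bin}"

# Steps 2 to 4: unpack an archive ($1) into a target directory ($2).
unpack() {
    mkdir -p "$2" || return 1
    case "$1" in
        *.zip)          unzip -q "$1" -d "$2" ;;
        *.tgz|*.tar.gz) tar -xzf "$1" -C "$2" ;;
        *) echo "Unsupported archive: $1" >&2; return 1 ;;
    esac
}

# Step 5: prepend a directory to PATH unless it is already listed.
add_to_path() {
    case ":$PATH:" in
        *":$1:"*) ;;              # already present, nothing to do
        *) PATH="$1:$PATH" ;;
    esac
    export PATH
}

# Steps 6 and 7: print the mongo shell statement that creates the user.
# Arguments: user name, password, database. The role list is an assumption.
create_user_js() {
    printf 'db.createUser({user: "%s", pwd: "%s", roles: [{role: "readWrite", db: "%s"}]})\n' "$1" "$2" "$3"
}

add_to_path "$MONGO_BIN_DIR"
JS=$(create_user_js dataprep-user duser dataprep)

# Dry run by default; set APPLY=yes to execute against the local mongod.
if [ "${APPLY:-no}" = "yes" ]; then
    mongo dataprep --eval "$JS"
else
    echo "Would run: mongo dataprep --eval '$JS'"
fi
```

Run as is, the script only prints the command it would execute. Running it with APPLY=yes set in the environment passes the statement to the mongo shell, which connects to the dataprep database and creates the user there. The unpack helper takes the archive path and the target directory as arguments, for example the Flow Runner archive from the substep above and the directory where you want Flow Runner to be installed.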