This procedure contains the steps to manually install Talend Data Preparation on your machine. For the automatic installation procedure using Talend Installer, see Using Talend Installer graphical installation mode.
Talend Administration Center is installed and running. For more information on Talend Administration Center installation, see Using Talend Installer graphical installation mode for the automatic installation or Installing and configuring Talend Administration Center for the manual installation.
A Talend Data Preparation user exists in Talend Administration Center. For more information, see Talend Administration Center User Guide.
There are no other instances of MongoDB installed on your machine.
To use Talend Data Preparation with Big Data, use one of the supported Hadoop distributions. For more information, see Supported Hadoop distribution versions for Talend Data Preparation with Big Data.
Before installing Talend Data Preparation, make sure that you fulfill the hardware and software requirements. For more information, see Before installing your Talend product.
When installing your Talend product manually, the installation procedures must be executed in a particular order. For more information, see Installing your Talend product manually.
To manually install and configure Talend Data Preparation, follow this procedure:
If you want to secure connections with MongoDB using SSL, MongoDB Enterprise Server has to be manually installed on your machine. For more information, see https://docs.mongodb.com/v3.2/security/.
Unzip the Talend-DataPreparation-Server-VA.B.C.zip file where you want Talend Data Preparation to be installed.
Unzip the <Data_Preparation_Path>/services/components-api-service-rest-all-components-VA.B.C.zip file where you want Components Catalog to be installed.
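As an illustration of the two unzip steps above, here is a minimal Python sketch that extracts a .zip archive into a chosen installation directory. The function name unzip_archive and the target directories are hypothetical and not part of the Talend distribution; any standard unzip tool works equally well.

```python
import zipfile
from pathlib import Path


def unzip_archive(archive: str, target_dir: str) -> Path:
    """Extract a .zip archive into target_dir and return the target path.

    Hypothetical helper for illustration only; it is not shipped with Talend.
    """
    target = Path(target_dir)
    # Create the installation directory if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(target)
    return target
```

For example, unzip_archive("Talend-DataPreparation-Server-VA.B.C.zip", "/opt/talend/dataprep") would extract the server archive, where /opt/talend/dataprep is an assumed location. Python's zipfile module does not restore Unix permission bits, so scripts in the extracted tree may need to be made executable afterwards.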
To use Talend Data Preparation in a big data context, you need to install two additional tools, Flow Runner and Spark Job Server.
Unpack the <Data_Preparation_Path>/services/datastreams-flowrunner-A.B.C.tgz file where you want Flow Runner to be installed.
Unpack the <Data_Preparation_Path>/services/spark-jobserver-A.B.C.tar.gz file where you want Spark Job Server to be installed. This file contains Spark Job Server plus all the required dependencies.
Note that Spark Job Server must be installed on a Linux machine.
Additionally, you must have already installed curl, a command-line tool and library for transferring data with URLs. If needed, you can download it from https://curl.haxx.se/.
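The unpack steps and the Spark Job Server requirements above can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names unpack_tarball and spark_job_server_problems are hypothetical, and the checks cover only what this page states (a Linux machine and curl available on the PATH).

```python
import platform
import shutil
import tarfile
from pathlib import Path
from typing import List


def unpack_tarball(archive: str, target_dir: str) -> Path:
    """Extract a gzip-compressed tar archive (.tgz or .tar.gz) into target_dir.

    Hypothetical helper for illustration only; it is not shipped with Talend.
    """
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, "r:gz") as tf:
        tf.extractall(target)
    return target


def spark_job_server_problems() -> List[str]:
    """Return the unmet Spark Job Server requirements named in this guide."""
    problems = []
    # Spark Job Server must be installed on a Linux machine.
    if platform.system() != "Linux":
        problems.append("Spark Job Server must be installed on a Linux machine")
    # curl must already be installed; this only checks that it is on the PATH.
    if shutil.which("curl") is None:
        problems.append("curl was not found on the PATH")
    return problems
```

An empty list from spark_job_server_problems() means both checks passed on the current machine. The same unpack_tarball call applies to the Flow Runner .tgz file and the Spark Job Server .tar.gz file, each with its own target directory.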
Create the dataprep database in MongoDB.
Create the following user for the dataprep database in MongoDB:
Before you use Talend Data Preparation for the first time, you must also perform certain configuration steps. For more information, see Configuring Talend Data Preparation, Configuring the Components Catalog server, and, if appropriate, Configuring Flow Runner and Configuring Spark Job Server.