Configuring Spark Job Server

Talend Big Data Platform Installation Guide for Windows

EnrichVersion: 6.4
EnrichProdName: Talend Big Data Platform
task: Installation and Upgrade
EnrichPlatform: Talend Data Preparation, Talend CommandLine, Talend DQ Portal, Talend Studio, Talend Data Stewardship, Talend SAP RFC Server, Talend Log Server, Talend Project Audit, Talend Runtime, Talend Activity Monitoring Console, Talend Administration Center, Talend JobServer, Talend Installer, Talend Repository Manager, Talend Artifact Repository

This section describes how to configure Spark Job Server for use with Talend Data Preparation in a Big Data context.

  1. Open the <Spark_Job_Server_Path>/settings.sh file.

  2. Edit the following line to specify the IP address on which the Spark Job Server service listens.

    sjs_host=localhost

  3. Edit the following line to specify the port on which the Spark Job Server service listens.

    sjs_port=8098

  4. Edit the following line to specify the path to the Hadoop cluster settings directory. This must be a local path: obtain the settings files (such as hdfs-site.xml, mapred-site.xml, core-site.xml, and yarn-site.xml) from your Hadoop cluster and copy them to the machine where you installed Spark Job Server.

    hadoop_conf_dir=/path/to/hadoop/cluster/settings/directory

  5. To use Spark Job Server with a secure Hadoop cluster (using Kerberos), add the following line to the file. This must be a local path: obtain the krb5.conf file from your Hadoop cluster and copy it to the machine where you installed Spark Job Server.

    krb5.config=/path/to/Kerberos/configuration/file/krb5.conf

  6. Save your changes to the settings file.

  7. Restart Spark Job Server to apply your changes.
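
Before restarting, it can help to confirm that the directory set in hadoop_conf_dir actually contains the copied settings files. The following is a minimal sketch of such a check, not part of the Talend distribution: the function names get_setting and check_hadoop_conf_dir are hypothetical, and the file list is taken from the examples named in step 4, so adjust it to what your cluster provides. The script reads the settings file as plain key=value text rather than sourcing it, on the assumption that it uses the line format shown in the steps above.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Spark Job Server).
# Assumes the settings file contains plain key=value lines as shown in the steps above.

# Print the value of the last "key=value" line for the given key.
# $1 = settings file, $2 = key
get_setting() {
    sed -n "s/^[[:space:]]*$2=//p" "$1" | tail -n 1
}

# Report each example settings file from step 4 that is missing from
# hadoop_conf_dir. Returns 0 if all are present, 1 otherwise.
# $1 = settings file
check_hadoop_conf_dir() {
    conf_dir=$(get_setting "$1" hadoop_conf_dir)
    missing=0
    for f in hdfs-site.xml mapred-site.xml core-site.xml yarn-site.xml; do
        if [ ! -f "$conf_dir/$f" ]; then
            echo "missing: $f"
            missing=1
        fi
    done
    return "$missing"
}
```

With these functions loaded in a POSIX shell, running check_hadoop_conf_dir "<Spark_Job_Server_Path>/settings.sh" prints one line per missing file and returns a non-zero status if any file is absent.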