Configuring Spark Job Server

Talend Real-time Big Data Platform Installation Guide for Linux

EnrichVersion
6.4
EnrichProdName
Talend Real-Time Big Data Platform
task
Installation and Upgrade
EnrichPlatform
Talend Installer
Talend Project Audit
Talend Runtime
Talend Data Preparation
Talend Log Server
Talend SAP RFC Server
Talend Studio
Talend Data Stewardship
Talend ESB
Talend DQ Portal
Talend Artifact Repository
Talend Activity Monitoring Console
Talend Repository Manager
Talend CommandLine
Talend Administration Center
Talend JobServer
Talend Identity Management

This section contains information on how to configure Spark Job Server for use with Talend Data Preparation in a Big Data context.

  1. Open the <Spark_Job_Server_Path>/settings.sh file.

  2. Edit the following line to specify the IP address that you want the Spark Job Server service to use. The service will listen on this IP address.

    sjs_host=localhost
  3. Edit the following line to specify the port that you want the Spark Job Server service to use. The service will listen on this port.

    sjs_port=8098
  4. Edit the following line to specify the path to the Hadoop cluster settings directory. Note that this must be a local path: you can obtain the settings files (such as hdfs-site.xml, mapred-side.xml, core-site.xml, and yarn-site.xml) from your Hadoop cluster and copy them to the machine where you installed Spark Job Server.

    hadoop_conf_dir=/path/to/hadoop/cluster/settings/directory
  5. To use Spark Job Server server with a secure Hadoop cluster (using Kerberos), add the following line to the file. Note that this must be a local path: you can obtain the krb5.conf file from your Hadoop cluster and copy it to the machine where you installed Spark Job Server.

    krb5.config=/path/to/Kerberos/configuration/file/krb5.conf

  6. Save your changes to the settings file.

  7. Restart Spark Job Server for your changes to be taken into account.