Procedure
-
Open the
<Data_Preparation_Path>/config/application.properties
file and edit the following Talend Data Preparation properties:
Field Action tac.url
Enter the URL to your Talend Administration Center followed by a /
.public.ip
Enter the URL you want to use to access Talend Data Preparation. server.port
Enter the port you want to use for Talend Data Preparation user interface. iam.ip
Enter the URL to your Talend Identity and Access Management instance. tac.user-name
Enter the username of your Data Preparation user in Talend Administration Center. tac.password
Enter the password of your Data Preparation user in Talend Administration Center. security.oauth2.client.clientId
Enter the Talend Identity and Access Management OIDC client identifier. security.oauth2.client.clientSecret
Enter the Talend Identity and Access Management OIDC client password. iam.scim.url
Make sure that Talend Identity and Access Management port is correct. app.products[0].id=TDS
app.products[0].name=Data Stewardship
app.products[0].url=<place_your_tds_url_here>
Enter the URL to your Talend Data Stewardship instance. All the passwords entered in the properties file are encrypted when you start your Talend Data Preparation instance.
-
Update the following fields with your MongoDB settings:
Field Description mongodb.host
Host name of your MongoDB instance mongodb.port
Port number of your MongoDB instance mongodb.database
Name of the database on which Talend Data Preparation is connected, dataprep
by default. The database is created when you first launch Talend Data Preparation.mongodb.user
Username used to connect to the database mongodb.password
Password used to connect to the database -
To enable the interaction between Talend Data Preparation and the Components Catalog service, edit the
following line with your Components Catalog server host and
port:
tcomp.server.url=http://<tcomp_host>:<tcomp_port>/tcomp
-
To enable the app switcher after installing Talend Data Preparation and Talend Data Stewardship, uncomment the
following lines and add the URL to your Talend Data Stewardship instance:
app.products[0].id=TDS app.products[0].name=Data Stewardship app.products[0].url=<place_your_tds_url_here>
You must also add the URL to your Talend Data Preparation instance to the configuration file for Talend Data Stewardship. For more information, see Configuring Talend Data Stewardship after installation.
- By default, audit logs are enabled. You must specify the correct appender.http.url parameter in the audit.properties file, or disable audit logs. For more information, see Enabling and configuring audit capabilities in Talend Data Preparation.
-
To configure the access to Talend Dictionary Service, edit the following
fields:
Field Description dataquality.semantic.update.enable
Set the value of this parameter to true
in order to enable the interaction between Talend Data Preparation and Talend Dictionary Service.dataquality.semantic.list.enable
Set the value of this parameter to true
in order to display the semantic type management interface in Talend Data Preparation.semanticservice.url
Enter the URL to your Talend Dictionary Service instance. spring.cloud.stream.kafka.binder.brokers
Enter the host corresponding to your Kafka broker. spring.cloud.stream.kafka.binder.defaultBrokerPort
Enter the port corresponding to your Kafka broker. spring.cloud.stream.kafka.binder.zkNodes
Enter the host corresponding to your Zookeeper node. spring.cloud.stream.kafka.binder.defaultZkPort
Enter the port corresponding to your Zookeeper node. -
Change the value of the
dataquality.indexes.file.location
property from${java.io.tmpdir}/org.talend.dataquality.semantic
to<other_location>/org.talend.dataquality.semantic
.By default, the custom semantic types that you create using Talend Dictionary Service are stored in a tmp directory. To avoid losing your changes, it is recommended to change the save location of your custom semantic types. You can set a path to the location of your choice, as long as it is not in a tmp folder. -
To enable to use of the Streams Runner with Talend Data Preparation, set the
streams.enable
property astrue
. -
To configure the access to the Streams Runner, edit the
following fields:
Field Description streams.flow.runner.url
Enter the URL to your Streams Runner. The URL has the following syntax: http://<local_machine_IP>:<Big_data_preparation_port>/
streams.kerberos.principal
Enter your Kerberos principal. streams.kerberos.keytab_path
Enter the path to your Kerberos keytab file. streams.hdfs.server.url
You can optionally set a default URL to be displayed in the input and output Path fields, when working with HDFS datasets, in Talend Data Preparation. The <Data_Preparation_Path>/config/tuning.properties file contains additional parameters for more advanced tuning. Make sure the parameters in this file match the sizing of your cluster. - Execute the start.sh file to start your Talend Data Preparation instance.