Procedure
-
Open the <Data_Preparation_Path>/config/application.properties file
and edit the following Talend Data Preparation properties:
Field Action public.ip
Enter the hostname you want to use to access Talend Data Preparation. server.port
Enter the port you want to use for Talend Data Preparation user interface. iam.ip
Enter the URL to your Talend Identity and Access Management instance. security.oauth2.client.clientId
Enter the Talend Identity and Access Management OIDC client identifier. security.oauth2.client.clientSecret
Enter the Talend Identity and Access Management OIDC client password. iam.scim.url
Make sure that Talend Identity and Access Management port is correct. app.products[0].id=TDS
app.products[0].name=Data Stewardship
app.products[0].url=<place_your_tds_url_here>
Enter the URL to your Talend Data Stewardship instance. All the passwords entered in the properties file are encrypted when you start your Talend Data Preparation instance.
-
Update the following fields with your MongoDB settings:
Field Description mongodb.host
Host name of your MongoDB instance mongodb.port
Port number of your MongoDB instance mongodb.database
Name of the database on which Talend Data Preparation is connected, dataprep
by default. The database is created when you first launch Talend Data Preparation.mongodb.user
Username used to connect to the database mongodb.password
Password used to connect to the database -
To enable the interaction between Talend Data Preparation and the
Components Catalog service, edit
the following line with your Components Catalog server host and port:
tcomp.server.url=http://<tcomp_host>:<tcomp_port>/tcomp
-
To enable the app switcher after installing Talend Data Preparation and Talend Data Stewardship, uncomment the following
lines and add the URL to your Talend Data Stewardship instance:
app.products[0].id=TDS app.products[0].name=Data Stewardship app.products[0].url=<place_your_tds_url_here>
You must also add the URL to your Talend Data Preparation instance to the configuration file for Talend Data Stewardship. For more information, see the section about configuring Talend Data Stewardship after installation.
- By default, audit logs are enabled. You must specify the correct appender.http.url parameter in the audit.properties file, or disable audit logs. For more information, see Enabling and configuring audit capabilities in Talend Data Preparation.
-
To configure the access to Talend Dictionary Service, edit the following fields:ⓘ
Available in:
Big Data Platform
Data Fabric
Data Management Platform
Data Services Platform
MDM Platform
Real-Time Big Data Platform
Field Description tsd.consumer.enabled
Set the value of this parameter to true
in order to enable the interaction between Talend Data Preparation and Talend Dictionary Service.dataquality.semantic.list.enable
Set the value of this parameter to true
in order to display the semantic type management interface in Talend Data Preparation.dataquality.server.url
Enter the URL to your Talend Dictionary Service instance. spring.cloud.stream.kafka.binder.brokers
Enter the host corresponding to your Kafka broker. spring.cloud.stream.kafka.binder.defaultBrokerPort
Enter the port corresponding to your Kafka broker. spring.cloud.stream.kafka.binder.zkNodes
Enter the host corresponding to your ZooKeeper node. spring.cloud.stream.kafka.binder.defaultZkPort
Enter the port corresponding to your ZooKeeper node. -
Change the value of theⓘ
Available in:
Big Data Platform
Data Fabric
Data Management Platform
Data Services Platform
MDM Platform
Real-Time Big Data Platform
dataquality.indexes.file.location
property from${java.io.tmpdir}/org.talend.dataquality.semantic
to<other_location>/org.talend.dataquality.semantic
.By default, the custom semantic types that you create using Talend Dictionary Service are stored in a tmp directory. To avoid losing your changes, it is recommended to change the save location of your custom semantic types. You can set a path to the location of your choice, as long as it is not in a tmp folder. -
To enable to use of the Streams Runner with Talend Data Preparation, set the
streams.enable
property astrue
. -
To configure the access to the Streams Runner, edit the following
fields:
Field Description streams.flow.runner.url
Enter the URL to your Streams Runner. The URL has the following syntax: http://<local_machine_IP>:<Big_data_preparation_port>/
streams.kerberos.principal
Enter your Kerberos principal. streams.kerberos.keytab_path
Enter the path to your Kerberos keytab file. streams.hdfs.server.url
You can optionally set a default URL to be displayed in the input and output Path fields, when working with HDFS datasets, in Talend Data Preparation. The <Data_Preparation_Path>/config/tuning.properties file contains additional parameters for more advanced tuning. Make sure the parameters in this file match the sizing of your cluster. -
To enable the semantic types, edit the following lines:
dataquality.semantic.list.enable=true
anddataquality.server.url=http://<local machine ip>:8187/
. - Execute the start.sh file to start your Talend Data Preparation instance.