Talend Data Preparation allows a direct
connection to various types of databases. You can use them as source to create new
datasets.
You can manually enrich the list of databases from which you can import
data.
The list of available database types for dataset creation actually depends on the JDBC
drivers that you have stored in the
<components_catalog_path>/.m2 folder.
Let's say that you have some customer data stored on an Oracle database, and you want to
import it in Talend Data Preparation to perform
cleansing operations. You will add a JDBC driver .jar file specific
to Oracle databases to the Components Catalog folder
structure to add this new source of data in the Talend Data Preparation interface.
Procedure
-
Download the latest Oracle JDBC driver called
ojdbc8-19.19.0.0.jar from MVN repository website.
-
Create the
<components_catalog_path>/.m2/com/oracle/database/jdbc/ojdbc8/19.19.0.0/
folder.
-
Copy the ojdbc8-19.19.0.0.jar in the newly created
folder.
-
Update the
<components_catalog_path>/config/jdbc_config.json file
by adding the following lines:
{
"id" : "ORACLE",
"class" : "oracle.jdbc.OracleDriver",
"url" : "jdbc:oracle:thin:@//<server ip>:<server port>/<database>",
"paths" :
[
{"path" : "mvn:com.oracle.database.jdbc/ojdbc8/19.19.0.0"}
]
}
Where:
-
id
is the value that will be displayed in the Talend Data Preparation interface as Database type.
-
class
is the driver class used to communicate with the
database.
-
url
is the URL template to access a database.
-
path
is the path of your database, it is identical to
the path mentioned in step 2.
Results
The Oracle database is now available in the database type
drop-down list in the import form.