From the Profiling perspective of Talend Studio, you can generate a column analysis on an ADLS Databricks file through Hive.
A JDBC connection is required to connect to Hive on Databricks.
To create a profiling analysis on an ADLS file, you must:
- Download the JDBC driver and add it to the Studio.
- Create a JDBC connection to Hive on the Databricks cluster.
- Create a column analysis with simple indicators on the table.
Note: These steps are described in the following procedures.
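For orientation, the JDBC connection to Hive on Databricks is typically defined by a single URL. The sketch below assembles such a URL in the form commonly used with the Simba Spark JDBC driver for Databricks. The `jdbc:spark://` prefix is an assumption tied to that driver (newer Databricks drivers use `jdbc:databricks://` instead), and the class name, host, HTTP path, and token are hypothetical placeholders to replace with the values shown for your own cluster.

```java
public final class DatabricksJdbcUrl {

    // Assembles a JDBC URL for a Databricks cluster.
    // Assumption: URL layout of the Simba Spark JDBC driver, with
    // personal access token authentication (AuthMech=3, UID=token).
    static String build(String host, String httpPath, String token) {
        return "jdbc:spark://" + host + ":443/default"
                + ";transportMode=http;ssl=1"
                + ";httpPath=" + httpPath
                + ";AuthMech=3;UID=token;PWD=" + token;
    }

    public static void main(String[] args) {
        // Hypothetical placeholder values, not a real workspace.
        String url = build(
                "adb-0000000000000000.0.azuredatabricks.net",
                "sql/protocolv1/o/0/0000-000000-example0",
                "<personal-access-token>");
        System.out.println(url);
    }
}
```

The resulting string is the kind of value that goes into the JDBC URL field when you create the connection in the Studio. Check the documentation of the driver you downloaded for the exact prefix and parameters.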
What to do next
You can modify the analysis settings and add other indicators as needed. You can also create other analyses later on this ADLS file by using the same Hive table.
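Before adding other indicators, it can help to see what simple indicators report for a column. The following is a minimal, illustrative sketch in plain Java, not the way the Studio computes them. The class and key names are hypothetical, and the definitions are assumptions: null values are excluded from the distinct, unique, and duplicate counts, and a blank is a non-null value that is empty or contains only whitespace.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public final class SimpleIndicators {

    // Computes basic counts for one column of string values.
    static Map<String, Long> compute(List<String> column) {
        Map<String, Long> frequencies = new LinkedHashMap<>();
        long nulls = 0;
        long blanks = 0;
        for (String value : column) {
            if (value == null) {
                nulls++;
                continue; // nulls are not counted as values (assumption)
            }
            if (value.trim().isEmpty()) {
                blanks++;
            }
            frequencies.merge(value, 1L, Long::sum);
        }
        // Unique: values seen exactly once. Duplicate: values seen more than once.
        long unique = frequencies.values().stream().filter(n -> n == 1).count();
        long duplicate = frequencies.values().stream().filter(n -> n > 1).count();

        Map<String, Long> result = new LinkedHashMap<>();
        result.put("rowCount", (long) column.size());
        result.put("nullCount", nulls);
        result.put("blankCount", blanks);
        result.put("distinctCount", (long) frequencies.size());
        result.put("uniqueCount", unique);
        result.put("duplicateCount", duplicate);
        return result;
    }

    public static void main(String[] args) {
        List<String> column = Arrays.asList("a", "b", "a", null, " ", "c");
        System.out.println(compute(column));
    }
}
```

In the Studio, these figures appear in the analysis results for each analyzed column, so comparing them against a small known sample like this one is a quick way to check that the Hive table exposes the file as expected.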