Profiling an ADLS Databricks file - Cloud - 8.0

Talend Studio User Guide

Version
Cloud
8.0
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Cloud
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
Module
Talend Studio
Content
Design and Development
Last publication date
2024-02-29
Available in...

Big Data Platform

Cloud API Services Platform

Cloud Big Data Platform

Cloud Data Fabric

Cloud Data Management Platform

Data Fabric

Data Management Platform

Data Services Platform

MDM Platform

Real-Time Big Data Platform

From the Profiling perspective of Talend Studio, you can generate a column analysis on an ADLS Databricks file through Hive.

A JDBC connection is required to connect to Hive on Databricks.

Procedure

To create a profiling analysis on an ADLS file, you must:

  1. Download the JDBC driver and add it to Talend Studio.
  2. Create a JDBC connection to the ADLS cluster.
  3. Create a column analysis with simple indicators on the table and columns.
    Those steps are described in the following procedures.

What to do next

You can modify the analysis settings and add other indicators as needed. You can also create other analyses later on this ADLS file by using the same Hive table.