Setting up the Job - Cloud - 8.0

Azure Data Lake Store

Version: Cloud 8.0
Language: English
Product: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Data Integration, Talend Data Management Platform, Talend Data Services Platform, Talend ESB, Talend MDM Platform, Talend Open Studio for Big Data, Talend Open Studio for Data Integration, Talend Open Studio for ESB, Talend Real-Time Big Data Platform
Module: Talend Studio
Last publication date: 2023-06-07

About this task

  • This procedure is specific to Azure Data Lake Storage Gen2 used with Databricks.
  • You can create this Job under either the Big Data Batch or the Big Data Streaming node.

Procedure

  1. Drop the following components from the Palette onto the design workspace: tJava and tAzureFSConfiguration.
  2. Go to your Databricks account.
  3. On the Configuration tab of your Databricks cluster page, expand the Advanced options.
  4. In the Spark tab, add the following Spark property, replacing <storage_account> and <key> with your own storage account name and access key:
    spark.hadoop.fs.azure.account.key.<storage_account>.dfs.core.windows.net <key>

    This key is associated with the storage account to be used. You can find it in the Access keys blade of that storage account. Two keys are available for each account; by default, either of them can be used for this access.

    Ensure that the account to be used has the appropriate read and write permissions. For a standalone illustration of this property, see the sketch after this procedure.
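
For reference, the following minimal sketch shows the same property applied to a plain Spark session, outside Databricks and Talend Studio. The storage account name mystorage, the container name mycontainer, the sample path, and the ADLS_ACCOUNT_KEY environment variable are illustrative placeholders only, not values defined by this procedure; running it also assumes the hadoop-azure connector is on the classpath so that the abfss:// scheme resolves.

    import org.apache.spark.sql.SparkSession;

    public class AdlsGen2AccessCheck {
        public static void main(String[] args) {
            // Hypothetical values: replace "mystorage" and "mycontainer" with your
            // own storage account and container. The key is read from an
            // environment variable here instead of being hard-coded.
            String accountKey = System.getenv("ADLS_ACCOUNT_KEY");

            SparkSession spark = SparkSession.builder()
                    .appName("adls-gen2-access-check")
                    .master("local[*]")
                    // The same property the procedure adds in the Spark tab of the
                    // Databricks cluster configuration.
                    .config("spark.hadoop.fs.azure.account.key.mystorage.dfs.core.windows.net",
                            accountKey)
                    .getOrCreate();

            // If the key grants access, reading from the abfss:// URI succeeds;
            // an invalid key typically surfaces as an authentication error.
            spark.read()
                 .text("abfss://mycontainer@mystorage.dfs.core.windows.net/sample/")
                 .show(5);

            spark.stop();
        }
    }

If the property or the key is wrong, the read typically fails with an authentication error from the ABFS driver, which makes a small check like this a quick way to validate the cluster configuration before running the Talend Job.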