Opening a dataset from a local file - Cloud

Talend Cloud Data Preparation Getting Started Guide

Version: Cloud
Language: English
Product: Talend Cloud
Module: Talend Data Preparation
Content: Data Quality and Preparation > Cleansing data
Last publication date: 2024-03-05

You will now import the file containing the customer data and create your first preparation.

After logging in to Talend Cloud Data Preparation, you are directed to the Preparations view.

This view lists all your preparations, that is, the datasets on which you have started performing operations. It is empty for now, but this is where your work on the customer data will be saved. From this view, you can also add new preparations and organize them into folders.

To import the customer file containing the raw data, proceed as follows:

Before you begin

Download the file: customers.xlsx.
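
If you want to confirm that the download completed correctly before importing it, a quick local check can help. The following is a minimal sketch, not part of Talend Cloud Data Preparation: it relies only on the fact that an .xlsx file is a ZIP archive, and it assumes the workbook part sits at xl/workbook.xml, which is where Excel typically writes it. The function name looks_like_xlsx is hypothetical.

```python
import zipfile
from pathlib import Path


def looks_like_xlsx(path):
    """Return True if `path` is a ZIP archive that contains a workbook part.

    Hypothetical helper for a local sanity check; it does not validate
    the spreadsheet contents.
    """
    file_path = Path(path)
    # An .xlsx file is a ZIP container, so reject anything that is not one.
    if not file_path.is_file() or not zipfile.is_zipfile(file_path):
        return False
    with zipfile.ZipFile(file_path) as archive:
        # Assumption: the workbook part is stored at xl/workbook.xml,
        # the location Excel typically uses.
        return "xl/workbook.xml" in archive.namelist()
```

For example, calling looks_like_xlsx("customers.xlsx") from the folder where you saved the file should return True for an intact download, and False for a truncated file or a file saved in another format.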

Procedure

  1. From the left panel menu, select Datasets to open the list of datasets, which is also empty at this point.
  2. To import the customers.xlsx file that you downloaded earlier, you can either:
    • Click the Drop a file or browse button to browse your files and select the dataset.
    • Drag and drop the dataset directly onto the Talend Cloud Data Preparation screen.
    The Overview tab opens, where you can see information on your dataset.
  3. Open the Configure tab on the left side of the screen.
  4. In the Dataset name field, enter Customers.
  5. Leave the Connection field as it is; it is already filled in.
    Because this is your first local dataset import, a connection of the Local connection type is created during the process. This underlying connection is then used to upload all your local datasets.
  6. In the Format config section of the form, click the Auto detect button to automatically fill in the Format information.
  7. Click the View sample button to preview the first 50 rows of your dataset.
  8. Click Validate to finalize the dataset creation.
  9. From the left panel menu, select Preparations and click the Add preparation button.
  10. In the Add a preparation panel that opens, select the Customers dataset previously created.
    The Preparation name field is filled in automatically, but you can change it. In this example, Customers Preparation is used.
  11. Click Submit.

Results

Your dataset opens in the form of a preparation with an empty recipe. Your data has not been modified yet, but has been saved as a preparation on which you can start applying preparation steps.

Because you imported the customers.xlsx dataset and created the corresponding preparation using the Add preparation button, every change made to Customers Preparation is saved automatically. The raw dataset you imported remains unchanged and can still be viewed in the Datasets view.
