Opening a dataset from a local file - Cloud

Talend Cloud Data Preparation Getting Started Guide

Version
Cloud
Language
English (United States)
Product
Talend Cloud
Module
Talend Data Preparation
Content
Data Quality and Preparation > Cleansing data

You will now import the file containing the customer data and create your first preparation.

After logging in Talend Cloud Data Preparation, you are directed to the Preparations view.

This view shows all your preparations, in other words datasets on which you have started performing operations. It is empty for now, but this is where your work on the customer data will be saved. In this view, you can also add new preparations and organize them into folders.

To import the customer file containing the raw data, proceed as follows:

Before you begin

Retrieve the customers.xlsx file from the Downloads tab of the menu located in the left panel of this page.

Procedure

  1. From the left panel menu, select Datasets to open the list of datasets, that is also empty at that moment.
  2. To import the customers.xlsx that you have previously downloaded, you can either:
    • Click the Drop a file or browse button to browse your files and select the dataset.
    • Directly drag and drop the dataset on the Talend Cloud Data Preparation screen.
    The Add a new dataset panel opens, where you can set the name and format of your dataset before importing it.
  3. In the Dataset name field, enter Customers.
  4. The Connection field is already filled in.
    Because this is your first local dataset import, a Local connection type connection will be created during the process. This underlying connection will be used to upload all your local datasets.
  5. Optionally enter a description for your dataset in the corresponding field.
  6. In the Format config section of the form, click the Auto detect button to automatically fill the Format information.
  7. Click the View sample button to preview the first 50 rows of your dataset.
  8. Click Validate to finalize the dataset creation.
  9. From the left panel menu, select Preparations and click the Add preparation button.
  10. In the Add a preparation panel that opens, give a name to your preparation, customers_preparation in this example, and from the Dataset list, select the Customers dataset previously created.
  11. Click Submit.

Results

Your dataset opens in the form of a preparation with an empty recipe. Your data has not been modified yet, but has been saved as a preparation on which you can start applying preparation steps.

Because you imported the customers.xlsx dataset, and created the corresponding preparation using the Add preparation button, every change made to customers_preparation will be automatically saved. As for the raw dataset you imported, it can be viewed in the Datasets view, and the data remains unchanged.