Talend Real-Time Big Data Platform in use - 6.5

Talend Real-Time Big Data Platform Getting Started Guide

author
Talend Documentation Team
EnrichVersion
6.5
EnrichProdName
Talend Real-Time Big Data Platform
task
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade

This chapter takes the example of a company that provides movie rental and streaming video services, and shows how such a company could make use of Talend Real-Time Big Data Platform.

You will work with data about movies and directors and data about your customers as you learn how to:

  • validate email addresses for customers and standardize phone numbers before sending them to the Customer Support System
  • upload data stored in a local file system to the HDFS file system of the company's Hadoop cluster
  • join the director data to the movie data to produce a new dataset and store this dataset in the HDFS system too