This chapter takes the example of a company that provides movie rental and streaming video services, and shows how such a company could make use of Talend Real-Time Big Data Platform.
You will work with data about movies and directors and data about your customers as you learn how to:
- validate email addresses for customers and standardize phone numbers before sending them to the Customer Support System
- upload data stored in a local file system to the HDFS file system of the company's Hadoop cluster
- join the director data to the movie data to produce a new dataset and store this dataset in the HDFS system too