Functional architecture of Talend Big Data Platform - 7.3

Talend Big Data Platform Getting Started Guide

Version
7.3
Language
English
Operating system
Big Data Platform
Product
Talend Big Data Platform
Module
Talend Administration Center
Talend DQ Portal
Talend Installer
Talend Runtime
Talend Studio
Content
Data Quality and Preparation > Cleansing data
Data Quality and Preparation > Profiling data
Design and Development
Installation and Upgrade
Last publication date
2023-07-24

The Talend Big Data Platform functional architecture is an architectural model that identifies Talend Big Data Platform functions, interactions and corresponding IT needs. The overall architecture has been described by isolating specific functionalities in functional blocks.

The following chart illustrates the main architectural functional blocks.

The different types of functional blocks are:

  • From Talend Studio, you design and launch Big Data Jobs that leverage a Hadoop cluster to handle large data sets. Once launched, these Jobs are sent to, deployed on and executed on this Hadoop cluster.

    From the Studio, you can also use predefined patterns and indicators to analyze data stored in different data sources, browse and query analysis results and remove corrupt, incomplete or inaccurate data.

  • A Hadoop cluster independent of the Talend system to handle large data sets.
  • A Talend JobServer or Runtime installed inside or outside the Hadoop cluster to deploy and execute Jobs.

    For a Hortonworks cluster, it is recommended to install the JobServer or Runtime in the EDGE node machine in order to avoid potential firewall and access issues.

    For an Amazon EMR cluster, it is also recommended to install the JobServer or Runtime in the cluster.

  • From the Talend DQ Portal, you can generate reports on analysis results and share them with other business users.