tRSSInput - 6.1

Talend Components Reference Guide

Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for Data Quality
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
Data Governance
Data Quality and Preparation
Design and Development
Talend Studio

tRSSInput Properties

Component family




Function

tRSSInput reads RSS feeds using URLs.


Purpose

tRSSInput makes it possible to keep track of blog entries on websites, gathering and organizing the information for quick and easy access.

Basic settings

Schema and Edit Schema

A schema is a row description. It defines the number of fields to be processed and passed on to the next component.

The tRSSInput component has a read-only schema made up of four columns: TITLE, DESCRIPTION, PUBDATE, and LINK.
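
To make the four-column schema concrete, the following Python sketch maps each RSS 2.0 `<item>` element to a row with those columns. It only illustrates the row shape and is not the component's implementation. The `RssRow` type, the `rows_from_feed` helper, and the sample feed are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import List, NamedTuple, Optional


class RssRow(NamedTuple):
    """One row in the shape of the read-only tRSSInput schema."""
    TITLE: Optional[str]
    DESCRIPTION: Optional[str]
    PUBDATE: Optional[str]
    LINK: Optional[str]


def rows_from_feed(xml_text: str) -> List[RssRow]:
    """Turn every <item> of an RSS 2.0 document into an RssRow."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.iter("item"):
        rows.append(RssRow(
            TITLE=item.findtext("title"),
            DESCRIPTION=item.findtext("description"),
            PUBDATE=item.findtext("pubDate"),
            LINK=item.findtext("link"),
        ))
    return rows


# Made-up sample feed, used only to show the row shape.
SAMPLE_FEED = """<rss version="2.0"><channel>
  <title>Example blog</title>
  <item>
    <title>First post</title>
    <description>Hello</description>
    <pubDate>Mon, 21 Jul 2008 10:00:00 GMT</pubDate>
    <link>http://example.com/first</link>
  </item>
</channel></rss>"""

for row in rows_from_feed(SAMPLE_FEED):
    print(row.TITLE, row.PUBDATE, row.LINK)
```

A child element that is missing from an item yields `None` in the corresponding column of this sketch.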



URL

Enter the URL of the RSS feed to read.


Read articles from

If selected, tRSSInput reads only the articles published on the RSS feed from the date set with the three-dot [...] button next to the date-time field.


Max number of articles

If selected, tRSSInput reads no more articles than the number entered in the max amount field.
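
The combined effect of the Read articles from and Max number of articles options can be sketched in Python as follows. This is an illustration under stated assumptions, not the component's actual logic: it assumes that the start date is inclusive and that the date filter is applied before the cap. The `select_articles` helper and the sample data are hypothetical.

```python
from datetime import datetime
from typing import List, Optional, Tuple

Article = Tuple[str, datetime]  # (title, publication date); hypothetical shape


def select_articles(articles: List[Article],
                    read_from: Optional[datetime] = None,
                    max_articles: Optional[int] = None) -> List[Article]:
    """Apply the two optional limits, keeping the feed order.

    Assumptions: the start date is inclusive, and the date filter
    is applied before the cap on the number of articles.
    """
    selected = []
    for title, published in articles:
        if read_from is not None and published < read_from:
            continue  # older than the "Read articles from" date
        if max_articles is not None and len(selected) >= max_articles:
            break  # "Max number of articles" reached
        selected.append((title, published))
    return selected


# Made-up articles, newest first, as a blog feed usually lists them.
feed = [
    ("Post C", datetime(2008, 7, 25)),
    ("Post B", datetime(2008, 7, 22)),
    ("Post A", datetime(2008, 7, 10)),
]
print(select_articles(feed, read_from=datetime(2008, 7, 20), max_articles=2))
```

Leaving either argument as `None` corresponds to clearing the matching check box.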


Die on error

This check box is selected by default, so the Job stops when an error occurs. Clear the check box to skip the row on error and complete the process for error-free rows.
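
The difference between the two settings can be sketched in Python as follows. This is a generic illustration of stop-on-error versus skip-on-error behavior, not the code Talend generates. The `process_rows` and `to_upper` functions are hypothetical.

```python
from typing import Callable, Iterable, List


def process_rows(rows: Iterable[str],
                 handle: Callable[[str], str],
                 die_on_error: bool = True) -> List[str]:
    """Process rows, either stopping at the first failure or skipping it."""
    results = []
    for row in rows:
        try:
            results.append(handle(row))
        except Exception:
            if die_on_error:
                raise  # check box selected: the first error ends the run
            # check box cleared: drop the failing row and carry on
    return results


def to_upper(row: str) -> str:
    # Hypothetical per-row step that rejects empty rows.
    if not row:
        raise ValueError("empty row")
    return row.upper()


print(process_rows(["a", "", "b"], to_upper, die_on_error=False))  # ['A', 'B']
```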

Global Variables

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the component has a Die on error check box and that check box is cleared.

NB_LINE: the number of rows read by an input component or transferred to an output component. This is an After variable and it returns an integer.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill in a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see the Talend Studio User Guide.
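
The timing of an After variable such as NB_LINE can be sketched in Python as follows. This is only an analogy: the `global_map` dictionary stands in for the Job's variable store, the `read_rows` generator is hypothetical, and the key `tRSSInput_1_NB_LINE` assumes a component instance named `tRSSInput_1`.

```python
from typing import Dict, Iterable, Iterator, List

global_map: Dict[str, object] = {}  # stand-in for the Job's variable store


def read_rows(name: str, rows: Iterable[str]) -> Iterator[str]:
    """Yield rows one by one, then publish the row count."""
    count = 0
    for row in rows:
        count += 1
        yield row
    # After variable: published only once every row has been read.
    global_map[f"{name}_NB_LINE"] = count


available_during_flow: List[bool] = []
for row in read_rows("tRSSInput_1", ["entry 1", "entry 2"]):
    # While rows are still flowing, NB_LINE has not been set yet.
    available_during_flow.append("tRSSInput_1_NB_LINE" in global_map)

print(available_during_flow)              # [False, False]
print(global_map["tRSSInput_1_NB_LINE"])  # 2
```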


Usage

This component is generally used as an input component. It requires an output component.


Limitation

Due to license incompatibility, one or more JARs required to use this component are not provided. You can install the missing JARs for this particular component by clicking the Install button on the Component tab view. You can also find and add all missing JARs on the Modules tab in the Integration perspective of your Studio. For details, see the section describing how to configure the Studio in the Talend Installation Guide.

Scenario: Fetching frequently updated blog entries

This two-component scenario aims at retrieving frequently updated blog entries from a Talend local news RSS feed using the tRSSInput component.

  1. Drop the following components from the Palette onto the design workspace: tRSSInput and tLogRow.

  2. Right-click tRSSInput and connect it to tLogRow using a Row > Main link.

  3. In the design workspace, select tRSSInput, and click the Component tab to define the basic settings for tRSSInput.

  4. Enter the URL of the RSS feed to access. In this scenario, tRSSInput links to the Talend RSS feed.

  5. Select or clear the other check boxes as required. In this scenario, we want to display two articles published from July 20, 2008 onward, so select Read articles from and set that date, then select Max number of articles and enter 2.

  6. In the design workspace, select tLogRow and click the Component tab to define its basic settings. For more information about tLogRow properties, see tLogRow properties.

  7. Save the Job and press F6 to execute it.

    The tRSSInput component accesses the RSS feed of the Talend website on your behalf and organizes the information for you.

    Two blog entries are displayed on the console. Each entry has its own title, description, publication date, and link URL. Blogs list the most recent entry first, so you can scroll down to read earlier entries.