Accessing the selected data - 7.0

Processing (Integration)

EnrichVersion
7.0
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Open Studio for Big Data
Talend Open Studio for Data Integration
Talend Open Studio for ESB
Talend Open Studio for MDM
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Studio
task
Data Governance > Third-party systems > Processing components (Integration)
Data Quality and Preparation > Third-party systems > Processing components (Integration)
Design and Development > Third-party systems > Processing components (Integration)

Procedure

  1. Double-click tCacheOut to open its Component view.
    This component stores the selected data into the cache.
  2. Click the [...] button next to Edit schema to open the schema editor to verify the schema is identical to the input one. If not so, click Sync columns.
  3. On the output side of the schema editor, click the button to export the schema to the local file system and click OK to close the editor.
  4. From the Storage level list, select Memory only.

    For further information about each of the storage level, see https://spark.apache.org/docs/latest/programming-guide.html#rdd-persistence.

  5. Double-click tCacheIn to open its Component view.
  6. Click the [...] button next to Edit schema to open the schema editor and click the button to import the schema you exported in the previous step. Then click OK to close the editor.
  7. From the Output cache list, select the tCacheOut component from which you need to read the cached data. At runtime, this data is loaded into the lookup flow of the Subjob that is used to process the web-click log.