StepĀ 1: Configuring CDC - 6.4

Change Data Capture

author
Talend Documentation Team
EnrichVersion
6.4
EnrichProdName
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Data Integration
Talend Data Management Platform
Talend Data Services Platform
Talend ESB
Talend MDM Platform
Talend Real-Time Big Data Platform
task
Data Governance > Third-party systems > Database components > Change Data Capture
Data Quality and Preparation > Third-party systems > Database components > Change Data Capture
Design and Development > Third-party systems > Database components > Change Data Capture
EnrichPlatform
Talend Studio

About this task

Before being able to retrieve modified data from the LEADFACT table, you must:

Procedure

  1. first set up the DB connection dedicated to CDC,
  2. second, set up a DB connection to the source data and identify the table to catch,
  3. finally, set the connection between the CDC and the data.
    To do that:
  4. In the Repository tree view and under Metadata, create a connection to your database dedicated to CDC, cdc_publisher in this scenario.
    Note:

    Ensure that the DB connection for CDC is on the same server with the source data to which changes are to be captured.

  5. In the Repository tree view and under Metadata, create a connection to the source data warehouse and identify the table to catch, LEADFACT in this scenario.
  6. Right-click the connection to the orcle database and select Retrieve schema from the drop-down menu to retrieve the schema of the table to catch.
  7. Right-click CDC Foundation of the cdc-data connection and select Create CDC in the drop-down menu.
    The [Create Change Data Capture] dialog box displays
  8. Click the three-dot button next to the Set link Connection field and select the connection that corresponds to CDC, cdc_publisher in this example.
  9. In the Options area, select the Log mode check box.
  10. Click Create Subscriber. The [Create Subscriber and Execute SQL Script] dialog box displays.
  11. Click Execute and then Close to close the dialog box.
  12. Click Finish to validate the creation of the subscriber table.
    In the CDC Foundation folder, the relevant subscriber table displays.
    You must specify which table the subscriber wants to subscribe to and then
  13. Right-click the LEADFACT schema of the cdc-data connection and select Add CDC in the drop-down list. The [Create Subscriber and Execute SQL Script] dialog box displays.
    Note:

    For Oracle databases and in Talend Studio version 3.2 or subsequent versions, the CDC system creates an alias for the source table(s) monitored. This avoids problems relating to identifier length upon creation of the table of changes and its associated view. For CDC systems which are already in place, the table names are maintained.

  14. Click Execute and then Close to validate the subscription.
    In the CDC Foundation folder, the two created tables display and the schema node of the catched table is marked with a green CDC symbol.