Extracting the replicated data of interest - 6.5

CouchDB

Author: Talend Documentation Team
Version: 6.5
Products: Talend Big Data, Talend Big Data Platform, Talend Data Fabric, Talend Open Studio for Big Data, Talend Real-Time Big Data Platform
Platform: Talend Studio

Procedure

  1. Double-click the tCouchDBInput component to open its Component view.
  2. Click Edit schema to define the data structure to be read from the CouchDB database.
    By default, the Include docs check box is selected, so the id, key, value and jsonDoc columns are available in the schema.
    In this example, we define four columns to be extracted: id, title, author and category.
  3. In the Server and Port fields, enter the host name or IP address of the CouchDB server and the port it listens on.
  4. In the Database field, enter the name of the database from which the replicated data will be read. In this example, it is bookstore_new.
  5. In the Querying options area, enter the start key and the end key that delimit the range of data to be read: "001" and "006" in this example.
  6. Select the Extract JSON field check box so that data can be extracted from a JSON field of the retrieved documents.
  7. Select jsonDoc from the JSON field list.
  8. In the Mapping area, click [+] once for each column to be extracted. For each row, select the schema output column from the list, then type in the XPath query that locates the corresponding value in the JSON document.
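
For readers who want to see what this configuration amounts to outside the Studio, the following Python sketch approximates it against CouchDB's HTTP API. It is an illustration only, not how tCouchDBInput is implemented: it assumes the data is read through the standard `_all_docs` endpoint with `startkey`, `endkey` and `include_docs`, that the server is reachable at `localhost` on CouchDB's default port 5984, and that title, author and category are top-level fields of each document. The function names and the sample response are hypothetical.

```python
import json
from urllib.parse import urlencode


def build_all_docs_url(server, port, database, start_key, end_key, include_docs=True):
    """Build a CouchDB _all_docs URL restricted to a key range (steps 3 to 5)."""
    params = {
        # CouchDB expects key parameters to be JSON-encoded, hence the quotes.
        "startkey": json.dumps(start_key),
        "endkey": json.dumps(end_key),
        # include_docs=true embeds each full document in its row.
        "include_docs": "true" if include_docs else "false",
    }
    return f"http://{server}:{port}/{database}/_all_docs?{urlencode(params)}"


def extract_columns(response, fields):
    """Pull the requested fields out of each row's embedded document (steps 6 to 8)."""
    records = []
    for row in response.get("rows", []):
        doc = row.get("doc") or {}
        record = {"id": row["id"]}
        for name in fields:
            # A field missing from the document is returned as None.
            record[name] = doc.get(name)
        records.append(record)
    return records


# Hypothetical response shaped like a CouchDB _all_docs reply; the values are made up.
sample_response = {
    "total_rows": 1,
    "offset": 0,
    "rows": [
        {
            "id": "001",
            "key": "001",
            "value": {"rev": "1-abc"},
            "doc": {
                "_id": "001",
                "_rev": "1-abc",
                "title": "Example title",
                "author": "Example author",
                "category": "Example category",
            },
        }
    ],
}

# localhost and 5984 are assumed values; bookstore_new, "001" and "006" come from the steps above.
url = build_all_docs_url("localhost", 5984, "bookstore_new", "001", "006")
records = extract_columns(sample_response, ["title", "author", "category"])
```

Here the list of field names plays the role of the Mapping table: each entry pairs an output column with the place in the JSON document where its value is found. To run the sketch against a live server, the sample response would be replaced by the parsed JSON body returned for `url`.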