tCouchbaseOutput Standard properties - 7.3

Couchbase

Version
7.3
Language
English
Product
Talend Big Data
Talend Big Data Platform
Talend Data Fabric
Talend Real-Time Big Data Platform
Module
Talend Studio
Last publication date
2024-02-21

These properties are used to configure tCouchbaseOutput running in the Standard Job framework.

The Standard tCouchbaseOutput component belongs to the Databases NoSQL family.

The component in this framework is available in all Talend products with Big Data and in Talend Data Fabric.

Basic settings

Bootstrap nodes

Enter the name or IP address of the node to be bootstrapped by the Couchbase SDK. Because Couchbase recommends specifying multiple bootstrap nodes, enter the names or IP addresses of these nodes in this field, separated by commas (,).

For further information about Couchbase bootstrapping, see How Couchbase SDKs connect to the cluster.

You can find the node names on the Servers page in your Couchbase Web Console. If you need further information, contact the administrator of your Couchbase cluster or consult your Couchbase documentation.

Note that the Couchbase servers do not support proxies; for this reason, the Couchbase components from Talend do not support proxies either.
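
As a minimal sketch of the comma-separated format described for the Bootstrap nodes field, the following Java snippet splits such a value into individual node names. The class name, the method name and the host names are hypothetical and used only for illustration; this is not how the component itself parses the field.

```java
import java.util.ArrayList;
import java.util.List;

public class BootstrapNodesSketch {
    // Splits a comma-separated node list, trimming whitespace and skipping empty entries.
    public static List<String> parseNodes(String bootstrapNodes) {
        List<String> nodes = new ArrayList<>();
        for (String part : bootstrapNodes.split(",")) {
            String node = part.trim();
            if (!node.isEmpty()) {
                nodes.add(node);
            }
        }
        return nodes;
    }

    public static void main(String[] args) {
        // Hypothetical host names and IP address, not taken from any real cluster.
        String field = "cb-node1.example.com,cb-node2.example.com,192.0.2.10";
        for (String node : parseNodes(field)) {
            System.out.println(node);
        }
    }
}
```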

Username and Password

Provide the authentication credentials to your Couchbase cluster.

To enter the password, click the [...] button next to the password field, and then in the pop-up dialog box enter the password between double quotes and click OK to save the settings.

If you are using Couchbase V5.0 or later, enter as the password the same value you put in the Bucket field, because since Couchbase V5.0 no password is associated with a bucket. On the Couchbase side, however, you need to create a user with the appropriate role to access the buckets.

For further information about access control and other important requirements on the Couchbase side, see the Couchbase release notes for your version.

Schema and Edit Schema

A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. When you create a Spark Job, avoid the reserved word line when naming the fields.

When using non-JSON documents, define an id column of the String type, then define a content column. The type of this content column should be String for the string documents and byte[] for the binary documents.

When it comes to JSON documents, define the fields that exist in your JSON documents.

Bucket

Enter, within double quotation marks, the name of the data bucket in the Couchbase database.

Ensure that the credentials you are using have the appropriate rights and permissions to access this bucket.

If you are using Couchbase V5.0 or later, this bucket name is the user name you have created in the Security tab of your Couchbase UI.

Document type

Data stored in a Couchbase database can be JSON, strings or binary. From this drop-down list, select the type of the data you need to use with Couchbase.

Note that mixing JSON, binary and string documents in the same bucket is not recommended, because the mixture can make document processing error-prone.

Field to use as ID

Enter, without double quotation marks, the name of the column from the schema to provide IDs for the documents to be written to Couchbase.

Partial update

Select this check box to update only a subset of a document, without changing any other property that is not provided by the incoming data.

If you leave this check box clear and a document already exists in the database, that is to say, this document and a document from the incoming data have the same ID, the whole existing document is replaced with the incoming one.
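
The difference between the two behaviors can be illustrated with a minimal Java sketch that models a document as a map of top-level properties. The class and method names are hypothetical, the sample values are invented, and the sketch does not show how the component or the Couchbase SDK performs the write; in particular, it says nothing about how nested properties are handled.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class PartialUpdateSketch {
    // Partial update: incoming properties overwrite or extend the existing document;
    // properties missing from the incoming data are left unchanged.
    public static Map<String, Object> partialUpdate(Map<String, Object> existing,
                                                    Map<String, Object> incoming) {
        Map<String, Object> result = new LinkedHashMap<>(existing);
        result.putAll(incoming);
        return result;
    }

    // Full replacement: the existing document is discarded in favor of the incoming one.
    public static Map<String, Object> replace(Map<String, Object> existing,
                                              Map<String, Object> incoming) {
        return new LinkedHashMap<>(incoming);
    }

    public static void main(String[] args) {
        // Invented sample document and incoming row sharing the same ID.
        Map<String, Object> existing = new LinkedHashMap<>();
        existing.put("name", "Example Air");
        existing.put("country", "France");

        Map<String, Object> incoming = new LinkedHashMap<>();
        incoming.put("country", "Spain");

        System.out.println("Partial update: " + partialUpdate(existing, incoming));
        System.out.println("Replacement:    " + replace(existing, incoming));
    }
}
```

In this sketch, the partial update keeps the name property and changes only country, while the replacement leaves a document that contains country alone.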

Use N1QL Query with parameters

Select this check box to apply variables in your N1QL queries. Once you select it, the Query field and the Query Parameters table are displayed for you to enter your query and define the variables to be used in your query.

Only one query is allowed per tCouchbaseOutput.

For example, enter this query in the Query field:
INSERT INTO `travel-sample` (KEY, VALUE)
VALUES 
($nm, 
   {
    "name":$nm, 
    "type":$tp, 
    "country":$cnty, 
    "callsign":$call,  
    "id":$zid
   }
)

Then you need to define all of the variables (the strings starting with $) used in this query in the Query Parameters table.

Query Parameter Name     Column
nm                       name
tp                       type
cnty                     countries
call                     company
zid                      docid

This table creates a map between the variables in your query and the columns from the schema you have defined in the component for your data. The values in the Column column are the column names from this schema; the values in the Query Parameter Name column are the variables from your query.
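
The mapping can be pictured with a minimal Java sketch that resolves each query parameter to the value of its mapped column for one incoming row. The class name, the method name and the row values are hypothetical; the sketch only models the lookup described above and does not reproduce the component's internal code.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class QueryParametersSketch {
    // For one row, returns the value bound to each query parameter,
    // following the Query Parameter Name -> Column mapping.
    public static Map<String, Object> bindParameters(Map<String, String> parameterToColumn,
                                                     Map<String, Object> row) {
        Map<String, Object> bound = new LinkedHashMap<>();
        for (Map.Entry<String, String> entry : parameterToColumn.entrySet()) {
            String column = entry.getValue();
            if (!row.containsKey(column)) {
                throw new IllegalArgumentException("No schema column named " + column);
            }
            bound.put(entry.getKey(), row.get(column));
        }
        return bound;
    }

    public static void main(String[] args) {
        // Same mapping as the Query Parameters table in the example.
        Map<String, String> parameterToColumn = new LinkedHashMap<>();
        parameterToColumn.put("nm", "name");
        parameterToColumn.put("tp", "type");
        parameterToColumn.put("cnty", "countries");
        parameterToColumn.put("call", "company");
        parameterToColumn.put("zid", "docid");

        // Invented row values, keyed by schema column name.
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("name", "Example Air");
        row.put("type", "airline");
        row.put("countries", "France");
        row.put("company", "EXAIR");
        row.put("docid", "airline_0001");

        System.out.println(bindParameters(parameterToColumn, row));
    }
}
```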

Advanced settings

tStatCatcher Statistics

Select this check box to collect log data at the component level.

Use custom connection parameters

Select this option to set connection parameters, including Connection timeout, Query timeout, Analytics timeout, and Query threshold.

To set a connection parameter, click the [+] button at the bottom of the Connection parameters table to add a row, click the Parameter name column and select the desired parameter from the drop-down list, and then enter the value in the Parameter value column.

Global Variables

Global Variables

NB_LINE: the number of rows read by an input component or transferred to an output component. This is an After variable and it returns an integer.

NB_SUCCESS: the number of rows successfully processed. This is an After variable and it returns an integer.

NB_REJECT: the number of rows rejected. This is an After variable and it returns an integer.

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the component has a Die on error check box and that check box is cleared.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.

For further information about variables, see Talend Studio User Guide.
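
As a minimal sketch of how After variables are typically read in Java code placed after the component, the following snippet looks up the counters in a map keyed by component name and variable name. It assumes the component instance is named tCouchbaseOutput_1 and uses a hand-filled HashMap as a stand-in for the globalMap that a running Job provides; the class name, the method name and the counter values are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

public class GlobalVariablesSketch {
    // Builds a short summary from the After variables of one component.
    public static String summary(Map<String, Object> globalMap, String componentName) {
        Integer nbLine = (Integer) globalMap.get(componentName + "_NB_LINE");
        Integer nbSuccess = (Integer) globalMap.get(componentName + "_NB_SUCCESS");
        Integer nbReject = (Integer) globalMap.get(componentName + "_NB_REJECT");
        return "rows=" + nbLine + ", success=" + nbSuccess + ", rejected=" + nbReject;
    }

    public static void main(String[] args) {
        // Stand-in for the Job's globalMap, filled by hand with invented values.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tCouchbaseOutput_1_NB_LINE", 10);
        globalMap.put("tCouchbaseOutput_1_NB_SUCCESS", 9);
        globalMap.put("tCouchbaseOutput_1_NB_REJECT", 1);

        System.out.println(summary(globalMap, "tCouchbaseOutput_1"));
    }
}
```

Because these are After variables, such a lookup is meaningful only once the component has finished executing, for example in a subJob triggered after it.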

Usage

Usage rule

Preceded by an input component, tCouchbaseOutput wraps flat data into documents for storage in the Couchbase database.