These properties are used to configure tCouchbaseOutput running in the Standard Job framework.
The Standard tCouchbaseOutput component belongs to the Databases NoSQL family.
The component in this framework is available in all Talend products with Big Data and in Talend Data Fabric.
Enter the name or IP address of the node to be bootstrapped by the Couchbase SDK. Because Couchbase recommends specifying multiple bootstrap nodes, enter the names or IP addresses of these nodes in this field, separated by commas (,).
For further information about Couchbase bootstrapping, see How Couchbase SDKs connect to the cluster.
You can find the node names on the Servers page in your Couchbase Web Console. If you need further information, contact the administrator of your Couchbase cluster or consult your Couchbase documentation.
Note that the Couchbase servers do not support proxies; for this reason, the Couchbase components from Talend do not support proxies either.
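As an illustration of the comma-separated format expected for the bootstrap nodes, the following minimal Java sketch splits such a value into individual node addresses. It is not the code that the component or the Couchbase SDK runs; the class name, the method name and the host names are hypothetical and only serve the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, for illustration only: not part of Talend or the Couchbase SDK.
class BootstrapNodesSketch {

    // Splits a comma-separated list of node names or IP addresses,
    // trimming surrounding spaces and skipping empty entries.
    static List<String> parseNodes(String fieldValue) {
        List<String> nodes = new ArrayList<>();
        for (String part : fieldValue.split(",")) {
            String node = part.trim();
            if (!node.isEmpty()) {
                nodes.add(node);
            }
        }
        return nodes;
    }

    public static void main(String[] args) {
        // Hypothetical node names and address.
        String fieldValue = "node1.example.com,node2.example.com,192.0.2.10";
        System.out.println(parseNodes(fieldValue));
    }
}
```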
Username and Password
Provide the authentication credentials to your Couchbase cluster.
To enter the password, click the [...] button next to the password field, and then in the pop-up dialog box enter the password between double quotes and click OK to save the settings.
If you are using Couchbase V5.0 or later, enter the same value you put in the Bucket field as the password, because as of Couchbase V5.0 no password is associated with a bucket. On the Couchbase side, however, you need to create a user with the appropriate role to access the buckets.
For further information about access control and other important requirements on the Couchbase side, see the Couchbase release notes for your version.
Schema and Edit Schema
A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. When you create a Spark Job, avoid the reserved word line when naming the fields.
When using non-JSON documents, define an id column of the String type, then define a content column. The type of this content column should be String for the string documents and byte for the binary documents.
When it comes to JSON documents, define the fields that exist in your JSON documents.
Enter, within double quotation marks, the name of the data bucket in the Couchbase database.
Ensure that the credentials you are using have the appropriate rights and permissions to access this bucket.
If you are using Couchbase V5.0 or later, this bucket name is the user name you have created in the Security tab of your Couchbase UI.
Data stored in a Couchbase database can be JSON, strings or binary. From this drop-down list, select the type of the data you need to use with Couchbase.
Note that it is not recommended to mix JSON, binary and string documents in the same bucket, as this mixture could make the document processing error-prone.
Field to use as ID
Enter, without double quotation marks, the name of the column from the schema to provide IDs for the documents to be written to Couchbase.
Select this check box to update only a subset of a document, without changing any other property that is not provided by the incoming data.
If you leave this check box clear, when a document already exists in the database, that is to say, when this document and a document from the incoming data have the same ID, the whole existing document is replaced with the incoming one.
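The difference between a partial update and a full replacement can be sketched with plain Java maps standing in for documents. This is only an illustration of the behavior described above under simplifying assumptions: documents are treated as flat maps of top-level properties, and the class and method names are hypothetical. It does not show how the component itself is implemented.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration: documents are modeled as flat maps of properties.
class PartialUpdateSketch {

    // Partial update: properties absent from the incoming data are kept,
    // properties present in the incoming data are overwritten.
    static Map<String, Object> partialUpdate(Map<String, Object> existing,
                                             Map<String, Object> incoming) {
        Map<String, Object> result = new LinkedHashMap<>(existing);
        result.putAll(incoming);
        return result;
    }

    // Full replacement: the incoming document replaces the existing one entirely.
    static Map<String, Object> replace(Map<String, Object> existing,
                                       Map<String, Object> incoming) {
        return new LinkedHashMap<>(incoming);
    }

    public static void main(String[] args) {
        Map<String, Object> stored = new LinkedHashMap<>();
        stored.put("name", "Alice");
        stored.put("city", "Paris");

        Map<String, Object> incoming = new LinkedHashMap<>();
        incoming.put("city", "Lyon");

        System.out.println(partialUpdate(stored, incoming)); // {name=Alice, city=Lyon}
        System.out.println(replace(stored, incoming));       // {city=Lyon}
    }
}
```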
Use N1QL Query with parameters
Select this check box to use variables in your N1QL queries. Once it is selected, the Query field and the Query Parameters table are displayed for you to enter your query and define the variables to be used in your query.
Only one query is allowed per tCouchbaseOutput.
For example, enter this query in the Query field:
Then you need to define all of the variables (the strings starting with $) used in this query in the Query Parameters table.
This table creates a map between the variables in your query and the columns from the schema you have defined in the component for your data. The values in the Column column are the column names from this schema; the values in the Query Parameter Name column are the variables from your query.
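The mapping between query variables and schema columns can be sketched in Java as follows. Everything in this sketch is an assumption made for illustration: the query text, the bucket name demo, the column names, and the choice to write parameter names without the leading $ are all hypothetical, and the code does not reproduce what the component does internally. It only shows the idea that each variable in the query must be mapped to a column whose value is taken from the incoming row.

```java
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration of the Query Parameters table, not the component's code.
class QueryParameterSketch {

    // Collects the variables (the strings starting with $) used in a query.
    static Set<String> variables(String query) {
        Set<String> names = new LinkedHashSet<>();
        Matcher m = Pattern.compile("\\$(\\w+)").matcher(query);
        while (m.find()) {
            names.add(m.group(1));
        }
        return names;
    }

    // Resolves every query variable to the value of its mapped schema column.
    // Fails if a variable used in the query has no row in the mapping.
    static Map<String, Object> bind(String query,
                                    Map<String, String> parameterToColumn,
                                    Map<String, Object> row) {
        Map<String, Object> bound = new LinkedHashMap<>();
        for (String name : variables(query)) {
            String column = parameterToColumn.get(name);
            if (column == null) {
                throw new IllegalArgumentException("No column mapped for $" + name);
            }
            bound.put(name, row.get(column));
        }
        return bound;
    }

    public static void main(String[] args) {
        // Hypothetical query and bucket name.
        String query = "UPDATE `demo` SET city = $city WHERE META().id = $id";

        // Query Parameter Name -> Column (hypothetical column names).
        Map<String, String> parameterToColumn = new LinkedHashMap<>();
        parameterToColumn.put("city", "customer_city");
        parameterToColumn.put("id", "customer_id");

        // One incoming row, keyed by schema column name.
        Map<String, Object> row = new LinkedHashMap<>();
        row.put("customer_id", "c-001");
        row.put("customer_city", "Lyon");

        System.out.println(bind(query, parameterToColumn, row)); // {city=Lyon, id=c-001}
    }
}
```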
Select this check box to collect log data at the component level.
Use custom connection parameters
Select this option to set connection parameters, including Connection timeout, Query timeout, Analytics timeout, and Query threshold.
To set a connection parameter, add a row to the Connection parameters table by clicking the plus button at the bottom of the table. Then click the Parameter name column, select the desired parameter from the drop-down list, and enter the parameter value in the Parameter value column.
NB_LINE: the number of rows read by an input component or transferred to an output component. This is an After variable and it returns an integer.
NB_SUCCESS: the number of rows successfully processed. This is an After variable and it returns an integer.
NB_REJECT: the number of rows rejected. This is an After variable and it returns an integer.
ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the Die on error check box is cleared, if the component has this check box.
A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.
To fill a field or expression with a variable, press Ctrl + Space to access the variable list and choose the variable to use from it.
For further information about variables, see the Talend Studio User Guide.
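In the Java code of a Job, After variables are typically read from the globalMap object, under a key built from the component name and the variable name. The sketch below simulates globalMap with a plain HashMap so that it runs outside Talend Studio; the component name tCouchbaseOutput_1, the row counts and the helper class are assumptions made for the example.

```java
import java.util.HashMap;
import java.util.Map;

// Illustration only: globalMap is simulated here; in a Job it is provided by the Studio.
class AfterVariableSketch {

    // Builds a short summary from the After variables of one component.
    static String summary(Map<String, Object> globalMap, String componentName) {
        Integer total = (Integer) globalMap.get(componentName + "_NB_LINE");
        Integer written = (Integer) globalMap.get(componentName + "_NB_SUCCESS");
        Integer rejected = (Integer) globalMap.get(componentName + "_NB_REJECT");
        return total + " rows, " + written + " written, " + rejected + " rejected";
    }

    public static void main(String[] args) {
        // Simulated values, as they might be available after the component has run.
        Map<String, Object> globalMap = new HashMap<>();
        globalMap.put("tCouchbaseOutput_1_NB_LINE", 3);
        globalMap.put("tCouchbaseOutput_1_NB_SUCCESS", 2);
        globalMap.put("tCouchbaseOutput_1_NB_REJECT", 1);

        System.out.println(summary(globalMap, "tCouchbaseOutput_1"));
    }
}
```

Because these are After variables, such code belongs in a component that runs after tCouchbaseOutput has finished, for example one linked to it with an OnSubjobOk trigger.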
Preceded by an input component, tCouchbaseOutput wraps flat data into documents for storage in the Couchbase database.