Apache Cassandra NoSQL Database - Import - 7.1

Talend Data Catalog Bridges

author
Talend Documentation Team
EnrichVersion
7.1
EnrichProdName
Talend Big Data Platform
Talend Data Fabric
Talend Data Management Platform
Talend Data Services Platform
Talend MDM Platform
Talend Real-Time Big Data Platform
EnrichPlatform
Talend Data Catalog

Bridge Specifications

Vendor Apache
Tool Name Cassandra NoSQL Database
Tool Version Cassandra 3.x to 4.x
Tool Web Site http://cassandra.apache.org/
Supported Methodology [NoSQL Database] Data Store (NoSQL / Hierarchical, Physical Data Model) via JDBC API
Incremental Harvesting
Remote Repository Browsing for Model Selection
Data Profiling
Multi-Model Harvesting

SPECIFICATIONS
Tool: Apache / Cassandra NoSQL Database version Cassandra 3.x to 4.x via JDBC API
See http://cassandra.apache.org/
Metadata: [NoSQL Database] Data Store (NoSQL / Hierarchical, Physical Data Model)
Component: ApacheCassandra version 11.0.0

OVERVIEW
This bridge establishes a connection to the Cassandra server using native Java driver in order to extract the physical metadata.
REQUIREMENTS
n/a

FREQUENTLY ASKED QUESTIONS
n/a

LIMITATIONS
Refer to the current general known limitations at http://metaintegration.com/Products/MIMB/MIMBKnownLimitations.html or bundled in Documentation/ReadMe/MIMBKnownLimitations.html

SUPPORT
Provide a trouble shooting package with:
- the debug log (can be set in the UI or in conf/conf.properties with MIR_LOG_LEVEL=6)
- the metadata backup if available (can be set in the Miscellaneous parameter with option -backup)


Bridge Parameters

Parameter Name Description Type Values Default Scope
Java library directory A directory containing the JAR files necessary to access Cassandra.
Typically it's located in Cassandra installation directory, e.g.
~\DataStax-DDC\apache-cassandra\lib

"Shaded" version of the driver JAR should be put to the directory also, e.g.
cassandra-driver-core-3.4.0-shaded.jar
DIRECTORY     Mandatory
Host Enter the server address where Cassandra server is running. STRING      
Port Enter the port Cassandra server uses. STRING      
User The Cassandra user name on whose behalf the connection is being made. STRING      
Password The Cassandra user's password on whose behalf the connection is being made PASSWORD      
Keyspace Cassandra keyspace to be imported. REPOSITORY_SUBSET      
Use SSL protocol to connect Set this parameter to True when the Apache Cassandra uses TLS/SSL to encrypt Cassandra's network traffic.

Cassandra uses SSL to encrypt connections between the server and clients

If you need to ensure that the bridge is able to validate the SSL certificate presented by the server you need to set several JVM system properties using the -j option in the Miscellaneous parameter
BOOLEAN
False
True
False  
Miscellaneous Specify miscellaneous options identified with a -option followed by a value if required:

GENERAL OPTIONS
-m <Java Memory's maximum size>
1G by default on 64bits JRE or as set in conf/conf.properties, e.g.
-m 8G
-m 8000M

-j <Java Runtime Environment command line options>
This option must be the last one in the Miscellaneous parameter as all the text after -j is passed "as is" to the JRE, e.g.
-j -Dname=value -Xms1G
The following option must be set when a proxy is used to access internet (this is critical to access https://repo.maven.apache.org/maven2/ (and exceptionally a few other tool sites) in order to download the necessary third party software libraries.
-j -Dhttp.proxyHost=127.0.0.1 -Dhttp.proxyPort=3128 -Dhttps.proxyHost=127.0.0.1 -Dhttps.proxyPort=3128 -Dhttp.proxyUser=user -Dhttp.proxyPassword=pass -Dhttps.proxyUser=user -Dhttps.proxyPassword=pass

-jre <Java Runtime Environment full path name>
It can be an absolute path to javaw.exe on Windows or a link/script path on Linux, e.g.
-jre "c:\Program Files\Java\jre1.8.0_211\bin\javaw.exe"

-v <Environment variable value>
None by default, e.g.
-v var1=value1 -v var2="value2 with spaces"

-model.name <model name>
Override the model name, e.g.
-model.name "My Model Name"

-prescript <script name>
The script must be located in the bin directory, and have .bat or .sh extension.
The script path must not include any parent directory symbol (..).
The script should return exit code 0 to indicate success, or another value to indicate failure.
For example:
-prescript "script.bat arg1 arg2"

-cache.clear
Clears the cache before the import, and therefore will run a full import without incremental harvesting.
Warning: this is a system option managed by the application calling the bridge and should not be set by users.
STRING      

 

Bridge Mapping

Mapping information is not available