Cassandra Load Processor

Modified on Tue, 30 Nov 2021 at 10:31 AM

Overview

This processor loads a Dataset from a Key-Space of an Apache Cassandra database. The loaded data can be further pre-processed using a custom SQL query.


Cassandra database presentation

Cassandra is an open-source, distributed NoSQL database management system (DBMS) that provides high availability with no single point of failure thanks to its architecture.

This DBMS enables users to work with large amounts of data. ONE DATA lets the user load data from a specific table inside a predefined Key-Space.


Configuration

This processor does not need a Dataset from a preceding processor (i.e., it does not have an input node).

The Processor interface looks as follows:



WARNING: The table and Key-Space fields must NOT be blank and must NOT contain any whitespace.
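The rule in the warning above can be checked before the values are entered. The following is a minimal sketch in Python; the function name `is_valid_name` is hypothetical and is not part of ONE DATA, and the check covers only the two conditions stated in the warning (not blank, no whitespace).

```python
def is_valid_name(value: str) -> bool:
    """Return True if value is non-blank and contains no whitespace.

    Hypothetical helper for illustration only; it mirrors the two
    conditions from the warning and nothing else.
    """
    # An empty string is blank; a string of only spaces fails the
    # whitespace check, so both cases are rejected.
    return bool(value) and not any(ch.isspace() for ch in value)


# Example values (made up for illustration).
print(is_valid_name("sensor_readings"))  # no blanks, no whitespace
print(is_valid_name("sensor readings"))  # contains a space
print(is_valid_name(""))                 # blank
```

A name such as `sensor_readings` passes, while a blank value or one containing a space or tab is rejected.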

The output of this processor is the data from the table of interest, with the defined preprocessing applied (if one is provided in the third field).


Advanced Configuration

The "Query" field can be used to process the data of interest.

It supports SQL queries in general, for example grouping, counting, sorting, or filtering, which gives the user more control over the resulting data.
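To illustrate the kind of query meant here, the sketch below combines filtering, grouping, counting, and sorting in one SQL statement. It is run in Python against an in-memory SQLite table that stands in for the loaded data. The table name `orders`, its columns, and the rows are all hypothetical; how the "Query" field refers to the loaded table, and which SQL dialect it accepts, are not covered by this article and may differ from SQLite.

```python
import sqlite3

# Hypothetical rows standing in for a table loaded from Cassandra.
rows = [
    ("alice", "DE", 30.0),
    ("bob", "DE", 12.5),
    ("carol", "FR", 99.0),
    ("dave", "FR", 5.0),
    ("erin", "US", 42.0),
]

# SQLite is used only as a local stand-in so the query can be run.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, country TEXT, amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)

# Filter (WHERE), group and count (GROUP BY, COUNT), then sort (ORDER BY).
query = """
    SELECT country, COUNT(*) AS order_count, SUM(amount) AS total_amount
    FROM orders
    WHERE amount >= 10
    GROUP BY country
    ORDER BY total_amount DESC
"""
result = conn.execute(query).fetchall()
conn.close()

for country, order_count, total_amount in result:
    print(country, order_count, total_amount)
```

With the sample rows, the row with amount 5.0 is filtered out, the remaining rows are aggregated per country, and the countries are listed from the highest to the lowest total.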


Output

After locating the Dataset in the given Key-Space and applying the specified preprocessing, this processor generates a Dataset that can be used by other processors.


NOTE: This processor works only on certain specific instances.

