HBase, Cassandra and Accumulo are examples of column-family databases.
HBase is an open source Apache project suited to handling very large datasets. It is designed to scale out horizontally across distributed clusters. HBase is based on a column-oriented key/value data model: it provides flexible, structured storage for very large tables in a BigTable-like format. This column store is written in Java and uses the Hadoop Distributed File System (HDFS) for storage.
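To make the column-oriented key/value model more concrete, the following is a minimal in-memory sketch of a BigTable-like table. It is not the HBase API: the class `ToyColumnFamilyTable` and its methods `put`, `get` and `scan` are hypothetical names introduced only for illustration. The sketch assumes that column families are fixed when the table is created, that qualifiers inside a family can be added freely, that each cell keeps several timestamped versions, and that rows are read back in key order.

```python
from collections import defaultdict
import itertools


class ToyColumnFamilyTable:
    """Toy in-memory model of a BigTable-like table (hypothetical, not the HBase API)."""

    def __init__(self, families):
        # Assumption: column families are declared once, at table creation.
        self.families = set(families)
        # row key -> "family:qualifier" -> list of (timestamp, value), newest first
        self._rows = defaultdict(dict)
        # Logical clock standing in for real timestamps in this sketch.
        self._clock = itertools.count(1)

    def put(self, row, family, qualifier, value):
        """Write a new version of one cell and return its logical timestamp."""
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        ts = next(self._clock)
        versions = self._rows[row].setdefault(f"{family}:{qualifier}", [])
        versions.insert(0, (ts, value))  # keep the newest version first
        return ts

    def get(self, row, family, qualifier):
        """Return the newest value of one cell, or None if the cell is absent."""
        versions = self._rows.get(row, {}).get(f"{family}:{qualifier}")
        return versions[0][1] if versions else None

    def scan(self, start_row, stop_row):
        """Yield (row, {column: newest value}) for start_row <= row < stop_row, in key order."""
        for row in sorted(self._rows):
            if start_row <= row < stop_row:
                yield row, {col: v[0][1] for col, v in self._rows[row].items()}


table = ToyColumnFamilyTable(families=["info", "stats"])
table.put("user#001", "info", "name", "Ada")
table.put("user#001", "stats", "logins", 3)
table.put("user#002", "info", "name", "Linus")
table.put("user#001", "stats", "logins", 4)  # a newer version shadows the older one
```

Here each value is addressed by the key (row, family:qualifier, timestamp), which is why the model is described as both column-oriented and key/value. A real column store persists this structure in distributed files rather than in a Python dictionary.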
180 A. Oussous et al.
Table 5 Overview of HBase, Cassandra and Accumulo features

| Column DB properties | HBase | Cassandra | Accumulo |
|---|---|---|---|
| Language | Java | Java | Java |
| Fault tolerance | Replication, partitioning | Replication, partitioning | Replication |
| Data model | BigTable | BigTable and Dynamo | BigTable |
| Community | Apache | Facebook | Apache |
| Protocol | Custom API, Thrift, REST | Thrift | Thrift |
| Data storage | HDFS | Inspired by Amazon’s Dynamo for storing data | HDFS |
| Query language | API calls, REST, XML, Thrift API | API calls, Thrift API | Java API, Thrift API, REST calls |
| MapReduce | Yes | Yes | Yes |
| Replication mode | Master-slave replication | Master-slave replication | Multi-master replication |
| Best for | Real-time access, bulk operations (indexing, …) | When you write more than you read (logging) | When you must use Java |