Awesome HBase

A curated list of awesome HBase projects and resources.
HBase is a distributed, scalable, big data store.
Contents
Projects
Clients
-
asynchbase
- Fully asynchronous, non-blocking HBase client.
-
gohbase
- Pure Go client for HBase.
-
happybase
- Python client for HBase.
Cloud
Frameworks
Datasets
-
Kite - High-level data layer for Hadoop/HBase.
Document
-
HDocDB
- HBase as a JSON document database.
Entity/JPA
-
DataNucleus - JPA persistence layer with support for HBase.
-
Gora - Persistence library for big data with support for HBase.
-
HBase ORM
- A production-grade HBase ORM library.
-
HEntityDB
- HBase as an entity database.
-
Kundera
- JPA client with support for HBase.
Geospatial
-
GeoMesa - Spatial-temporal database with support for Accumulo, HBase, Cassandra, and Kafka.
Graph
-
Gradoop
- Research framework for scalable graph analytics built on Flink and HBase.
-
HGraphDB
- HBase as a TinkerPop graph database.
-
HugeGraph
- A graph database that supports more than 10+ billion data, high performance and scalability.
-
JanusGraph - Scalable graph database with support for Cassandra, HBase, Google Cloud Bigtable, and BerkeleyDB.
-
NebulaGraph
- A high performance distributed Graph database.
-
S2Graph - High-performance distributed graph database built on HBase.
SQL/OLAP
-
AntsDB - AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase.
-
EsgynDB - Commercial SQL engine providing ACID transactions and BI analytics on top of Hadoop, based on Trafodian.
-
Kylin - Extreme OLAP engine for big data that stores data in HBase.
-
LeanXScale - Commercial full ACID full SQL product built on Hadoop/HBase.
-
Phoenix - SQL layer on top of HBase.
-
Splice Machine - Commercial RDBMS built on top of HBase.
-
Trafodian - Transactional SQL-on-Hadoop/HBase.
Time Series
-
Axibase - Distributed time series database built on HBase.
-
OpenTSDB - Scalable time series database built on HBase.
-
Warp 10 - Time series database for sensor data.
Infrastructure
Secondary Indices
Transactions
-
Haeinsa
- Multi-row/multi-table transaction library for HBase.
-
HBase-QoD
- Vector-field consistency for HBase fine-grained transactional inter-DC replication.
-
Omid
- Transactional support for HBase.
-
Tephra - Globally consistent transactions on top of HBase.
-
Themis
- Cross-row/cross-table transactions on HBase based on Google’s Percolator.
Integrations
-
Apex
- Apex-HBase connector.
-
Beam
- Beam HBase integration.
-
Camel - Camel HBase component.
-
Cascading
- HBase adapters for Cascading.
-
Cascalog
- Wrapper around Cascading.HBase for use in Cascalog.
-
Crunch
- HBase adapters for Crunch.
-
Drill - HBase storage plugin for Drill.
-
Elasticsearch
- Elasticsearch import river for HBase.
-
Flink
- Flink-HBase connector.
-
Gearpump
- Gearpump integration for HBase.
-
Giraph
- Giraph input and output formats for HBase.
-
HAWQ - HAWQ PXF external tables on HBase.
-
Hive - Hive HBase integration.
-
Impala - Impala support for querying HBase tables.
-
Kafka
- HBase Kafka proxy.
-
Pig
- Pig HBase integration.
-
Presto
- Presto-HBase connector.
-
Pulsar - HBase connector for Pulsar.
-
Ranger - HBase plugin for Apache Ranger.
-
Spark
- Spark-HBase connector.
-
Spring for Apache Hadoop - Spring-Hadoop integration, including HBase support.
-
Storm
- Storm/Trident integration for HBase.
-
Tajo - Tajo integration with HBase.
-
Zeppelin - HBase shell interpreter for Apache Zeppelin.
-
Ambari - Software for provisioning, managing, and monitor Hadoop/HBase clusters.
-
Cloudera Manager - Tool for managing Hadoop/HBase in production.
-
DbSchema - Diagram-oriented database designer with support for HBase.
-
Hannibal
- Tool to monitor and maintain HBase clusters.
-
h-rider
- GUI for viewing and manipulating data in HBase.
-
Hue - Smart analytics workbench that includes an HBase browser.
-
Sematext SPM - Tool for monitoring HBase, HDFS, etc.
Miscellaneous
-
HubSpot HBase support
- Configs and tools for HBase at HubSpot, including Hystrix integration and coprocessors.
Resources
Books
Papers
License