HBase Migration

The MapR Hadoop distribution includes HBase, with a number of MapR-exclusive enhancements.

HBase is the Hadoop database. It provides random, real-time read/write access to very large datasets. In the MapR distribution, HBase is fully integrated with MapR enhancements for speed, usability, and dependability. MapR provides a volume (normally mounted at /hbase) to store HBase data.

  • HBase bulk load jobs: If you currently use HBase bulk load jobs to import data into HDFS, make sure those jobs load the data into a path under the /hbase volume.
  • Compression: The HBase write-ahead log (WAL) consists of many tiny records, and compressing them would put a heavy load on the CPU. Before using HBase, turn off MapR compression for directories in the HBase volume.
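
The two checks above can be sketched as a small pre-migration helper. This is an illustrative sketch only, not part of the MapR tooling: the function names are hypothetical, the mount point is assumed to be the default /hbase, and the `hadoop mfs -setcompression off` command line is an assumption that should be confirmed against the documentation for your MapR release. The script only builds the command; it does not run it.

```python
import posixpath

# Assumption: the HBase volume is mounted at the default location named above.
HBASE_VOLUME_MOUNT = "/hbase"


def is_under_hbase_volume(path, mount=HBASE_VOLUME_MOUNT):
    """Return True if `path` lies strictly beneath the volume mount point.

    Paths are normalized first, so "/hbase/../tmp" is correctly rejected.
    """
    normalized = posixpath.normpath(path)
    mount = posixpath.normpath(mount)
    return normalized.startswith(mount + "/")


def compression_off_command(directory=HBASE_VOLUME_MOUNT):
    """Build (but do not execute) the command that disables compression.

    Assumption: the MapR CLI accepts `hadoop mfs -setcompression off <dir>`;
    verify this against your installed release before running it.
    """
    return ["hadoop", "mfs", "-setcompression", "off", directory]


if __name__ == "__main__":
    # Hypothetical bulk load output paths, used purely for illustration.
    for candidate in ["/hbase/bulkload/out", "/user/etl/out"]:
        status = "ok" if is_under_hbase_volume(candidate) else "outside the HBase volume"
        print(candidate + ": " + status)
    print(" ".join(compression_off_command()))
```

Returning the command as an argument list rather than running it keeps the sketch safe to try on any machine; the list can be passed to `subprocess.run` once the command has been verified for your cluster.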