HBase Migration
The MapR Hadoop distribution includes HBase, with a number of MapR-exclusive enhancements.
HBase is the Hadoop database, which provides random, real-time read/write
access to very large datasets. The MapR Hadoop distribution includes HBase and is fully
integrated with MapR enhancements for speed, usability, and dependability. MapR provides a
volume (normally
mounted at /hbase
) to store HBase data.
- HBase bulk load jobs: If you are currently using HBase bulk load jobs to import
data into the HDFS, make sure to load your data into a path under the
/hbase
volume. - Compression: The HBase write-ahead log (WAL) writes many tiny records, and compressing it would cause massive CPU load. Before using HBase, turn off MapR compression for directories in the HBase volume.