HBase Migration

The HPE Ezmeral Data Fabric Hadoop distribution includes HBase, with a number of Data Fabric-exclusive enhancements.

HBase is the Hadoop database, providing random, real-time read/write access to very large datasets. In the Data Fabric Hadoop distribution, HBase is fully integrated with Data Fabric enhancements for speed, usability, and dependability. Data Fabric provides a volume (normally mounted at /hbase) to store HBase data.

  • HBase bulk load jobs: If you currently use HBase bulk load jobs to import data into HDFS, make sure those jobs load your data into a path under the /hbase volume.
  • Compression: The HBase write-ahead log (WAL) writes many tiny records, and compressing it would cause massive CPU load. Before using HBase, turn off Data Fabric compression for directories in the HBase volume.
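
The two checks above can be sketched as a small Python helper. This is an illustrative sketch only, not part of the product. The names is_under_hbase_volume and compression_off_command are hypothetical, the mount point is assumed to be the default /hbase, and the hadoop mfs -setcompression command is assumed to be available in your release; verify it against your version's command reference before running it.

```python
import posixpath

# Assumed default mount point of the HBase volume; adjust if yours differs.
HBASE_VOLUME_MOUNT = "/hbase"


def is_under_hbase_volume(path, mount=HBASE_VOLUME_MOUNT):
    """Return True if `path` resolves to a location strictly inside `mount`.

    The path is normalized first, so entries such as "/hbase/../tmp"
    are not mistaken for paths inside the volume.
    """
    normalized = posixpath.normpath(path)
    mount = posixpath.normpath(mount)
    return normalized.startswith(mount + "/")


def compression_off_command(directory=HBASE_VOLUME_MOUNT):
    """Build (but do not run) the command that turns off compression.

    Assumption: `hadoop mfs -setcompression off <dir>` exists in your release.
    """
    return ["hadoop", "mfs", "-setcompression", "off", directory]


if __name__ == "__main__":
    # Hypothetical bulk load output paths, used only for illustration.
    for candidate in ["/hbase/staging/table1", "/user/etl/table1"]:
        print(candidate, is_under_hbase_volume(candidate))
    print(" ".join(compression_off_command()))
```

The helper only builds the command so that it can be reviewed first; to apply it, pass the returned list to subprocess.run on a cluster node where the hadoop client is installed.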