File System

Discusses the features of the Data Fabric distributed file system and compares it to the Hadoop Distributed File System (HDFS).

The Data Fabric distributed file system provides a unified data solution for structured data (tables) and unstructured data (files). The file system is fully compliant with POSIX and Hadoop and is case sensitive.

The Data Fabric file system is a random read-write distributed file system that allows applications to concurrently read and write directly to disk. By contrast, the Hadoop Distributed File System (HDFS) supports append-only writes and can read only from closed files. Because HDFS is layered over the existing Linux file system, a large number of input/output (I/O) operations decreases cluster performance. The Data Fabric distributed file system also eliminates the NameNode, which is associated with cluster failure in other Hadoop distributions, and enables special features for data management and high availability.
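
The difference between random read-write access and append-only access can be illustrated with ordinary POSIX file operations. The following is a minimal sketch, not product code: the file name `records.dat`, the helper names, and the use of a temporary directory are assumptions made for illustration. On a real cluster, the directory would instead be a path under the cluster's NFS or FUSE mount point.

```python
import os
import tempfile
from typing import Tuple


def overwrite_in_place(path: str, offset: int, data: bytes) -> None:
    """Random write: replace bytes in the middle of an existing file."""
    with open(path, "r+b") as f:
        f.seek(offset)
        f.write(data)


def append_only(path: str, data: bytes) -> None:
    """Append-only write: new bytes can only be added at the end of the file."""
    with open(path, "ab") as f:
        f.write(data)


def read_all(path: str) -> bytes:
    """Return the full contents of the file."""
    with open(path, "rb") as f:
        return f.read()


def demo(directory: str) -> Tuple[bytes, bytes]:
    """Create a small file, update it in place, then append to it."""
    path = os.path.join(directory, "records.dat")  # hypothetical file name
    with open(path, "wb") as f:
        f.write(b"AAAA-BBBB-CCCC")

    # Random write: only the four bytes at offset 5 change.
    overwrite_in_place(path, 5, b"XXXX")
    after_random = read_all(path)

    # Append: existing bytes are untouched, new bytes land at the end.
    append_only(path, b"-DDDD")
    after_append = read_all(path)
    return after_random, after_append


if __name__ == "__main__":
    # A temporary directory stands in for a mounted cluster path.
    with tempfile.TemporaryDirectory() as scratch:
        print(demo(scratch))
```

A file system limited to append-only writes can support `append_only` but not `overwrite_in_place`; a random read-write file system supports both.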

The storage system architecture used by the Data Fabric distributed file system is written in C/C++, which eliminates the performance impact of Java garbage collection, and it prevents locking contention.

The following table highlights some of the features of the Data Fabric file system:

Feature	Description
Storage pools	A group of disks to which the Data Fabric file system writes data.
Containers	An abstract entity that stores files and directories in the Data Fabric file system. A container always belongs to exactly one volume, and can hold namespace information, file chunks, or table chunks for that volume.
CLDB	A service that tracks the location of every container.
Volumes	A management entity that stores and organizes containers. Used to distribute metadata, set permissions on data in the cluster, and back up data. A volume consists of a single name container and a number of data containers.
Direct Access NFS	Enables applications to read and write data directly to the cluster.
POSIX Clients	The loopbacknfs and FUSE-based POSIX clients connect to one or more Data Fabric clusters, and allow app servers, web servers, and applications to write data directly and securely to the Data Fabric cluster.
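
The relationships among volumes, containers, and the CLDB described in the table can be sketched as a toy data model. This is only an illustration of the stated relationships (a container belongs to exactly one volume, a volume has one name container plus data containers, and the CLDB tracks where each container lives). All class names, field names, container IDs, and node names here are hypothetical and do not reflect the actual implementation or any Data Fabric API.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Container:
    container_id: int
    volume: str  # a container belongs to exactly one volume
    kind: str    # "name" or "data" (labels chosen for this sketch)


@dataclass
class Volume:
    name: str
    name_container: Container  # exactly one name container per volume
    data_containers: List[Container] = field(default_factory=list)


class ToyCLDB:
    """Illustrative stand-in for a service that tracks container locations."""

    def __init__(self) -> None:
        self._locations: Dict[int, List[str]] = {}

    def register(self, container: Container, nodes: List[str]) -> None:
        """Record which nodes hold a copy of the container."""
        self._locations[container.container_id] = list(nodes)

    def locate(self, container_id: int) -> List[str]:
        """Return the nodes recorded for the given container ID."""
        return self._locations[container_id]


def build_volume(name: str, first_id: int, data_count: int) -> Volume:
    """Create a volume with one name container and `data_count` data containers."""
    name_container = Container(first_id, name, "name")
    data = [
        Container(first_id + i, name, "data")
        for i in range(1, data_count + 1)
    ]
    return Volume(name, name_container, data)


if __name__ == "__main__":
    cldb = ToyCLDB()
    vol = build_volume("projects", first_id=100, data_count=2)
    cldb.register(vol.name_container, ["node-a"])
    for c in vol.data_containers:
        cldb.register(c, ["node-a", "node-b"])
    print(cldb.locate(101))
```

In this sketch, a client that wants data from a container first asks the lookup service where the container lives and then contacts one of the returned nodes. The real lookup and replication behavior is more involved than this model shows.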