Accessing Data with NFS v3

Describes how MapR works with NFS v3.

Unlike other Hadoop distributions that only allow cluster data import or export as a batch operation, MapR lets you mount the cluster itself using NFS so that your applications can read and write data directly. MapR allows direct file modification and multiple concurrent reads and writes using POSIX semantics. With an NFS-mounted cluster, you can read and write data directly with standard tools, applications, and scripts. For example, you could run a MapReduce application that outputs to a CSV file, then import the CSV file directly into a SQL database using NFS.
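
As a minimal sketch, assuming the cluster is mounted at /mapr and named my.cluster.com (the /user/alice/reports directory and file names are hypothetical), ordinary shell commands can read and write cluster files in place:

    # Copy a local CSV into the cluster with no hadoop fs command
    cp ./sales.csv /mapr/my.cluster.com/user/alice/reports/sales.csv

    # Append to the file in place, which POSIX semantics allow
    echo "2024-01-31,totals" >> /mapr/my.cluster.com/user/alice/reports/sales.csv

    # Read MapReduce output directly, for example before loading it into a database
    head /mapr/my.cluster.com/user/alice/reports/part-00000.csv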

MapR exports each cluster as the directory /mapr/<cluster name> (for example, /mapr/my.cluster.com). If you create a mount point with the local path /mapr, then Hadoop FS paths and NFS v3 paths to the cluster will be the same. This makes it easy to work on the same files using NFS v3 and Hadoop. In a multi-cluster setting, the clusters share a single namespace, and you can see them all by mounting the top-level /mapr directory.
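
To illustrate this path equivalence, assuming a mount point at /mapr and the example cluster name my.cluster.com (the /user/alice directory is hypothetical), the same directory can be listed both ways:

    # Hadoop FS view of the directory
    hadoop fs -ls /user/alice

    # The same contents through the NFS v3 mount
    ls /mapr/my.cluster.com/user/alice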

WARNING MapR uses version 3 of the NFS protocol. NFS version 4 bypasses the port mapper and attempts to connect to the default port only. If you are running NFS on a non-standard port, mounts from NFS version 4 clients time out. Use the -o nfsvers=3 option to specify NFS v3.
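
For example, an explicit NFS v3 mount from a Linux client might look like the following sketch; the node name usa-node01 and the local mount point /mapr are placeholders for your own values:

    # Force NFS v3 so the client does not attempt an NFS v4 connection
    sudo mount -o nfsvers=3 usa-node01:/mapr /mapr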

You can mount the cluster on a Linux, Mac, or Windows client. Before you begin, make sure you know the hostname and directory of the NFS v3 share you plan to mount.
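
One way to confirm the hostname and exported directory from a Linux client is the standard showmount utility; the node name usa-node01 below is a placeholder:

    # List the directories exported by the cluster's NFS v3 server
    showmount -e usa-node01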