This site contains the main documentation for Version 6.1 of the MapR Converged Data Platform, including installation, configuration, administration, and reference information.
This section contains information about installing and upgrading MapR software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a MapR cluster.
MapR Data Platform is the industry-leading data platform for AI and analytics that solves enterprise business needs.
This section describes how to manage the nodes and services that make up a cluster.
Lists topics that help manage a MapR cluster.
Provides a synopsis of managing nodes in a cluster.
This section provide information about how to organize and manage data using volumes, a unique feature of MapR clusters.
Administration of the MapR Database is done primarily via the command line (maprcli) or with the Managed Control System (MCS). Regardless of whether the MapR Database table is used for binary files or JSON documents, the same types of commands are used with slightly different parameter options. MapR Database administration is associated with tables, columns and column families, and table regions.
A MapR gateway mediates one-way communication between a source MapR cluster and a destination cluster. You can replicate MapR Database tables (binary and JSON) and MapR Event Store For Apache Kafka streams. MapR gateways also apply updates from JSON tables to their secondary indexes and propagate Change Data Capture (CDC) logs.
This section describes how to monitor the health and performance of a MapR cluster.
Describes how to configure security and manage secure clusters.
Provides procedures that will enable you to use MapR clusters securely.
The MapR Data Access Gateway is a service that acts as a proxy and gateway for translating requests between lightweight client applications and the MapR cluster. This section describes considerations when upgrading the service, how to modify configuration settings, and how to administer and manage the service.
This section contains in-depth reference information for the administrator.
This section provides information about the MapR command API. Most commands can be run on the command-line interface (CLI), or by making REST requests programmatically or in a browser.
Contains information about various scripts and utilities, that help setup, maintain, and monitor clusters.
Describes the syntax and parameters of the configure.sh script that you run for a number of tasks including setting up MapR client nodes, and configuring services for a node.
configure.sh
Use the configure-crosscluster.sh utility to set up cross-cluster security between two clusters.
configure-crosscluster.sh
Monitors the activity of the Container Location Database (CLDB). This utility prints information about the CLDB service that is running on the node from which you run the utility.
Describes the disksetup command that formats disks for use by MapR storage.
disksetup
Dumps or checks the validity of the stripelets in the backend volume that is associated with the volume configured for warm tiering.
Describes how to use the expandaudit utility to expand IDs captured in the audit logs to their corresponding names.
expandaudit
Dynamically sets the log level to debug a library.
Detects and fixes inconsistencies in the filesystem.
Describes how you can use the gfsck command, under the supervision of Map R Support or Engineering, to perform consistency checks and appropriate repairs on a volume, or a volume snapshot.
gfsck
guts is a tool to measure/analyse performance. In the default mode, it prints one line every second, and counts the number of operations or bytes-processed in one second intervals. guts is an internal utility, and is subject to change without notice.
guts
Use the manageSSLKeys.sh utility to create and manage SSL certificates.
manageSSLKeys.sh
Collects information about a cluster's recent activity, to help MapR Support diagnose problems.
Collects node and cluster-level information for the node on which you invoke the script.
Authenticates logins to secure MapR clusters.
The mrconfig commands let you create, remove, and manage storage pools, disk groups, and disks; and provide information about containers.
mrconfig
Discusses the mrconfig cntr commands that allow you to manage containers and container replicas.
mrconfig cntr
Each instance of the file server on a node is responsible for processing and tracking activities that result from running database commands. The mrconfig dbinfo command displays information about the activities, including information related to containers, tablets, storage pools, tags, and threads processing operations on tables.
mrconfig dbinfo
This section discusses the mrconfig dg commands that allow you to configure disk groups.
mrconfig dg
Facilitiates creation of disk groups.
The mrconfig dg help command displays online help for disk group commands.
mrconfig dg help
The mrconfig dg list command lists the disk groups on all the MapR File System disks on a node.
mrconfig dg list
This section discusses the mrconfig disk commands.
The mrconfig info commands provide information about memory, threads, volumes, containers and other information about the MapR file system.
mrconfig info
This section describes the mrconfig mastgateway commands that allow you to test PUT, GET, and DELETE operations on the corresponding tier.
The mrconfig sp commands create and control storage pools.
mrconfig sp
Prints the space usage for each directory, for a container.
Returns the path to the file specified by ID (fid).
Pulls master configuration files from /var/mapr/configuration on the cluster to the local disk, on each node.
/var/mapr/configuration
Simulates a FUSE mount point to determine its read and write performance.
This section contains reference information about various configuration files.
The pages in this section provide details about all of the types of alarms.
This section provides information associated with the MapR Data Platform environment.
A sample metering JSON file for an 8-node cluster with no workloads enabled.
This table lists the metrics collected by the metering feature.
Lists the common errors and their solutions.
Lists the best practices and performance considerations to follow when backing up MapR information.
This section contains information related to application development for Ezmeral ecosystem components and MapR Data Platform products, including the file system, Database (Key-Value and JSON), and Event Streams.
This section contains release-independent information, including: MapR Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other MapR version documentation.
Definitions for commonly used terms in MapR Converged Data Platform environments.