This site contains the main documentation for Version 6.1 of the MapR Converged Data Platform, including installation, configuration, administration, and reference information.
This section contains information about installing and upgrading MapR software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a MapR cluster.
MapR Data Platform is the industry-leading data platform for AI and analytics that solves enterprise business needs.
This section describes how to manage the nodes and services that make up a cluster.
This section contains information related to application development for Ezmeral ecosystem components and MapR Data Platform products, including the file system, Database (Key-Value and JSON), and Event Streams.
Before you start developing applications on the MapR Data Platform platform, consider how you will get the data into the platform, the storage format of the data, the type of processing or modeling that is required, and how the data will be accessed.
The following sections provide information about accessing the MapR XD with C and Java applications.
This section contains information about developing client applications for JSON and key-value tables.
MapR Event Store For Apache Kafka brings integrated publish and subscribe messaging to MapR Data Platform.
This section contains information associated with developing YARN applications.
The MapR Data Science Refinery product is an easy-to-deploy and scalable data science toolkit with native access to all platform assets and superior out-of-the-box security.
This section describes how to leverage the capabilities of the MapR Data Fabric for Kubernetes.
The following sections provide information about each open-source project that is supported by the MapR Data Platform.
This section discusses topics associated with Maven and MapR.
This section contains in-depth information for the developer.
The mapr dbshell is a tool that enables you to create and perform basic manipulation of JSON tables and documents. You run dbshell by typing mapr dbshell on the command line after logging into a node in a MapR Data Platform cluster.
mapr dbshell
MapR Database JSON provides utilities to copy, export, and import data, compare table content, and verify the consistency of secondary indexes.
You can manage MapR Database tables using HBase shell commands and additional HBase shell commands included in the MapR Data Platform distribution of Hadoop.
MapR Database provides utilities to copy and compare data in MapR Database binary tables.
This section describes the YARN commands.
MapR releases source code to the open-source community for enhancements that HPE has made to the Apache Hadoop project and other ecosystem components.
This section describes the Hadoop commands.
The hadoop archive command creates a Hadoop archive, a file that contains other files. A Hadoop archive always has a *.har extension.
hadoop archive
*.har
The hadoop classpath command prints the class path needed to access the Hadoop jar and the required libraries.
hadoop classpath
The hadoop daemonlog command gets and sets the log level for each daemon.
hadoop daemonlog
The hadoop distcp command is a tool used for large inter- and intra-cluster copying.
hadoop distcp
The hadoop fs command runs a generic file system user client that interacts with the MapR File System.
hadoop fs
The hadoop jar command runs a program contained in a JAR file. Users can bundle their MapReduce code in a JAR file and execute it using this command.
hadoop jar
The hadoop job command enables you to manage MapReduce jobs.
hadoop job
The hadoop mfs command displays directory information and contents, creates symbolic links and hard links, sets, gets, and removes Access Control Expressions (ACE) on files and directories, and sets compression and chunk size on a directory.
hadoop mfs
The hadoop mradmin command runs Map-Reduce administrative commands.
hadoop mradmin
The hadoop pipes command runs a pipes job.
hadoop pipes
The hadoop queue command displays job queue information.
hadoop queue
The hadoop version command prints the hadoop software version.
hadoop version
The hadoop conf command outputs the configuration information for this node to standard output.
hadoop conf
MapR Data Platform supports public APIs for MapR File System, MapR Database, and MapR Event Store For Apache Kafka. These APIs are available for application-development purposes.
This section contains release-independent information, including: MapR Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other MapR version documentation.
Definitions for commonly used terms in MapR Converged Data Platform environments.
This section contains the following: