This site contains the main documentation for Version 6.1 of the MapR Converged Data Platform, including installation, configuration, administration, and reference information.
This section contains information about installing and upgrading MapR software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a MapR cluster.
MapR Data Platform is the industry-leading data platform for AI and analytics that solves enterprise business needs.
This section describes how to manage the nodes and services that make up a cluster.
This section contains information related to application development for Ezmeral ecosystem components and MapR Data Platform products, including the file system, Database (Key-Value and JSON), and Event Streams.
Before you start developing applications on the MapR Data Platform, consider how you will get the data into the platform, the storage format of the data, the type of processing or modeling that is required, and how the data will be accessed.
The following sections provide information about accessing MapR XD from C and Java applications.
This section contains information about developing client applications for JSON and key-value tables.
MapR Event Store For Apache Kafka brings integrated publish and subscribe messaging to MapR Data Platform.
This section contains information associated with developing YARN applications.
The MapR Data Science Refinery product is an easy-to-deploy and scalable data science toolkit with native access to all platform assets and superior out-of-the-box security.
This section describes how to leverage the capabilities of the MapR Data Fabric for Kubernetes.
The following sections provide information about each open-source project that is supported by the MapR Data Platform.
You can use Tez instead of MapReduce for generic data processing tasks. Tez can significantly increase processing speed compared with MapReduce. When used with Hive, Tez provides lower latency for interactive queries and higher throughput for batch queries.
This section describes how to enable High Availability for HiveServer2 and HiveMetastore.
Describes Hive features that are specific to MapR Data Platform.
This topic describes the public API changes that occurred between Hive 2.1 (EEP 5.0.0) and Hive 2.3 (EEP 6.0.0).
This section describes Hive logging for Hive 2.1 and later releases and includes information about log splitting.
Apache Livy is primarily used to provide integration between Hue and Spark.
Describes the supported MapR Event Store For Apache Kafka tools and clients.
The S3 gateway is a service that provides an S3-compatible interface to expose data in MapR Data Platform as objects. The S3 gateway manages all inbound S3 API requests to put data into and get data out of cloud storage.
This section discusses topics associated with Maven and MapR.
This section contains in-depth information for the developer.
MapR Data Platform supports public APIs for MapR File System, MapR Database, and MapR Event Store For Apache Kafka. These APIs are available for application-development purposes.
This section contains release-independent information, including MapR Installer documentation, ecosystem release notes, interoperability matrices, security vulnerabilities, and links to documentation for other MapR versions.
Definitions for commonly used terms in MapR Converged Data Platform environments.