This guide contains a section for each open source project that MapR supports. You can learn how to install, configure, use, and integrate each project within the context of a MapR cluster.
MapR supports most Spark features. However, there are a few exceptions.
When Spark runs on YARN, MapR client nodes require the hadoop-yarn-server-web-proxy JAR file to run Spark applications. A MapR client node (a node with the mapr-client package, but without mapr-core packages) is also known as an edge node.
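One way to confirm that the JAR is present on a client node is to search a directory tree for it. The following is a minimal sketch, not a MapR-provided tool: the function name `find_web_proxy_jars` is hypothetical, the directory to search is supplied by the caller because no install path is stated here, and matching on the file name prefix `hadoop-yarn-server-web-proxy` is an assumption about how the JAR is named.

```python
from pathlib import Path
import sys

# Assumption: the JAR file name starts with this prefix, followed by a version.
JAR_PREFIX = "hadoop-yarn-server-web-proxy"


def find_web_proxy_jars(search_root):
    """Return the sorted paths of JAR files under search_root whose
    file names start with JAR_PREFIX. Returns an empty list when
    search_root is not a directory."""
    root = Path(search_root)
    if not root.is_dir():
        return []
    return sorted(
        path for path in root.rglob("*.jar")
        if path.name.startswith(JAR_PREFIX)
    )


if __name__ == "__main__":
    # The directory is passed on the command line; the current
    # directory is used only as a fallback.
    search_root = sys.argv[1] if len(sys.argv) > 1 else "."
    jars = find_web_proxy_jars(search_root)
    if jars:
        for jar in jars:
            print(jar)
    else:
        print(f"No {JAR_PREFIX} JAR found under {search_root}")
```

Run the script with the directory that holds the Hadoop libraries on the client node as its only argument. An empty result suggests that the JAR still needs to be copied to that node.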
This section includes information about using Spark on YARN in a MapR cluster.
Apache Spark is an open-source processing engine that you can use to process Hadoop data. Although MapR does not yet ship a Spark 2.0.0 package, you can install and use Spark 2.0.0 on a non-secure MapR 5.1 cluster or on a secure MapR 5.1 cluster that uses MapR-SASL authentication.