MapR Data Science Refinery

The MapR Data Science Refinery product is an easy-to-deploy and scalable data science toolkit with native access to all platform assets and superior out-of-the-box security.

IMPORTANT This component is deprecated. Hewlett Packard Enterprise recommends using an alternate product. For more information, see Discontinued Ecosystem Components.


The MapR Data Science Refinery product offers:

Access to All Platform Assets
The MapR FUSE-Based POSIX Client allows application servers, web servers, and other client nodes and applications to read and write data directly and securely to a MapR cluster, like a Linux filesystem. In addition, connectors are provided for interacting with both MapR Database and MapR Event Store For Apache Kafka through Apache Spark connectors.
Superior Security
The MapR Platform provides enhanced security. Apache Zeppelin on the MapR leverages and integrates with this security layer using the built-in capabilities provided by the MapR Persistent Application Container (PACC).
Extensibility
Apache Zeppelin is paired with the Helium framework to offer pluggable visualization capabilities.
Simplified Deployment
A preconfigured Zeppelin Docker container provides the ability to leverage MapR as a persistent data store.

Getting Started Using the MapR Data Science Refinery with Zeppelin

You can deploy the Apache Zeppelin Docker container included in the MapR Data Science Refinery system on any of the following, listed in order of recommendation for best practice, starting with the most preferable option:

  • Container orchestration engines; for example: Docker Swarm, Kubernetes, OpenShift
  • Cloud instances
  • Shared edge node
  • Personal computers
NOTE Starting in version 1.2, you can deploy the MapR Data Science Refinery software on a MapR cluster node. Make sure you take into consideration the resource requirements of the MapR Data Science Refinery system, if you choose this deployment mode.

If you are already familiar with Apache Zeppelin on the MapR system and want to skip to the deployment instructions, see Running the Zeppelin Container.