Home
6.0 Development
This section contains information related to application development for ecosystem components and MapR products including MapR-DB (binary and JSON), MapR-FS, and MapR Streams.
Ecosystem Components
The following sections provide information about each open source project that MapR supports.
Sqoop

MapR 6.0 Documentation

6.0 Development
This section contains information related to application development for ecosystem components and MapR products including MapR-DB (binary and JSON), MapR-FS, and MapR Streams.
- Application Development Process
  Before you start developing applications on MapR’s Converged Data Platform, consider how you will get the data onto the platform, the format it will be stored in, the type of processing or modeling that is required, and how the data will be accessed.
- MapR-FS and Apps
  The following sections provide information about accessing MapR-FS with C and Java applications.
- MapR-DB and Apps
  This section contains information about developing client applications for JSON and binary tables.
- MapR-ES and Apps
  MapR-ES brings integrated publish and subscribe messaging to the MapR Converged Data Platform.
- MapReduce and Apps
  This section contains information associated with developing YARN applications.
- MapR Data Science Refinery
  The MapR Data Science Refinery is an easy-to-deploy and scalable data science toolkit with native access to all platform assets and superior out-of-the-box security.
- MapR Data Fabric for Kubernetes FlexVolume Driver
  This section describes how to use and troubleshoot the MapR Data Fabric for Kubernetes FlexVolume Driver.
- Ecosystem Components
  The following sections provide information about each open source project that MapR supports.
  - MapR Ecosystem Packs
    A MapR Ecosystem Pack (MEP) provides a set of ecosystem components that work together on one or more MapR cluster versions. Only one version of each ecosystem component is available in each MEP. For example, only one version of Hive and one version of Spark is supported in a MEP.
  - AsyncHBase
  - Cascading
  - Drill
  - Flume
  - HBase Client and MapR-DB Binary Tables
  - HCatalog
  - Hive
  - HttpFS
  - Hue
  - Impala
  - MapR-ES Clients and Tools
  - Myriad
  - OpenStack Manila
  - Oozie
  - Pig
  - Sentry
  - Spark
  - Sqoop
    - Sqoop1
    - MapR Connector for Teradata
  - Third Party Solutions
- Maven and MapR
  This section discusses topics associated with Maven and MapR.
- Developer's Reference
  This section contains in-depth information for the developer.
- API Documentation
  MapR supports public APIs for MapR-FS, MapR-DB, and MapR-ES. These APIs are available for application development purposes.

Sqoop

Apache Sqoop™ is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

This documentation provides all relevant details about using Sqoop and Sqoop2 with MapR, but does not duplicate Apache documentation. You can refer also to documentation available from the Apache Sqoop website.

The following table describes the differences between Sqoop1 or Sqoop2:

Feature	Sqoop1	Sqoop2
Specialized connectors for all major RDBMS	Available.	Not available. However, you can use the `generic-jdbc-connector` , which has been tested on these databases: MySQL Microsoft SQL Server Oracle (Not supported in Sqoop 1.99.7) PostgreSQL The generic JDBC connector should also work with any other JDBC-compliant database, although specialized connectors probably give better performance.
Data transfer from RDBMS to Hive	Done automatically.	Must be done manually in two stages: Import data from RDBMS into MapR-FS. Load data into Hive using the `LOAD DATA` command NOTE: As of Sqoop 1.99.7, you can also use the `kite-connector` to load data into Hive.
Data transfer from Hive to RDBMS	Must be done manually in two stages: Extract data from Hive into MapR-FS, as a text file or as an Avro file. Export the output of step 1 to an RDBMS using Sqoop.	Must be done manually in two stages: Extract data from Hive into MapR-FS, as a text file or as an Avro file. NOTE: As of Sqoop 1.99.7, you can also use the `kite-connector` to extract data from Hive. Export the output of step 1 to an RDBMS using Sqoop.
Integrated Kerberos security	Supported.	Supported.
Password encryption	Not supported.	Supported as of Sqoop 1.99.7.