Spark 2.1.0-1901 (EEP 4.1.3 and EEP 3.0.5) Release Notes

This section provides reference information, including new features, patches, and known issues for Spark 2.1.0-1901.

The notes below relate specifically to the MapR Distribution for Apache Hadoop. You may also be interested in the open-source Spark 2.1.0 Release Notes.

These release notes contain only MapR-specific information and are not necessarily cumulative in nature. For information about how to use the release notes, see Ecosystem Component Release Notes.

Spark Version: 2.1.0
Release Date: February 2019
MapR Version Interoperability: See EEP Components and OS Support.
Source on GitHub: https://github.com/mapr/spark
GitHub Release Tag: 2.1.0-mapr-1901
Maven Artifacts: https://repository.mapr.com/maven/
Package Names: Navigate to https://package.ezmeral.hpe.com/releases/MEP/ and select your EEP and OS to view the list of package names.
IMPORTANT
  • Spark 2.2 can connect to Hive Metastore 2.1. However, Spark does not support Hive features added after Hive 1.2.
  • Spark on YARN and Spark Standalone modes are supported only on clusters in MRv2 (YARN) mode. They are not supported on clusters in MRv1 (classic) mode.

Hive Support

This version of Spark supports integration with Hive. However, note the following exceptions:

Fixes

This MapR release includes the following fixes made since the latest MapR Spark 2.2.1 release. For details, refer to the commit log for this project on GitHub.

GitHub Commit    Date (YYYY-MM-DD)    Comment
e6a9733          2018-11-01           [SPARK-19263] DAGScheduler should avoid sending conflicting task set
664b59f          2018-11-01           MapR [SPARK-266] Spark jobs can't finish correctly when an error occurs while the job is running
7c714d7          2019-01-22           MapR [SPARK-419] Update hive-maprdb-json-handler jar for Spark

Known Issues

  • None.

Resolved Issues

  • None.