Known Issues
You may encounter the following known issues after upgrading to Version 5.1.0.
Installation and Configuration Issues
- CentOS Version 6.3 and Earlier
- MapR installations on Version 6.3 and earlier can fail because of an unresolved
dependency on the
redhat-lsb-core
package. This problem occurs when the CentOS repository points tovault.centos.org
, rather thanmirror.centos.org
. If you encounter this problem, use one of the following workarounds:- Add this repository: http://mirror.centos.org/centos/6/os/x86_64/
- Manually download and install the RPM:
wget http://mirror.centos.org/centos/6/os/x86_64/Packages/redhat-lsb-core-4.0-7.el6.centos.x86_64.rpm yum localinstall redhat-lsb-core-4.0-7.el6.centos.x86_64.rpm
- 16216
- When you run
configure.sh
with the-HS
option on client nodes, themapred-site.xml
is re-generated and does not retain existing user settings. To work around this problem, use the-R
option in the command. - 16155
- In order to reconfigure a Mac client from secure mode to non-secure mode (or vice
versa), you must follow these steps:
- Manually remove the entry for the current cluster from:
/opt/mapr/conf/mapr-clusters.conf
- Run
configure.sh
.
- Manually remove the entry for the current cluster from:
- 16386
-
If you enable centralized logging on a cluster that was using YARN log aggregation in Version 4.0.1 prior to upgrading to version 4.1.0, you can no longer access previously aggregated MapReduce logs from the HistoryServer UI.
Workaround: Perform the following steps to view previously aggregated MapReduce logs from the History Server UI:
- Use the yarn logs command to retrieve the logs for each MapReduce application. The output of this command contains stdout, stderr, syslog with specific delimiters.
- Parse the output of yarn logs command to create syslog, stdout, stderr files using UNIX tools such as sed or awk.
- Add the syslog, stdout, stderr files to the centralized logging directory with the
following directory
hierarchy:
/var/mapr/local/<NodeManager node>/logs/yarn/userlogs/application_<applicationID>/container_<containerID>/
You will need to create the application and container directories and provide the user that submitted the application the proper permissions on the files and directories.
For example, ifusera
submitted the application,usera
should have the following permissions on the directories and log files:- Application directories:
drwxr-s--- 5 usera mapr 4096 2015-01-07 11:32 /var/mapr/local/qa-node101.qa.lab/logs/yarn/userlogs/application_<id>
- Container directories:
drwxr-s--- - usera mapr 3 2015-01-07 11:32 /var/mapr/local/qa-node101.qa.lab/logs/yarn/userlogs/application_<id>/container_<id>
- Log files:
-rw-r----- 2 usera mapr 290 2015-01-07 11:32 /var/mapr/local/qa-node101.qa.lab/logs/yarn/userlogs/application_<id>/container_<id>/stderr
NOTE: After you complete the workaround, you will also be able to runmaprcli job linklogs
on these logs.
Issue with Removing Replicas When Indexing MapR-DB Data in Elasticsearch
Only one user should manage indexing of any given source MapR-DB table. If indexing of the
table in a given Elasticsearch type is no longer needed and any other user attempts to run
the command maprcli table replica elasticsearch remove
to stop replicating
from the table to that Elasticsearch type, the command will fail with the message that
permission is denied.
MapR Streams does not support producers and consumers running on Windows
Neither 32-bit and 64-bit Windows operating systems are supported for producers and consumers.
MapR Client
- The secure cluster must be listed first in
mapr-clusters.conf
. - A user must obtain a ticket for the secure cluster even if the user wants to access only the nonsecure cluster.
MCS Replication Tab Visible for JSON Tables
22455: In the MCS, the Table Replication tab is visible for JSON tables; however, replication is not supported for these tables.
Metrics Database Issues
- You cannot use the Metrics Database to record activity for applications that run in YARN (MRv2). The database only supports MRv1 jobs.
- MRv1 metrics are not supported on any RedHat 7.x system.
- When you use MRv1 metrics, be aware of the following issues:
- When the hoststats service crashes, MRv1 metrics will not be up-to-date in the mysql database.
- The hoststats service may occasionally consume a large amount of memory.
Workarounds:- When hoststats crashes or consumes an excess amount of memory, restart hoststats
with the following command:
maprcli node services -name hoststats -action restart -filter '[csvc==hoststats]'
- If restarting hoststats does not resolve your issue, complete the following
steps to delete the metrics and restart hoststats.
- Run the following command to delete the MRv1
metrics:
hadoop fs -rm /var/mapr/cluster/mapred/jobTracker/system/jobs/history/metrics
- Run the following command to restart Hoststats:
maprcli node services -name hoststats -action restart -filter '[csvc==hoststats]'
- Run the following command to delete the MRv1
metrics:
- If step 1 and 2 do not resolve the hoststats issue, for example in the case of a
memory leak, you may want complete the following steps to disable MRv1 metrics:
- On the nodes that have the metrics package installed, comment out the
following parameters in the
/opt/mapr/hadoop/hadoop-0.2.0/conf/hadoop-metrics.properties file:
- maprmepredvariant.class
- maprmepredvariant.period
- maprmapred.class
- maprmapred.period
For example:#maprmepredvariant.class=com.mapr.job.mngmnt.hadoop.metrics.MaprRPCContext #maprmepredvariant.period=60 #maprmapred.class=com.mapr.job.mngmnt.hadoop.metrics.MaprRPCContextFinal #maprmapred.period=60
- Run the following command to restart hoststats on all nodes where it is
running:
maprcli node services -name hoststats -action restart -filter '[csvc==hoststats]'
- On the nodes that have the metrics package installed, comment out the
following parameters in the
/opt/mapr/hadoop/hadoop-0.2.0/conf/hadoop-metrics.properties file:
Resource Manager Issues
- 14696/15100
- When automatic or manual ResourceManager failover is enabled and a job is submitted with impersonation turned ON by a user without impersonation privileges, the job submission eventually times out instead of returning an appropriate error. This behavior does not affect standard ecosystem services such as HiveServer because they are configured to run as the mapr user (with impersonation allowed). However, this problem does affect non-ecosystem applications or services that attempt to submit jobs with impersonation turned ON. MapR recommends that customers add the user in question to the impersonation list so that the job can proceed.
- 14907
- When several jobs are submitted and the ResourceManager is using the ZKRMStateStore for failover, the cluster may experience ZooKeeper timeouts and instability. MapR recommends that customers always use the FileSystemRMStateStore to support ResourceManager HA. See Recovery for the ResourceManager.
Services Do Not Start After CentOS 7.0 Reboot
This problem is specific to the CentOS 7.0 operating system. When a CentOS 7.0 node is
rebooted, Warden starts but fails to bring up other cluster services. To work around this
problem, manually add the fully qualified domain name to the /etc/hosts
file before rebooting the system.
SuSe 11 Does Not Support Secure Clusters
Secure MapR clusters are not supported on SuSe 11 platforms because of an incompatibility with the glibc version that is installed. This problem does not affect other operating systems that MapR supports, which have the required glibc version.