Installing Oozie
This topic includes instructions for using package managers to download and install Oozie from the MEP repository.
Prerequisites
About this task
mapr-oozie
and mapr-oozie-internal
, on the client node and
the server node. mapr-oozie
is dependent on mapr-oozie-internal.
mapr-oozie-internal
is automatically installed by the package manager when you
install mapr-oozie
.Execute the following commands as root
or using sudo
on a MapR cluster:
Procedure
-
Install
mapr-oozie
andmapr-oozie-internal
:RedHat/CentOSyum install mapr-oozie mapr-oozie-internal
SUSEzypper install mapr-oozie mapr-oozie-internal
Ubuntuapt-get install mapr-oozie mapr-oozie-internal
-
For non-secure clusters, add the following two properties to
core-site.xml
located in/opt/mapr/hadoop/hadoop-2.x.x/etc/hadoop/core-site.xml
:<property> <name>hadoop.proxyuser.mapr.hosts</name> <value>*</value> </property> <property> <name>hadoop.proxyuser.mapr.groups</name> <value>*</value> </property>
-
Restart the YARN services. For YARN mode, restart NodeManager and ResourceManager:
maprcli node services -name nodemanager -action restart -nodes <space delimited list of nodes> maprcli node services -name resourcemanager -action restart -nodes <space delimited list of nodes>
-
Run
configure.sh -R
:/opt/mapr/server/configure.sh -R
-
Export the Oozie URL to your environment:
Theexport OOZIE_URL='http://<Oozie_node>:<oozie_port_number>/oozie'
<oozie_port_number>
depends on whether your cluster is secure. For secure clusters, use:<oozie_port_number>=11443
For non-secure clusters, use:
<oozie_port_number>=11000
-
Check Oozie’s status with the following command:
# /opt/mapr/oozie/oozie-<version>/bin/oozie admin -status
-
If high availability for the Resource Manager is configured, edit the
job.properties
file for each workflow and insert the following statementJobTracker=maprfs:///
-
If high availability for the Resource Manager is not configured, provide the
address of the node running the active ResourceManager and the port used for
ResourceManager client RPCs (port 8032). For each workflow, edit the
job.properties
file and insert the following statement:JobTracker=<ResourceManager_address>:8032
-
Restart Oozie:
maprcli node services -name oozie -action restart -nodes <space delimited list of nodes>
NOTE: If high availability for the Resource Manager is not configured and the ResourceManager fails, you must update thejob.properties
with the active ResourceManager.