Installing Oozie

This topic includes instructions for using package managers to download and install Oozie from the EEP repository.

Prerequisites

For instructions on setting up the EEP repository, see Step 10: Install Ecosystem Components Manually.

About this task

The Oozie client/server architecture requires you to install two packages, mapr-oozie and mapr-oozie-internal, on the client node and the server node. mapr-oozie is dependent on mapr-oozie-internal. mapr-oozie-internal is automatically installed by the package manager when you install mapr-oozie.

Execute the following commands as root or using sudo on a HPE Ezmeral Data Fabric cluster:

Procedure

  1. Install mapr-oozie and mapr-oozie-internal:
    RedHat/CentOS
    yum install mapr-oozie mapr-oozie-internal
    SLES
    zypper install mapr-oozie mapr-oozie-internal
    Ubuntu
    apt-get install mapr-oozie mapr-oozie-internal
  2. For non-secure clusters, add the following two properties to core-site.xml located in /opt/mapr/hadoop/hadoop-2.x.x/etc/hadoop/core-site.xml:
    <property>
      <name>hadoop.proxyuser.mapr.hosts</name>
      <value>*</value>
    </property>
    <property>
      <name>hadoop.proxyuser.mapr.groups</name>
      <value>*</value>
    </property> 
  3. Restart the YARN services. For YARN mode, restart NodeManager and ResourceManager:
    maprcli node services -name nodemanager -action restart -nodes <space delimited list of nodes>
    maprcli node services -name resourcemanager -action restart -nodes <space delimited list of nodes>
  4. Run configure.sh -R:
    /opt/mapr/server/configure.sh -R
  5. On client or edge nodes installed with EEP 6.3.1 or EEP 7.0.0 or later, run the following configure.sh command:
    For secure clusters, use:
    /opt/mapr/oozie/oozie-5.1.0/bin/configure.sh -R -c --secure
    For non-secure clusters, use:
    /opt/mapr/oozie/oozie-5.1.0/bin/configure.sh -R -c --unsecure
  6. Export the Oozie URL to your environment:
    For secure clusters, use the following export command, and specify the oozie_port_number as 11443:
    export OOZIE_URL='<https://<Oozie_node>:<oozie_port_number>/oozie>'
    
    For non-secure clusters, use the following export command, and specify the oozie_port_number as 11000:
    export OOZIE_URL='<http://<Oozie_node>:<oozie_port_number>/oozie>'
    
  7. Check the Oozie status with the following command:
    /opt/mapr/oozie/oozie-<version>/bin/oozie admin -status
  8. If high availability for the Resource Manager is configured, edit the job.properties file for each workflow and insert the following statement
    JobTracker=maprfs:///
  9. If high availability for the Resource Manager is not configured, provide the address of the node running the active ResourceManager and the port used for ResourceManager client RPCs (port 8032). For each workflow, edit the job.properties file and insert the following statement:
    JobTracker=<ResourceManager_address>:8032
  10. Restart Oozie:
    maprcli node services -name oozie -action restart -nodes <space delimited list of nodes>
    NOTE If high availability for the Resource Manager is not configured and the ResourceManager fails, you must update the job.properties with the active ResourceManager.