Pig

IMPORTANT This component is deprecated. Hewlett Packard Enterprise recommends using an alternate product. For more information, see Discontinued Ecosystem Components.


Apache Pig™ is a platform for analyzing large data sets. Apache Pig™ is a high-level language for expressing data analysis programs coupled with an infrastructure to evaluate these programs. The infrastructure layer consists of a compiler that produces sequences of Map-Reduce programs. The language layer consists of a textual language called Pig Latin.

The following sections provide documentation about installing, upgrading, configuring and using Pig. You can also refer to the documentation available on the Apache Pig website.