Copying Data from Apache Hadoop to a Data Fabric Cluster

Describes the procedure to copy data from an Apache Hadoop cluster to a Data Fabric cluster.

You can use the hdfs:// protocol, the webhdfs:// protocol, or NFS for the HPE Ezmeral Data Fabric to copy data from an Apache Hadoop cluster to a data-fabric cluster.

The following table describes these methods:

| Method | Description |
|---|---|
| hdfs:// protocol | You can use the hadoop distcp command with the hdfs:// protocol to copy data from an HDFS cluster into a data-fabric cluster if the HDFS cluster and the data-fabric cluster use the same RPC protocol version. For all other scenarios, use the webhdfs:// protocol or the NFS for the HPE Ezmeral Data Fabric gateway to copy data to a data-fabric cluster. |
| webhdfs:// protocol | You can use the hadoop distcp command with the webhdfs:// protocol to copy data from an HDFS cluster into a data-fabric cluster. |
| NFS | You can mount a data-fabric cluster on an HDFS cluster by using NFS for the HPE Ezmeral Data Fabric, and then use the hadoop distcp command to copy data between the two clusters. |
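
As a rough illustration of how the three methods differ on the command line, the following dry-run sketch only prints the hadoop distcp command that each method would use; it does not run a copy. Everything specific in it is an assumption for illustration: the NameNode host nn1.example.com, the ports 8020 (NameNode RPC) and 50070 (NameNode HTTP, which differs between Hadoop releases), the /user/sara paths, the NFS mount point /mapr/my.cluster.com, and the helper name build_distcp_cmd. It also assumes that the destination is addressed with the maprfs:/// scheme for the two protocol-based methods and with a file:// path under the NFS mount for the NFS method. Confirm all of these against your own clusters before running a real copy.

```shell
#!/bin/sh
# Dry-run sketch: prints the distcp command for each method instead of running it.
# Every host name, port, and path here is a placeholder, not a value from this guide.

NAMENODE="nn1.example.com"         # hypothetical HDFS NameNode host
SRC_PATH="/user/sara"              # hypothetical source directory on HDFS
DEST_PATH="/user/sara"             # hypothetical destination directory on the data-fabric cluster
NFS_MOUNT="/mapr/my.cluster.com"   # hypothetical NFS mount point of the data-fabric cluster

# build_distcp_cmd METHOD PORT  (hypothetical helper)
#   METHOD: hdfs | webhdfs | nfs
#   PORT:   NameNode RPC port for hdfs and nfs, NameNode HTTP port for webhdfs
build_distcp_cmd() {
    method="$1"
    port="$2"
    case "$method" in
        hdfs)
            # Same RPC protocol version on both clusters: read over hdfs://
            echo "hadoop distcp hdfs://${NAMENODE}:${port}${SRC_PATH} maprfs://${DEST_PATH}"
            ;;
        webhdfs)
            # Different RPC protocol versions: read over webhdfs:// instead
            echo "hadoop distcp webhdfs://${NAMENODE}:${port}${SRC_PATH} maprfs://${DEST_PATH}"
            ;;
        nfs)
            # Data-fabric cluster mounted over NFS: write to the mount as a local path
            echo "hadoop distcp hdfs://${NAMENODE}:${port}${SRC_PATH} file://${NFS_MOUNT}${DEST_PATH}"
            ;;
        *)
            echo "unknown method: ${method}" >&2
            return 1
            ;;
    esac
}

build_distcp_cmd hdfs 8020
build_distcp_cmd webhdfs 50070
build_distcp_cmd nfs 8020
```

Printing the commands first makes it easy to review the source and destination URIs before starting a potentially long copy. To perform the copy, run the printed command on a node that can reach both clusters.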

Refer to the following sections for information about how to copy data from Hadoop to a data-fabric cluster: