Copying Data from Apache Hadoop to a Data Fabric Cluster

Describes the procedure to copy data from an Apache Hadoop cluster to a Data Fabric cluster.

You can use the hdfs:// protocol, the webhdfs:// protocol, or NFS for the HPE Ezmeral Data Fabric to copy data from an Apache Hadoop cluster to a Data Fabric cluster.

The following table describes these methods:

Method | Description
--- | ---
hdfs:// protocol | Use the hadoop distcp command with the hdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster if both clusters use the same RPC protocol version. For all other scenarios, use the webhdfs:// protocol or the NFS for the HPE Ezmeral Data Fabric gateway to copy data to a Data Fabric cluster.
webhdfs:// protocol | Use the hadoop distcp command with the webhdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster.
NFS | Mount the Data Fabric cluster on the HDFS cluster using NFS for the HPE Ezmeral Data Fabric. Then use the hadoop distcp command to copy data between the two clusters.
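
The three methods above differ mainly in the source and destination URIs passed to hadoop distcp. The following is a minimal sketch, not a definitive procedure: it only prints the command for each method so that you can review it before running anything. The host name, ports, paths, and cluster name are hypothetical placeholders, the distcp_command function is a helper invented for this illustration, and the maprfs:// and file:///mapr/ destination forms are assumptions about how the Data Fabric file system and its NFS mount are addressed in your environment.

```shell
#!/bin/sh
# Hypothetical values; replace them with the ones for your clusters.
NAMENODE_HOST="namenode.example.com"  # assumed HDFS NameNode host
RPC_PORT=8020                         # commonly used NameNode RPC port; yours may differ
HTTP_PORT=9870                        # commonly used NameNode HTTP port on Hadoop 3 (50070 on Hadoop 2)
SRC_PATH="/user/sara"                 # assumed source directory in HDFS
DEST_PATH="/user/sara"                # assumed destination directory in the Data Fabric cluster
CLUSTER_NAME="my.cluster.com"         # assumed cluster name under the /mapr NFS mount point

# Hypothetical helper: print (do not run) the distcp command for one method.
distcp_command() {
  case "$1" in
    hdfs)
      # Same RPC protocol version on both clusters: read the source over hdfs://
      echo "hadoop distcp hdfs://${NAMENODE_HOST}:${RPC_PORT}${SRC_PATH} maprfs://${DEST_PATH}"
      ;;
    webhdfs)
      # Different RPC protocol versions: read the source over webhdfs://
      echo "hadoop distcp webhdfs://${NAMENODE_HOST}:${HTTP_PORT}${SRC_PATH} maprfs://${DEST_PATH}"
      ;;
    nfs)
      # Data Fabric cluster mounted over NFS (assumed at /mapr): write to a local file path
      echo "hadoop distcp hdfs://${NAMENODE_HOST}:${RPC_PORT}${SRC_PATH} file:///mapr/${CLUSTER_NAME}${DEST_PATH}"
      ;;
    *)
      echo "unknown method: $1" >&2
      return 1
      ;;
  esac
}

# Print one candidate command per method.
for method in hdfs webhdfs nfs; do
  distcp_command "$method"
done
```

To run one of the printed commands, copy it to a node that has the hadoop client configured. For the NFS variant, the mount point would likely need to be available on every node that runs the copy tasks, because each task writes to the local file path.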