Copying Data from Apache Hadoop to a Data Fabric Cluster
Describes the procedure to copy data from an Apache Hadoop to a Data Fabric cluster.
You can use the hdfs protocol, webhdfs protocol, or NFS for the HPE Ezmeral Data Fabric to copy data from Apache Hadoop to a data-fabric cluster.
The following table describes these methods:
Method | Description |
---|---|
hdfs:// protocol | You can use the hadoop distcp command with the
hdfs:// protocol to copy data from a HDFS cluster into a
data-fabric
cluster if the HDFS cluster and the data-fabric cluster use the same RPC protocol
version. For all other scenarios, use the webhdfs://
protocol or NFS for the
HPE Ezmeral Data Fabric gateway to copy data to a data-fabric
cluster. |
webhdfs:// protocol | You can use the hadoop distcp command with the
webhdfs:// protocol to copy data from a HDFS cluster
into a data-fabric cluster. |
NFS | You can mount a data-fabric cluster to a HDFS cluster using NFS for the HPE Ezmeral Data Fabric
mount and then use the hadoop distcp command to copy data
between the two clusters. |
Refer to the following sections for information about how to copy data from Hadoop to a data-fabric cluster: