To ingest a local file into HDFS, we generally use the Hadoop copyFromLocal command, as follows:
<b>
hadoop fs -copyFromLocal <local file or directory path> <HDFS destination directory>
</b>
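As a concrete illustration of the command above, here is a minimal sketch. The local path and HDFS directory are hypothetical placeholders, and the `run` helper with its `DRY_RUN` switch is an assumption added for this example so the commands can be previewed before they touch the cluster.

```shell
#!/bin/sh
# Hypothetical paths, chosen only for illustration.
LOCAL_SRC="/data/local/sales.csv"
HDFS_DEST="/user/hadoop/ingest"

# Helper (an assumption of this sketch): prints the command when DRY_RUN=1
# (the default here), and executes it when DRY_RUN=0.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "$@"
  else
    "$@"
  fi
}

# Create the target directory in HDFS if it does not exist yet.
run hadoop fs -mkdir -p "$HDFS_DEST"

# Copy the local file into the HDFS directory.
run hadoop fs -copyFromLocal "$LOCAL_SRC" "$HDFS_DEST"

# List the target directory to confirm the file arrived.
run hadoop fs -ls "$HDFS_DEST"
```

Running the script with `DRY_RUN=0` on a machine that has the Hadoop client installed and configured for the cluster would execute the three commands instead of printing them.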
Assume there is a Hadoop cluster with one NameNode and multiple DataNodes, and we have one huge file that resides outside the Hadoop cluster and needs to be ingested into HDFS.
What would be the approach for ingesting it?
Data Ingestion in Hadoop cluster