Описание тега distcp
NoneHadoop tool used for large inter- and intra-cluster copying.
The hadoop distcp command is a tool used for large inter- and intra- cluster copying. It uses mapreduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.