Owen O'Malley wrote:

To copy between clusters, there is a tool called distcp. Look at "bin/hadoop distcp". It runs a map/reduce job that copies a group of files. It can also be used to copy between versions of hadoop, if the source file system is hftp, which uses xml to read hdfs.

Can you further explain the hftp part of this? I'm not familiar with that. We have a similar need to go cross-data center. In an earlier post it
was suggested that there was no map/reduce model for that so this
sounds more like what we're looking for.
--
Steve Sapovits
Invite Media  -  http://www.invitemedia.com
[EMAIL PROTECTED]

Reply via email to