[ https://issues.apache.org/jira/browse/HDFS-3788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13434463#comment-13434463 ]
Jason Lowe commented on HDFS-3788: ---------------------------------- I tested out the patch on trunk and am unable to reproduce Eli's issue. Without the patch both -get and distcp via webhdfs fail, but after the patch I can successfully -get and distcp large files. This is on a pseudo-distributed tarball without security, distcp is {{hadoop distcp webhdfs://localhost:50070/user/someuser/distcpsrc hdfs://localhost:8020/user/someuser/distcpdest}} where distcpsrc/ contains a 3GB file. > distcp can't copy large files using webhdfs due to missing Content-Length > header > -------------------------------------------------------------------------------- > > Key: HDFS-3788 > URL: https://issues.apache.org/jira/browse/HDFS-3788 > Project: Hadoop HDFS > Issue Type: Bug > Components: webhdfs > Affects Versions: 0.23.3, 2.0.0-alpha > Reporter: Eli Collins > Assignee: Tsz Wo (Nicholas), SZE > Priority: Critical > Attachments: distcp-webhdfs-errors.txt, h3788_20120813.patch, > h3788_20120814.patch > > > The following command fails when data1 contains a 3gb file. It passes when > using hftp or when the directory just contains smaller (<2gb) files, so looks > like a webhdfs issue with large files. > {{hadoop distcp webhdfs://eli-thinkpad:50070/user/eli/data1 > hdfs://localhost:8020/user/eli/data2}} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira