Re: distcp fails with "source and target differ in block-size"

2016-05-24 Thread Chris Nauroth
There is also some discussion on that JIRA considering a checksum strategy independent of block size. I don't think anything was ever implemented though, and there would be some drawbacks to that approach. Sorry if this caused confusion. --Chris Nauroth On 5/24/16, 9:55 AM, "Dmitry Sivachenk

Re: distcp fails with "source and target differ in block-size"

2016-05-24 Thread Dmitry Sivachenko
> On 24 May 2016, at 19:53, Chris Nauroth wrote: > > Hello Dmitry, > > To clarify, the intent of MAPREDUCE-5065 was to message the user that > using different block sizes on source and destination might cause a > failure to checksum mismatch. The message to the user recommends either > the -pb

Re: distcp fails with "source and target differ in block-size"

2016-05-24 Thread Chris Nauroth
Hello Dmitry, To clarify, the intent of MAPREDUCE-5065 was to message the user that using different block sizes on source and destination might cause a failure to checksum mismatch. The message to the user recommends either the -pb (preserve block size) or -skipCrc (skip checksum validation) as p

Re: distcp fails with "source and target differ in block-size"

2016-05-22 Thread Dmitry Sivachenko
> On 21 May 2016, at 09:34, Dmitry Sivachenko wrote: > > >> On 21 May 2016, at 02:15, Chris Nauroth wrote: >> >> Hello Dmitry, >> >> MAPREDUCE-5065 has been included in these branches for a long time. Are >> you certain that you passed a dfs.blocksize equal to what was used in the >> source

Re: distcp fails with "source and target differ in block-size"

2016-05-20 Thread Dmitry Sivachenko
> On 21 May 2016, at 02:15, Chris Nauroth wrote: > > Hello Dmitry, > > MAPREDUCE-5065 has been included in these branches for a long time. Are > you certain that you passed a dfs.blocksize equal to what was used in the > source files? Did all source files use the same block size? > No, I a

Re: distcp fails with "source and target differ in block-size"

2016-05-20 Thread Chris Nauroth
Hello Dmitry, MAPREDUCE-5065 has been included in these branches for a long time. Are you certain that you passed a dfs.blocksize equal to what was used in the source files? Did all source files use the same block size? --Chris Nauroth On 5/20/16, 3:30 PM, "Dmitry Sivachenko" wrote: >Hell

distcp fails with "source and target differ in block-size"

2016-05-20 Thread Dmitry Sivachenko
Hello, When I copy files with distcp and -D dfs.blocksize=XXX (hadoop-2.7.2), it fails with "Source and target differ in block-size" error despite MAPREDUCE-5065 was committed 3 years ago. Is it possible to merge this change to 2.7 / 2.8 branches? Thanks. -