[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-03-01 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13590412#comment-13590412
 ] 

Hudson commented on HADOOP-9230:


Integrated in Hadoop-Yarn-trunk #142 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/142/])
HADOOP-9230. TestUniformSizeInputFormat fails intermittently. (kkambatl via 
tucu) (Revision 1451291)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1451291
Files : 
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/mapred/TestUniformSizeInputFormat.java


 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Fix For: 2.0.4-beta

 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-03-01 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13590483#comment-13590483
 ] 

Hudson commented on HADOOP-9230:


Integrated in Hadoop-Hdfs-trunk #1331 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk/1331/])
HADOOP-9230. TestUniformSizeInputFormat fails intermittently. (kkambatl via 
tucu) (Revision 1451291)

 Result = FAILURE
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1451291
Files : 
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/mapred/TestUniformSizeInputFormat.java


 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Fix For: 2.0.4-beta

 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-03-01 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13590532#comment-13590532
 ] 

Hudson commented on HADOOP-9230:


Integrated in Hadoop-Mapreduce-trunk #1359 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1359/])
HADOOP-9230. TestUniformSizeInputFormat fails intermittently. (kkambatl via 
tucu) (Revision 1451291)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1451291
Files : 
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/mapred/TestUniformSizeInputFormat.java


 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Fix For: 2.0.4-beta

 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-02-28 Thread Alejandro Abdelnur (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13589790#comment-13589790
 ] 

Alejandro Abdelnur commented on HADOOP-9230:


Agree, committing patch.

 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-02-28 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13589809#comment-13589809
 ] 

Hudson commented on HADOOP-9230:


Integrated in Hadoop-trunk-Commit #3396 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/3396/])
HADOOP-9230. TestUniformSizeInputFormat fails intermittently. (kkambatl via 
tucu) (Revision 1451291)

 Result = SUCCESS
tucu : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1451291
Files : 
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/mapred/TestUniformSizeInputFormat.java


 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Fix For: 2.0.4-beta

 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-02-20 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13582033#comment-13582033
 ] 

Karthik Kambatla commented on HADOOP-9230:
--

Thanks for the investigation, Tom. As tailoring the test just to make it pass 
doesn't make sense, I think we should get rid of the test. Pinged [~mithun] 
about 10 days ago to see if he has any additional insights on this.

I think it is safe to commit the patch. 

 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-02-07 Thread Tom White (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13573423#comment-13573423
 ] 

Tom White commented on HADOOP-9230:
---

I guess that the legacy split check was included as a sanity check. Since the 
split calculation is slightly different, the occasional difference should not 
be too surprising. Nor should it be a problem for correctness, since the aim of 
the calculation is to produce roughly equal-sized splits for distcp. Therefore 
I think we can remove the legacy check in order to ensure that the test always 
passes. +1 for the patch.

bq. I would expect the math discrepancy to lead to more than 10% failure rate 
though.

Although the difference between floor and ceiling will almost always produce 
different target split sizes differing by 1 (whenever the number of splits 
doesn't exactly divide the total size), only very rarely will this affect 
whether a file is or is not included in any particular split. This is why the 
failures are relatively rare. 

E.g. on one failure I got the file sizes were 1482, 2012, 1860, ... (making a 
total size of 31439) and for 9 maps the legacy target size works out at 3493, 
while the new target size is 3494. The first legacy split only included 1482 
(since 1482 + 2012  3493) while the first split for the new code includes both 
1482 and 2012.


 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-02-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13570898#comment-13570898
 ] 

Hadoop QA commented on HADOOP-9230:
---

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12567942/hadoop-9230.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-tools/hadoop-distcp.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-HADOOP-Build/2147//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-HADOOP-Build/2147//console

This message is automatically generated.

 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp
 Attachments: hadoop-9230.patch


 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HADOOP-9230) TestUniformSizeInputFormat fails intermittently

2013-01-18 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13557502#comment-13557502
 ] 

Karthik Kambatla commented on HADOOP-9230:
--

checkAgainstLegacy() compares the generated splits against a legacy split 
generation. I don't quite understand the purpose behind this check. Can anyone 
who knows this better throw some light on why we need the test.

I noticed a conflict in the math between UniformSizeInputFormat split 
generation and the legacy generation:

Current:
{code}
long nBytesPerSplit = (long) Math.ceil(totalSizeBytes * 1.0 / numSplits);
{code}

Legacy:
{code}
final long targetsize = totalFileSize / numSplits;
{code}

I would expect the math discrepancy to lead to more than 10% failure rate 
though.

 TestUniformSizeInputFormat fails intermittently
 ---

 Key: HADOOP-9230
 URL: https://issues.apache.org/jira/browse/HADOOP-9230
 Project: Hadoop Common
  Issue Type: Bug
  Components: test
Affects Versions: 2.0.2-alpha
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla
  Labels: distcp

 TestUniformSizeFileInputFormat fails intermittently. I ran the test 50 times 
 and noticed 5 failures.
 Haven't noticed any particular pattern to which runs fail.
 A sample stack trace is as follows:
 {noformat}
 java.lang.AssertionError: expected:1944 but was:1820
 at org.junit.Assert.fail(Assert.java:91)
 at org.junit.Assert.failNotEquals(Assert.java:645)
 at org.junit.Assert.assertEquals(Assert.java:126)
 at org.junit.Assert.assertEquals(Assert.java:470)
 at org.junit.Assert.assertEquals(Assert.java:454)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.checkAgainstLegacy(TestUniformSizeInputFormat.java:244)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:126)
 at 
 org.apache.hadoop.tools.mapred.TestUniformSizeInputFormat.testGetSplits(TestUniformSizeInputFormat.java:252)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira