[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-2954:
--

Attachment: MR2954_2.patch

prime back to 31

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MAPREDUCE-2954-20110909.txt, MR2954_1.patch, 
> MR2954_2.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101036#comment-13101036
 ] 

Hadoop QA commented on MAPREDUCE-2954:
--

+1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12493735/MAPREDUCE-2954-20110909.txt
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/673//console

This message is automatically generated.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MAPREDUCE-2954-20110909.txt, MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerM

[jira] [Commented] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101035#comment-13101035
 ] 

Siddharth Seth commented on MAPREDUCE-2954:
---

Looks ok - but am not sure about the large prime - will almost definitely cause 
the hashcode to wrap around the integer range which is likely not a problem. We 
could revert to the eclipse generated default of 31.

bq. We should be able to do better if we analyse more on our IDs, but this 
should work for now.
Completely agree with this though - clusterTimestamp is in ms, there's unlikely 
to be a very large number of attemptIds and container per app.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MAPREDUCE-2954-20110909.txt, MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2965) Streamline hashCode(), equals(), compareTo() and toString() for all IDs

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101027#comment-13101027
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2965:


Sid or I will take care of this.

> Streamline hashCode(), equals(), compareTo() and toString() for all IDs
> ---
>
> Key: MAPREDUCE-2965
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2965
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Vinod Kumar Vavilapalli
> Fix For: 0.23.0
>
>
> MAPREDUCE-2954 moved these methods to the record interfaces from the PB impls 
> for ContainerId, ApplicationId and ApplicationAttemptId. This is good as they 
> don't need to be tied to the implementation.
> We should do the same for all IDs. In fact some of these are missing for IDs 
> like MR AM JobId, TaskId etc.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-2965) Streamline hashCode(), equals(), compareTo() and toString() for all IDs

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)
Streamline hashCode(), equals(), compareTo() and toString() for all IDs
---

 Key: MAPREDUCE-2965
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2965
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.0


MAPREDUCE-2954 moved these methods to the record interfaces from the PB impls 
for ContainerId, ApplicationId and ApplicationAttemptId. This is good as they 
don't need to be tied to the implementation.

We should do the same for all IDs. In fact some of these are missing for IDs 
like MR AM JobId, TaskId etc.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2957) The TT should not re-init if it has no good local dirs

2011-09-08 Thread Ravi Gummadi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101022#comment-13101022
 ] 

Ravi Gummadi commented on MAPREDUCE-2957:
-

The original implementation in MR-2413 already handles this case of 
all-disks-failed by throwing Exception from checkLocalDirs(). No ?

> The TT should not re-init if it has no good local dirs
> --
>
> Key: MAPREDUCE-2957
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2957
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: tasktracker
>Affects Versions: 0.20.204.0
>Reporter: Eli Collins
> Fix For: 0.20.205.0
>
>
> The TT will currently try to re-init itself on disk failure even if it has no 
> good local dirs. It should shutdown instead.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-2954:
---

Hadoop Flags: [Reviewed]
  Status: Patch Available  (was: Open)

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MAPREDUCE-2954-20110909.txt, MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-2954:
---

Attachment: MAPREDUCE-2954-20110909.txt

hashCode() methods generated by eclipse have better null checks etc. and also 
have double the product-sums.

Attaching patch using the eclipse-generated hashes. We should be able to do 
better if we analyse more on our IDs, but this should work for now.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MAPREDUCE-2954-20110909.txt, MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2933) Change allocate call to return ContainerStatus for completed containers rather than Container

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2933:
-

Attachment: MAPREDUCE-2933.patch

Rebased.

> Change allocate call to return ContainerStatus for completed containers 
> rather than Container 
> --
>
> Key: MAPREDUCE-2933
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2933
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: applicationmaster, mrv2, nodemanager, resourcemanager
>Reporter: Arun C Murthy
>Assignee: Arun C Murthy
>Priority: Blocker
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2933.patch, MAPREDUCE-2933.patch, 
> MAPREDUCE-2933.patch
>
>
> Change allocate call to return ContainerStatus for completed containers 
> rather than Container, we should do this all the way from the NodeManager too.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2933) Change allocate call to return ContainerStatus for completed containers rather than Container

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2933:
-

Status: Patch Available  (was: Open)

> Change allocate call to return ContainerStatus for completed containers 
> rather than Container 
> --
>
> Key: MAPREDUCE-2933
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2933
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: applicationmaster, mrv2, nodemanager, resourcemanager
>Reporter: Arun C Murthy
>Assignee: Arun C Murthy
>Priority: Blocker
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2933.patch, MAPREDUCE-2933.patch, 
> MAPREDUCE-2933.patch
>
>
> Change allocate call to return ContainerStatus for completed containers 
> rather than Container, we should do this all the way from the NodeManager too.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-2954:
---

Status: Open  (was: Patch Available)

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-2964) mapreduce trunk build fails with compile-mapred-test ant target

2011-09-08 Thread Giridharan Kesavan (JIRA)
mapreduce trunk build fails with compile-mapred-test ant target
---

 Key: MAPREDUCE-2964
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2964
 Project: Hadoop Map/Reduce
  Issue Type: Bug
 Environment: apache jenkins build env:
https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/869/console
Reporter: Giridharan Kesavan


{noformat}
compile-mapred-test:
[mkdir] Created dir: 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk-Commit/trunk/hadoop-mapreduce-project/build/test/mapred/classes

  [javac] found   : org.apache.hadoop.mapred.IFile.Writer
[javac] required: 
org.apache.hadoop.mapred.IFile.Writer
[javac] Writer mockWriter = mock(Writer.class);
[javac]  ^
[javac] 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk-Commit/trunk/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/mapred/TestCombineOutputCollector.java:128:
 warning: [unchecked] unchecked conversion
[javac] found   : org.apache.hadoop.mapred.IFile.Writer
[javac] required: 
org.apache.hadoop.mapred.IFile.Writer
[javac] Writer mockWriter = mock(Writer.class);
[javac]  ^
[javac] 
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk-Commit/trunk/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/mapred/TestJvmManager.java:63:
 unreported exception java.io.IOException; must be caught or declared to be 
thrown
[javac] FileUtil.fullyDelete(TEST_DIR);
[javac] ^
[javac] Note: Some input files use or override a deprecated API.
[javac] Note: Recompile with -Xlint:deprecation for details.
[javac] 1 error
[javac] 2 warnings

BUILD FAILED
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk-Commit/trunk/hadoop-mapreduce-project/build.xml:538:
 The following error occurred while executing this line:
/home/jenkins/jenkins-slave/workspace/Hadoop-Mapreduce-trunk-Commit/trunk/hadoop-mapreduce-project/build.xml:615:
 Compile failed; see the compiler error output for details.

{noformat}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2949) NodeManager in a inconsistent state if a service startup fails.

2011-09-08 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100997#comment-13100997
 ] 

Arun C Murthy commented on MAPREDUCE-2949:
--

Ravi, thanks for taking this up.

How does it relate to MAPREDUCE-2956?

> NodeManager in a inconsistent state if a service startup fails.
> ---
>
> Key: MAPREDUCE-2949
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2949
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, nodemanager
>Affects Versions: 0.24.0
>Reporter: Ravi Teja Ch N V
>Assignee: Ravi Teja Ch N V
>
> When a service startup fails at the Nodemanager, the Nodemanager JVM doesnot 
> exit as the following threads are still running.
> Daemon Thread [Timer for 'NodeManager' metrics system] (Running)  
> Thread [pool-1-thread-1] (Running)
> Thread [Thread-11] (Running)  
> Thread [DestroyJavaVM] (Running).
> As a result, the NodeManager keeps running even though no services are 
> started.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2702) [MR-279] OutputCommitter changes for MR Application Master recovery

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2702:
-

Status: Open  (was: Patch Available)

Sharad, the approach looks reasonable I'm doing a detailed review. Can you pls 
rebase against trunk? Thanks.

> [MR-279] OutputCommitter changes for MR Application Master recovery
> ---
>
> Key: MAPREDUCE-2702
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2702
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: mrv2
>Reporter: Sharad Agarwal
>Assignee: Sharad Agarwal
> Attachments: mr2702_v1.patch, mr2702_v2.patch, mr2702_v3.patch
>
>
> In MR AM recovers from a crash, it only reruns the non completed tasks. The 
> completed tasks (along with their output, if any) needs to be recovered from 
> the previous life. This would require some changes in OutputCommitter.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2961) Increase the default threadpool size for container launching in the application master.

2011-09-08 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100996#comment-13100996
 ] 

Arun C Murthy commented on MAPREDUCE-2961:
--

Vinod, what heuristic do you have in mind?

> Increase the default threadpool size for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100991#comment-13100991
 ] 

Siddharth Seth commented on MAPREDUCE-2954:
---

hashCode() wasn't generated by eclipse. they've taken elements from the MrV1 
JobID etc. ApplicationId specifically is something which may need more looking 
into (post RM re-factor and for JobHistory).
Agree with the bit about tying equals(), hashCode, compareTo.. and also 
toString() to all the records. ProtoBase is more of a convenience to provide 
this functionality in all records - and likely the reason for most of the races 
and sync in PBImpls (serializing the proto object for each equals, hashCode). 
This - along with some other PB performance related changes need to be made 
sometime later.


> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Tsz Wo (Nicholas), SZE (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsz Wo (Nicholas), SZE updated MAPREDUCE-2711:
--

   Resolution: Fixed
Fix Version/s: 0.24.0
   0.23.0
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

Thanks Arun for the review.

I have committed this.

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Fix For: 0.23.0, 0.24.0
>
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100986#comment-13100986
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2954:


Patch looks good overall.

Assuming the {{hashCode()}} methods are generated(eclipse?) and good enough.

I also like the fact that now _equals()_, _hashCode()_, and _comparesTo()_ are 
tied to the records themselves instead of the PB implementations. We should do 
this for other records too, but another ticket.

+1 for the patch.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100980#comment-13100980
 ] 

Hudson commented on MAPREDUCE-2711:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #869 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/869/])
MAPREDUCE-2711. Update TestBlockPlacementPolicyRaid for the new namesystem 
and block management APIs.

szetszwo : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1167001
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockPlacementPolicyRaid.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/namenode/NameNodeRaidTestUtil.java


> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2656) Map Reduce Tasks are continously failing, when one among the several harddisks available on the TaskTracker fails.

2011-09-08 Thread Devaraj K (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100979#comment-13100979
 ] 

Devaraj K commented on MAPREDUCE-2656:
--

This is not related to MAPREDUCE-2647. That feature can be provided as 
contrib/tool and whoever interested can use it. We can have further discussions 
in that issue. 

This issue exists only in 0.20.2 version hence resolving it.


> Map Reduce Tasks are continously failing, when one among the several 
> harddisks available on the TaskTracker fails.
> --
>
> Key: MAPREDUCE-2656
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2656
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Affects Versions: 0.20.2, 0.20.3
>Reporter: Devaraj K
>Assignee: Devaraj K
> Fix For: 0.20.2, 0.20.3
>
> Attachments: HADOOP-7130.patch, MAPREDUCE-2656.patch
>
>
> 1. Pull out one hard disk from Task tracker node (out of 10 disks pull one). 
> Now it is noted that some jobs are failing. 
> However process is continued. 
> 2. Wait for sometime (15 mins) and pull out one disk from another Task 
> tracker. 
> 3. More number of jobs failed now and it can be seen from UI. Process is 
> getting paused.
> The exception can be seen in the job tracker UI for a failed job.
> {code:xml} 
> Error initializing attempt_201010221528_10174_m_11_0:
> java.io.IOException: Expecting a line not the end of stream
>  at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
>  at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
>  at org.apache.hadoop.util.Shell.run(Shell.java:137)
>  at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:385)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> Error initializing attempt_201010221528_10174_m_11_1:
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any 
> valid local directory for taskTracker/jobcache/job_201010221528_10174/work
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:454)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> {code} 
> Task Tracker log can be seen here :
> {code:xml} 
> 2010-10-25 16:36:24,215 ERROR mapred.TaskTracker 
> (TaskTracker.java:offerService(1211)) - Caught exception: 
> java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
> at 
> org.apache.hadoop.mapred.TaskTracker.getFreeSpace(TaskTracker.java:1586)
> at 
> org.apache.hadoop.mapred.TaskTracker.transmitHeartBeat(TaskTracker.java:1274)
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1106)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> 2010-10-25 16:36:24,216 INFO  mapred.TaskTracker (TaskTracker.java:run(1856)) 
> - Lost connection to JobTracker [/192.168.97.1:9001].  Retrying...
> java.lang.Exception: java.io.IOException: Expecting a line not the end of 
> stream
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1212)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> Caused by: java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> a

[jira] [Updated] (MAPREDUCE-2656) Map Reduce Tasks are continously failing, when one among the several harddisks available on the TaskTracker fails.

2011-09-08 Thread Devaraj K (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj K updated MAPREDUCE-2656:
-

Resolution: Won't Fix
Status: Resolved  (was: Patch Available)

> Map Reduce Tasks are continously failing, when one among the several 
> harddisks available on the TaskTracker fails.
> --
>
> Key: MAPREDUCE-2656
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2656
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Affects Versions: 0.20.2, 0.20.3
>Reporter: Devaraj K
>Assignee: Devaraj K
> Fix For: 0.20.2, 0.20.3
>
> Attachments: HADOOP-7130.patch, MAPREDUCE-2656.patch
>
>
> 1. Pull out one hard disk from Task tracker node (out of 10 disks pull one). 
> Now it is noted that some jobs are failing. 
> However process is continued. 
> 2. Wait for sometime (15 mins) and pull out one disk from another Task 
> tracker. 
> 3. More number of jobs failed now and it can be seen from UI. Process is 
> getting paused.
> The exception can be seen in the job tracker UI for a failed job.
> {code:xml} 
> Error initializing attempt_201010221528_10174_m_11_0:
> java.io.IOException: Expecting a line not the end of stream
>  at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
>  at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
>  at org.apache.hadoop.util.Shell.run(Shell.java:137)
>  at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:385)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> Error initializing attempt_201010221528_10174_m_11_1:
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any 
> valid local directory for taskTracker/jobcache/job_201010221528_10174/work
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:454)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> {code} 
> Task Tracker log can be seen here :
> {code:xml} 
> 2010-10-25 16:36:24,215 ERROR mapred.TaskTracker 
> (TaskTracker.java:offerService(1211)) - Caught exception: 
> java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
> at 
> org.apache.hadoop.mapred.TaskTracker.getFreeSpace(TaskTracker.java:1586)
> at 
> org.apache.hadoop.mapred.TaskTracker.transmitHeartBeat(TaskTracker.java:1274)
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1106)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> 2010-10-25 16:36:24,216 INFO  mapred.TaskTracker (TaskTracker.java:run(1856)) 
> - Lost connection to JobTracker [/192.168.97.1:9001].  Retrying...
> java.lang.Exception: java.io.IOException: Expecting a line not the end of 
> stream
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1212)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> Caused by: java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
> at 
> org.apache.hadoop.mapred.TaskTracker.getFreeSpace(TaskTracker.java:1586)
> at

[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100977#comment-13100977
 ] 

Hudson commented on MAPREDUCE-2711:
---

Integrated in Hadoop-Hdfs-trunk-Commit #935 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/935/])
MAPREDUCE-2711. Update TestBlockPlacementPolicyRaid for the new namesystem 
and block management APIs.

szetszwo : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1167001
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockPlacementPolicyRaid.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/namenode/NameNodeRaidTestUtil.java


> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100975#comment-13100975
 ] 

Hudson commented on MAPREDUCE-2711:
---

Integrated in Hadoop-Common-trunk-Commit #858 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/858/])
MAPREDUCE-2711. Update TestBlockPlacementPolicyRaid for the new namesystem 
and block management APIs.

szetszwo : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1167001
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/blockmanagement/TestBlockPlacementPolicyRaid.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/hdfs/server/namenode/NameNodeRaidTestUtil.java


> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (MAPREDUCE-2949) NodeManager in a inconsistent state if a service startup fails.

2011-09-08 Thread Ravi Teja Ch N V (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Teja Ch N V reassigned MAPREDUCE-2949:
---

Assignee: Ravi Teja Ch N V

> NodeManager in a inconsistent state if a service startup fails.
> ---
>
> Key: MAPREDUCE-2949
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2949
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, nodemanager
>Affects Versions: 0.24.0
>Reporter: Ravi Teja Ch N V
>Assignee: Ravi Teja Ch N V
>
> When a service startup fails at the Nodemanager, the Nodemanager JVM doesnot 
> exit as the following threads are still running.
> Daemon Thread [Timer for 'NodeManager' metrics system] (Running)  
> Thread [pool-1-thread-1] (Running)
> Thread [Thread-11] (Running)  
> Thread [DestroyJavaVM] (Running).
> As a result, the NodeManager keeps running even though no services are 
> started.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2656) Map Reduce Tasks are continously failing, when one among the several harddisks available on the TaskTracker fails.

2011-09-08 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100961#comment-13100961
 ] 

Arun C Murthy commented on MAPREDUCE-2656:
--

Sorry, I meant to MAPREDUCE-2647 vis-a-vis 0.20.205.

My proposal is we drop this for 0.20.3 which is unlikely to be released now, 
afaics.

> Map Reduce Tasks are continously failing, when one among the several 
> harddisks available on the TaskTracker fails.
> --
>
> Key: MAPREDUCE-2656
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2656
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Affects Versions: 0.20.2, 0.20.3
>Reporter: Devaraj K
>Assignee: Devaraj K
> Fix For: 0.20.2, 0.20.3
>
> Attachments: HADOOP-7130.patch, MAPREDUCE-2656.patch
>
>
> 1. Pull out one hard disk from Task tracker node (out of 10 disks pull one). 
> Now it is noted that some jobs are failing. 
> However process is continued. 
> 2. Wait for sometime (15 mins) and pull out one disk from another Task 
> tracker. 
> 3. More number of jobs failed now and it can be seen from UI. Process is 
> getting paused.
> The exception can be seen in the job tracker UI for a failed job.
> {code:xml} 
> Error initializing attempt_201010221528_10174_m_11_0:
> java.io.IOException: Expecting a line not the end of stream
>  at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
>  at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
>  at org.apache.hadoop.util.Shell.run(Shell.java:137)
>  at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:385)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> Error initializing attempt_201010221528_10174_m_11_1:
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any 
> valid local directory for taskTracker/jobcache/job_201010221528_10174/work
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:454)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> {code} 
> Task Tracker log can be seen here :
> {code:xml} 
> 2010-10-25 16:36:24,215 ERROR mapred.TaskTracker 
> (TaskTracker.java:offerService(1211)) - Caught exception: 
> java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
> at 
> org.apache.hadoop.mapred.TaskTracker.getFreeSpace(TaskTracker.java:1586)
> at 
> org.apache.hadoop.mapred.TaskTracker.transmitHeartBeat(TaskTracker.java:1274)
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1106)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> 2010-10-25 16:36:24,216 INFO  mapred.TaskTracker (TaskTracker.java:run(1856)) 
> - Lost connection to JobTracker [/192.168.97.1:9001].  Retrying...
> java.lang.Exception: java.io.IOException: Expecting a line not the end of 
> stream
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1212)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> Caused by: java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs

[jira] [Commented] (MAPREDUCE-2961) Increase the default threadpool size for container launching in the application master.

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100963#comment-13100963
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2961:


I definitely think that instead of a config, this should be a function of 
#nodes and #tasks with defaults to 10. Will provide a patch for the same.

> Increase the default threadpool size for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2656) Map Reduce Tasks are continously failing, when one among the several harddisks available on the TaskTracker fails.

2011-09-08 Thread Devaraj K (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100959#comment-13100959
 ] 

Devaraj K commented on MAPREDUCE-2656:
--

@Eli : This issue is already rectified in branch-20-security. This patch is 
relevant to 20.2. If it is not useful further, we can close this issue.

@Arun: It doesn't relate to MAPREDUCE-2143. I am not getting why MAPREDUCE-2143 
has come into picture.

> Map Reduce Tasks are continously failing, when one among the several 
> harddisks available on the TaskTracker fails.
> --
>
> Key: MAPREDUCE-2656
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2656
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>Affects Versions: 0.20.2, 0.20.3
>Reporter: Devaraj K
>Assignee: Devaraj K
> Fix For: 0.20.2, 0.20.3
>
> Attachments: HADOOP-7130.patch, MAPREDUCE-2656.patch
>
>
> 1. Pull out one hard disk from Task tracker node (out of 10 disks pull one). 
> Now it is noted that some jobs are failing. 
> However process is continued. 
> 2. Wait for sometime (15 mins) and pull out one disk from another Task 
> tracker. 
> 3. More number of jobs failed now and it can be seen from UI. Process is 
> getting paused.
> The exception can be seen in the job tracker UI for a failed job.
> {code:xml} 
> Error initializing attempt_201010221528_10174_m_11_0:
> java.io.IOException: Expecting a line not the end of stream
>  at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
>  at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
>  at org.apache.hadoop.util.Shell.run(Shell.java:137)
>  at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:385)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> Error initializing attempt_201010221528_10174_m_11_1:
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any 
> valid local directory for taskTracker/jobcache/job_201010221528_10174/work
>  at 
> org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:454)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:134)
>  at 
> org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:113)
>  at org.apache.hadoop.mapred.TaskTracker.localizeJob(TaskTracker.java:835)
>  at org.apache.hadoop.mapred.TaskTracker.startNewTask(TaskTracker.java:1790)
>  at org.apache.hadoop.mapred.TaskTracker.access$1200(TaskTracker.java:104)
>  at 
> org.apache.hadoop.mapred.TaskTracker$TaskLauncher.run(TaskTracker.java:1753)
> {code} 
> Task Tracker log can be seen here :
> {code:xml} 
> 2010-10-25 16:36:24,215 ERROR mapred.TaskTracker 
> (TaskTracker.java:offerService(1211)) - Caught exception: 
> java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell.java:182)
> at org.apache.hadoop.util.Shell.run(Shell.java:137)
> at org.apache.hadoop.fs.DF.getAvailable(DF.java:74)
> at 
> org.apache.hadoop.mapred.TaskTracker.getFreeSpace(TaskTracker.java:1586)
> at 
> org.apache.hadoop.mapred.TaskTracker.transmitHeartBeat(TaskTracker.java:1274)
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1106)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> 2010-10-25 16:36:24,216 INFO  mapred.TaskTracker (TaskTracker.java:run(1856)) 
> - Lost connection to JobTracker [/192.168.97.1:9001].  Retrying...
> java.lang.Exception: java.io.IOException: Expecting a line not the end of 
> stream
> at 
> org.apache.hadoop.mapred.TaskTracker.offerService(TaskTracker.java:1212)
> at org.apache.hadoop.mapred.TaskTracker.run(TaskTracker.java:1848)
> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3022)
> Caused by: java.io.IOException: Expecting a line not the end of stream
> at org.apache.hadoop.fs.DF.parseExecResult(DF.java:110)
> at org.apache.hadoop.util.Shell.runCommand(Shell

[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100955#comment-13100955
 ] 

Arun C Murthy commented on MAPREDUCE-2711:
--

+1

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2952) Application failure diagnostics are not consumed in a couple of cases

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100952#comment-13100952
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2952:


bq. Vinod, you mean MAPREDUCE-2937.
Yes.

> Application failure diagnostics are not consumed in a couple of cases
> -
>
> Key: MAPREDUCE-2952
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2952
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Priority: Blocker
> Fix For: 0.23.0
>
>
> When Container crashes, the reason for failures isn't propagated because of a 
> bug in _RMAppAttemptImpl.AMContainerCrashedTransition_ which simply discards 
> the diagnostics of the container. Also RMAppAttemptImpl.diagnostics is never 
> consumed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100950#comment-13100950
 ] 

Hadoop QA commented on MAPREDUCE-2711:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493722/m2711_20110908.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/672//console

This message is automatically generated.

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100949#comment-13100949
 ] 

Hadoop QA commented on MAPREDUCE-2711:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493722/m2711_20110908.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/671//console

This message is automatically generated.

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2691) Finish up the cleanup of distributed cache file resources and related tests.

2011-09-08 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100945#comment-13100945
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2691:


bq. Draining events immediately - don't quite remember why I added the option 
for a delayed drain
The delayed drain was already there on trunk, just not used.

bq. Yep, deletion exists in MRv2 and works. For LRU, ref MR 2494, 2572.
Thanks, I'll file corresponding ticket for MRV2.

> Finish up the cleanup of distributed cache file resources and related tests.
> 
>
> Key: MAPREDUCE-2691
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2691
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Amol Kekre
>Assignee: Siddharth Seth
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2691_1.patch, MR2691_2.patch, MR2691_3.patch
>
>
> Implement cleanup of distributed cache file resources

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Tsz Wo (Nicholas), SZE (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsz Wo (Nicholas), SZE updated MAPREDUCE-2711:
--

Description: {{TestBlockPlacementPolicyRaid}} access internal 
{{FSNamesystem}} directly.  It cannot be compiled after HDFS-2147.
Environment: (was: {{TestBlockPlacementPolicyRaid}} access internal 
{{FSNamesystem}} directly.  It cannot be compiled after HDFS-2147.)

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>
> {{TestBlockPlacementPolicyRaid}} access internal {{FSNamesystem}} directly.  
> It cannot be compiled after HDFS-2147.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Tsz Wo (Nicholas), SZE (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsz Wo (Nicholas), SZE updated MAPREDUCE-2711:
--

Status: Patch Available  (was: In Progress)

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
> Environment: {{TestBlockPlacementPolicyRaid}} access internal 
> {{FSNamesystem}} directly.  It cannot be compiled after HDFS-2147.
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Tsz Wo (Nicholas), SZE (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tsz Wo (Nicholas), SZE updated MAPREDUCE-2711:
--

Attachment: m2711_20110908.patch

m2711_20110908.patch: updated with trunk

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
> Environment: {{TestBlockPlacementPolicyRaid}} access internal 
> {{FSNamesystem}} directly.  It cannot be compiled after HDFS-2147.
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch, m2711_20110908.patch
>
>


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2952) Application failure diagnostics are not consumed in a couple of cases

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2952:
-

Priority: Blocker  (was: Major)

> Application failure diagnostics are not consumed in a couple of cases
> -
>
> Key: MAPREDUCE-2952
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2952
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Priority: Blocker
> Fix For: 0.23.0
>
>
> When Container crashes, the reason for failures isn't propagated because of a 
> bug in _RMAppAttemptImpl.AMContainerCrashedTransition_ which simply discards 
> the diagnostics of the container. Also RMAppAttemptImpl.diagnostics is never 
> consumed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2675) MR-279: JobHistory Server main page needs to be reformatted

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2675:
-

Status: Open  (was: Patch Available)

Cancelling patch while Sid's feedback is incorporated.

> MR-279: JobHistory Server main page needs to be reformatted
> ---
>
> Key: MAPREDUCE-2675
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2675
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Robert Joseph Evans
>Assignee: Robert Joseph Evans
> Fix For: 0.23.0
>
> Attachments: MR-2675-V1.patch, MR-2675-V2.patch, MR-2675-V3.patch, 
> MR-2675-V4.patch, MR-2675-preview.patch
>
>
> The main page of the Job History Server is based off of the Application 
> Master code.  It needs to be reformatted to be more useful and better match 
> what was there before.
> - The Active Jobs title needs to be replaced with something more appropriate 
> (i.e. Retired Jobs)
> - The table of jobs should have the following columns in it
>   - Submit time, Job Id, Job Name, User and just because I think it would be 
> useful state, maps completed, maps failed, reduces completed, reduces failed
> - The table needs more advanced filtering, something like 
> http://datatables.net/release-datatables/examples/api/multi_filter.html This 
> is to match the previous search functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2961) Increase the default threadpool size for container launching in the application master.

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2961:
-

Status: Open  (was: Patch Available)

Mahadev, you need to rebase after MAPREDUCE-2864. Thanks.

> Increase the default threadpool size for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2691) Finish up the cleanup of distributed cache file resources and related tests.

2011-09-08 Thread Siddharth Seth (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100916#comment-13100916
 ] 

Siddharth Seth commented on MAPREDUCE-2691:
---

Thanks for the detailed review Vinod.

ContainerImpl.handle and ContainerManager.stopContainer()diagnostic - will 
revert the changes. 
Vaguely remember seing diagnostic messages stacking up - where a single 
container would end up with multiple copies of messages. Will create a jira for 
that when I see it again.

bq. RELEASE_CONTAINER_RESOURCES is always sent along with 
CLEANUP_CONTAINER_RESOURCES event. I think we should just merge these into 
CLEANUP_CONTAINER_RESOURCES event itself. This will also be inline with the 
fact that creation of container-dirs and the localization of files both happen 
as part of a single event INIT_CONTAINER_RESOURCES, so cleanup should also be a 
single event. We can send a Map> as the event payload. To be symmetric, we 
should probably also merge the multiple INIT_CONTAINER_RESOURCES calls one for 
each LocalResourceVisibility to be a single event. Thoughts?
Sounds good. Will make the changes. Had added separate events for 
RELEASE_CONTAINER_RESOURCES to be consistent with the way resources were 
requested - 1 event for each type. Don't really see a reason for the requests 
to be sent separately though.

Will make the changes in the test cases. 
Draining events immediately - don't quite remember why I added the option for a 
delayed drain - possibly to be able to drain events 1 at a time sometime later. 
Anyway, it can be added back if required.
Completely agree about the mocks - there's way too much and makes some of the 
tests hard to understand. Will try getting rid of some of them.

bq. There is existing code for purging of cache under disk pressure - See 
ResourceLocalization.CacheCleanup and ResourceRetentionSet. (We need tests for 
this though, will file a ticket) This only deletes files that aren't in use at 
all. By LRU, do you mean selective deletion of these files based on their 
usage? Can you please point me to the relevant MRV1 JIRA? Thanks!
Yep, deletion exists in MRv2 and works. For LRU, ref MR 2494, 2572.

> Finish up the cleanup of distributed cache file resources and related tests.
> 
>
> Key: MAPREDUCE-2691
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2691
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Amol Kekre
>Assignee: Siddharth Seth
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2691_1.patch, MR2691_2.patch, MR2691_3.patch
>
>
> Implement cleanup of distributed cache file resources

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100907#comment-13100907
 ] 

Hudson commented on MAPREDUCE-2963:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #868 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/868/])
MAPREDUCE-2963. Fix hang in TestMRJobs. Contributed by Siddharth Seth.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166975
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/ClientServiceDelegate.java


> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-279) Map-Reduce 2.0

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100906#comment-13100906
 ] 

Hudson commented on MAPREDUCE-279:
--

Integrated in Hadoop-Mapreduce-trunk-Commit #868 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/868/])
Adding back 
hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources which 
was missed during the merge of MAPREDUCE-279.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166972
Files : 
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/META-INF/services/org.apache.hadoop.mapreduce.protocol.ClientProtocolProvider


> Map-Reduce 2.0
> --
>
> Key: MAPREDUCE-279
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-279
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Arun C Murthy
> Fix For: 0.23.0
>
> Attachments: MR-279-script-20110817.sh, MR-279-script-final.sh, 
> MR-279-script.sh, MR-279-script.sh, MR-279.patch, MR-279.patch, MR-279.sh, 
> MR-279_MR_files_to_move-20110817.txt, MR-279_MR_files_to_move.txt, 
> MR-279_MR_files_to_move.txt, MapReduce_NextGen_Architecture.pdf, 
> NodeManager.gv, NodeManager.png, ResourceManager.gv, ResourceManager.png, 
> capacity-scheduler-dark-theme.png, hadoop_contributors_meet_07_01_2011.pdf, 
> multi-column-stable-sort-default-theme.png, post-move-patch-20110817.2.txt, 
> post-move-patch-final.txt, post-move.patch, post-move.patch, post-move.patch, 
> yarn-state-machine.job.dot, yarn-state-machine.job.png, 
> yarn-state-machine.task-attempt.dot, yarn-state-machine.task-attempt.png, 
> yarn-state-machine.task.dot, yarn-state-machine.task.png
>
>
> Re-factor MapReduce into a generic resource scheduler and a per-job, 
> user-defined component that manages the application execution.
> Check it out by following [the instructions|http://goo.gl/rSJJC].

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100905#comment-13100905
 ] 

Hudson commented on MAPREDUCE-2963:
---

Integrated in Hadoop-Hdfs-trunk-Commit #934 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/934/])
MAPREDUCE-2963. Fix hang in TestMRJobs. Contributed by Siddharth Seth.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166975
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/ClientServiceDelegate.java


> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-279) Map-Reduce 2.0

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100904#comment-13100904
 ] 

Hudson commented on MAPREDUCE-279:
--

Integrated in Hadoop-Hdfs-trunk-Commit #934 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/934/])
Adding back 
hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources which 
was missed during the merge of MAPREDUCE-279.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166972
Files : 
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/META-INF/services/org.apache.hadoop.mapreduce.protocol.ClientProtocolProvider


> Map-Reduce 2.0
> --
>
> Key: MAPREDUCE-279
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-279
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Arun C Murthy
> Fix For: 0.23.0
>
> Attachments: MR-279-script-20110817.sh, MR-279-script-final.sh, 
> MR-279-script.sh, MR-279-script.sh, MR-279.patch, MR-279.patch, MR-279.sh, 
> MR-279_MR_files_to_move-20110817.txt, MR-279_MR_files_to_move.txt, 
> MR-279_MR_files_to_move.txt, MapReduce_NextGen_Architecture.pdf, 
> NodeManager.gv, NodeManager.png, ResourceManager.gv, ResourceManager.png, 
> capacity-scheduler-dark-theme.png, hadoop_contributors_meet_07_01_2011.pdf, 
> multi-column-stable-sort-default-theme.png, post-move-patch-20110817.2.txt, 
> post-move-patch-final.txt, post-move.patch, post-move.patch, post-move.patch, 
> yarn-state-machine.job.dot, yarn-state-machine.job.png, 
> yarn-state-machine.task-attempt.dot, yarn-state-machine.task-attempt.png, 
> yarn-state-machine.task.dot, yarn-state-machine.task.png
>
>
> Re-factor MapReduce into a generic resource scheduler and a per-job, 
> user-defined component that manages the application execution.
> Check it out by following [the instructions|http://goo.gl/rSJJC].

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-279) Map-Reduce 2.0

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100902#comment-13100902
 ] 

Hudson commented on MAPREDUCE-279:
--

Integrated in Hadoop-Common-trunk-Commit #857 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/857/])
Adding back 
hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources which 
was missed during the merge of MAPREDUCE-279.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166972
Files : 
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/META-INF/services/org.apache.hadoop.mapreduce.protocol.ClientProtocolProvider


> Map-Reduce 2.0
> --
>
> Key: MAPREDUCE-279
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-279
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Arun C Murthy
> Fix For: 0.23.0
>
> Attachments: MR-279-script-20110817.sh, MR-279-script-final.sh, 
> MR-279-script.sh, MR-279-script.sh, MR-279.patch, MR-279.patch, MR-279.sh, 
> MR-279_MR_files_to_move-20110817.txt, MR-279_MR_files_to_move.txt, 
> MR-279_MR_files_to_move.txt, MapReduce_NextGen_Architecture.pdf, 
> NodeManager.gv, NodeManager.png, ResourceManager.gv, ResourceManager.png, 
> capacity-scheduler-dark-theme.png, hadoop_contributors_meet_07_01_2011.pdf, 
> multi-column-stable-sort-default-theme.png, post-move-patch-20110817.2.txt, 
> post-move-patch-final.txt, post-move.patch, post-move.patch, post-move.patch, 
> yarn-state-machine.job.dot, yarn-state-machine.job.png, 
> yarn-state-machine.task-attempt.dot, yarn-state-machine.task-attempt.png, 
> yarn-state-machine.task.dot, yarn-state-machine.task.png
>
>
> Re-factor MapReduce into a generic resource scheduler and a per-job, 
> user-defined component that manages the application execution.
> Check it out by following [the instructions|http://goo.gl/rSJJC].

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100903#comment-13100903
 ] 

Hudson commented on MAPREDUCE-2963:
---

Integrated in Hadoop-Common-trunk-Commit #857 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/857/])
MAPREDUCE-2963. Fix hang in TestMRJobs. Contributed by Siddharth Seth.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166975
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/ClientServiceDelegate.java


> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2963:
-

   Resolution: Fixed
Fix Version/s: (was: 0.24.0)
   Status: Resolved  (was: Patch Available)

I just committed this. Thanks Sid!

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100899#comment-13100899
 ] 

Hadoop QA commented on MAPREDUCE-2963:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493719/MR2963_v1.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/670//console

This message is automatically generated.

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (MAPREDUCE-2711) TestBlockPlacementPolicyRaid cannot be compiled

2011-09-08 Thread Tsz Wo (Nicholas), SZE (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAPREDUCE-2711 started by Tsz Wo (Nicholas), SZE.

> TestBlockPlacementPolicyRaid cannot be compiled
> ---
>
> Key: MAPREDUCE-2711
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2711
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: contrib/raid
> Environment: {{TestBlockPlacementPolicyRaid}} access internal 
> {{FSNamesystem}} directly.  It cannot be compiled after HDFS-2147.
>Reporter: Tsz Wo (Nicholas), SZE
>Assignee: Tsz Wo (Nicholas), SZE
> Attachments: m2711_20110719_TestBlockPlacementPolicyRaid.java, 
> m2711_20110727.patch, m2711_20110818.patch
>
>


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2952) Application failure diagnostics are not consumed in a couple of cases

2011-09-08 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100896#comment-13100896
 ] 

Arun C Murthy commented on MAPREDUCE-2952:
--

Vinod, you mean MAPREDUCE-2937.

> Application failure diagnostics are not consumed in a couple of cases
> -
>
> Key: MAPREDUCE-2952
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2952
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
> Fix For: 0.23.0
>
>
> When Container crashes, the reason for failures isn't propagated because of a 
> bug in _RMAppAttemptImpl.AMContainerCrashedTransition_ which simply discards 
> the diagnostics of the container. Also RMAppAttemptImpl.diagnostics is never 
> consumed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-2958) mapred-default.xml not merged from mr279

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy resolved MAPREDUCE-2958.
--

Resolution: Fixed
  Assignee: Arun C Murthy  (was: Mahadev konar)

Thanks Thomas, I fixed it by doing an 'svn copy' to both trunk and branch-0.23.

> mapred-default.xml not merged from mr279
> 
>
> Key: MAPREDUCE-2958
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2958
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Thomas Graves
>Assignee: Arun C Murthy
>Priority: Critical
> Fix For: 0.23.0
>
>
> I have been running wordcount out of the 23 examples jar.  It says it 
> succeeds but doesn't actually output a file.
> hadoop jar 
> examples/hadoop-mapreduce-0.23.0-SNAPSHOT/hadoop-mapreduce-examples-0.23.0-SNAPSHOT.jar
>  wordcount input output2
> input file is really basic:
> fdksajl
> dlkfsajlfljda;j
> kldfsjallj
> test
> one
> two
> test

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-2963:
--

Attachment: MR2963_v1.patch

Trivial patch - getTaskReports wasn't setting request params. Haven't included 
any tests.

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-2963:
--

Fix Version/s: 0.24.0
Affects Version/s: 0.24.0
   Status: Patch Available  (was: Open)

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2963_v1.patch
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth reassigned MAPREDUCE-2963:
-

Assignee: Siddharth Seth

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2958) mapred-default.xml not merged from mr279

2011-09-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated MAPREDUCE-2958:
-

Assignee: Mahadev konar  (was: Thomas Graves)
 Summary: mapred-default.xml not merged from mr279  (was: wordcount 
succeeds but doesn't output file)

> mapred-default.xml not merged from mr279
> 
>
> Key: MAPREDUCE-2958
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2958
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Thomas Graves
>Assignee: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
>
> I have been running wordcount out of the 23 examples jar.  It says it 
> succeeds but doesn't actually output a file.
> hadoop jar 
> examples/hadoop-mapreduce-0.23.0-SNAPSHOT/hadoop-mapreduce-examples-0.23.0-SNAPSHOT.jar
>  wordcount input output2
> input file is really basic:
> fdksajl
> dlkfsajlfljda;j
> kldfsjallj
> test
> one
> two
> test

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2958) wordcount succeeds but doesn't output file

2011-09-08 Thread Thomas Graves (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100889#comment-13100889
 ] 

Thomas Graves commented on MAPREDUCE-2958:
--

it looks like the ./mr-client/hadoop-mapreduce-client-core/src/main/resources/ 
directory never got merged to trunk from mr-279 branch. That directory included 
the mapred-default.xml that included the default reducers. 

Assigned to Mahadev to investigate if it was intentional or accidentally missed 
during the merge.

> wordcount succeeds but doesn't output file
> --
>
> Key: MAPREDUCE-2958
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2958
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Thomas Graves
>Assignee: Thomas Graves
>Priority: Critical
> Fix For: 0.23.0
>
>
> I have been running wordcount out of the 23 examples jar.  It says it 
> succeeds but doesn't actually output a file.
> hadoop jar 
> examples/hadoop-mapreduce-0.23.0-SNAPSHOT/hadoop-mapreduce-examples-0.23.0-SNAPSHOT.jar
>  wordcount input output2
> input file is really basic:
> fdksajl
> dlkfsajlfljda;j
> kldfsjallj
> test
> one
> two
> test

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2953) JobClient fails due to a race in RM, removes staged files and in turn crashes MR AM

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100883#comment-13100883
 ] 

Hudson commented on MAPREDUCE-2953:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #867 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/867/])
MAPREDUCE-2953. Fix a race condition on submission which caused client to 
incorrectly assume application was gone by making submission synchronous for 
RMAppManager. Contributed by Thomas Graves.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166968
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ClientRMService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/RMAppManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ResourceManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/MockRM.java


> JobClient fails due to a race in RM, removes staged files and in turn crashes 
> MR AM
> ---
>
> Key: MAPREDUCE-2953
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2953
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2953-v2.patch, MAPREDUCE-2953-v3.patch, 
> MAPREDUCE-2953.patch
>
>
> [~Karams] ran into this multiple times. MR JobClient crashes immediately.
> {code}
> 11/09/08 10:52:35 INFO mapreduce.JobSubmitter: number of splits:2094
> 11/09/08 10:52:36 INFO mapred.YARNRunner: AppMaster capability = memory: 2048,
> 11/09/08 10:52:36 INFO mapred.YARNRunner: Command to launch container for 
> ApplicationMaster is : $JAVA_HOME/bin/java -Dhadoop.root.logger=INFO,console 
> -Xmx1536m org.apache.hadoop.mapreduce.v2.app.MRAppMaster 1315478927026 1 
>  1>/stdout 2>/stderr
> 11/09/08 10:52:36 INFO mapred.ResourceMgrDelegate: Submitted application 
> application_1315478927026_1 to ResourceManager
> 11/09/08 10:52:36 INFO mapreduce.JobSubmitter: Cleaning up the staging area 
> /user/gridperf/.staging/job_1315478927026_0001
> RemoteTrace:
>  at Local Trace:
> org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: 
> failed to run job
> at 
> org.apache.hadoop.yarn.factories.impl.pb.YarnRemoteExceptionFactoryPBImpl.createYarnRemoteException(YarnRemoteExceptionFactoryPBImpl.java:39)
> at 
> org.apache.hadoop.yarn.ipc.RPCUtil.getRemoteException(RPCUtil.java:47)
> at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:250)
> at 
> org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:377)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1072)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1069)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
> at org.apache.hadoop.mapreduce.Job.submit(Job.java:1069)
> at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1089)
> at org.apache.hadoop.examples.RandomWriter.run(RandomWriter.java:283)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> at org.apache.hadoop.examples.RandomWriter.main(RandomWriter.java:294)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at 
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
> at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
> at 
> org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.D

[jira] [Commented] (MAPREDUCE-2937) Errors in Application failures are not shown in the client trace.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100884#comment-13100884
 ] 

Hudson commented on MAPREDUCE-2937:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #867 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/867/])
MAPREDUCE-2937. Ensure reason for application failure is displayed to the 
user. Contributed by Mahadev Konar.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166966
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/YARNRunner.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestYARNRunner.java


> Errors in Application failures are not shown in the client trace.
> -
>
> Key: MAPREDUCE-2937
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2937
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2937.patch, MAPREDUCE-2937.patch, 
> MAPREDUCE-2937.patch
>
>
> The client side does not show enough information on why the job failed. Here 
> is step to reproduce it:
> 1) set the scheduler to be capacity scheduler with queues a, b
> 2) submit a job to a queue that is not a,b
> The job just fails without saying why it failed. We should have enough trace 
> log at the client side to let the user know why it failed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2953) JobClient fails due to a race in RM, removes staged files and in turn crashes MR AM

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100879#comment-13100879
 ] 

Hudson commented on MAPREDUCE-2953:
---

Integrated in Hadoop-Hdfs-trunk-Commit #933 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/933/])
MAPREDUCE-2953. Fix a race condition on submission which caused client to 
incorrectly assume application was gone by making submission synchronous for 
RMAppManager. Contributed by Thomas Graves.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166968
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ClientRMService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/RMAppManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ResourceManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/MockRM.java


> JobClient fails due to a race in RM, removes staged files and in turn crashes 
> MR AM
> ---
>
> Key: MAPREDUCE-2953
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2953
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2953-v2.patch, MAPREDUCE-2953-v3.patch, 
> MAPREDUCE-2953.patch
>
>
> [~Karams] ran into this multiple times. MR JobClient crashes immediately.
> {code}
> 11/09/08 10:52:35 INFO mapreduce.JobSubmitter: number of splits:2094
> 11/09/08 10:52:36 INFO mapred.YARNRunner: AppMaster capability = memory: 2048,
> 11/09/08 10:52:36 INFO mapred.YARNRunner: Command to launch container for 
> ApplicationMaster is : $JAVA_HOME/bin/java -Dhadoop.root.logger=INFO,console 
> -Xmx1536m org.apache.hadoop.mapreduce.v2.app.MRAppMaster 1315478927026 1 
>  1>/stdout 2>/stderr
> 11/09/08 10:52:36 INFO mapred.ResourceMgrDelegate: Submitted application 
> application_1315478927026_1 to ResourceManager
> 11/09/08 10:52:36 INFO mapreduce.JobSubmitter: Cleaning up the staging area 
> /user/gridperf/.staging/job_1315478927026_0001
> RemoteTrace:
>  at Local Trace:
> org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: 
> failed to run job
> at 
> org.apache.hadoop.yarn.factories.impl.pb.YarnRemoteExceptionFactoryPBImpl.createYarnRemoteException(YarnRemoteExceptionFactoryPBImpl.java:39)
> at 
> org.apache.hadoop.yarn.ipc.RPCUtil.getRemoteException(RPCUtil.java:47)
> at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:250)
> at 
> org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:377)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1072)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1069)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
> at org.apache.hadoop.mapreduce.Job.submit(Job.java:1069)
> at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1089)
> at org.apache.hadoop.examples.RandomWriter.run(RandomWriter.java:283)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> at org.apache.hadoop.examples.RandomWriter.main(RandomWriter.java:294)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at 
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
> at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
> at 
> org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingM

[jira] [Commented] (MAPREDUCE-2937) Errors in Application failures are not shown in the client trace.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100880#comment-13100880
 ] 

Hudson commented on MAPREDUCE-2937:
---

Integrated in Hadoop-Hdfs-trunk-Commit #933 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/933/])
MAPREDUCE-2937. Ensure reason for application failure is displayed to the 
user. Contributed by Mahadev Konar.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166966
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/YARNRunner.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestYARNRunner.java


> Errors in Application failures are not shown in the client trace.
> -
>
> Key: MAPREDUCE-2937
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2937
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2937.patch, MAPREDUCE-2937.patch, 
> MAPREDUCE-2937.patch
>
>
> The client side does not show enough information on why the job failed. Here 
> is step to reproduce it:
> 1) set the scheduler to be capacity scheduler with queues a, b
> 2) submit a job to a queue that is not a,b
> The job just fails without saying why it failed. We should have enough trace 
> log at the client side to let the user know why it failed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2953) JobClient fails due to a race in RM, removes staged files and in turn crashes MR AM

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100876#comment-13100876
 ] 

Hudson commented on MAPREDUCE-2953:
---

Integrated in Hadoop-Common-trunk-Commit #856 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/856/])
MAPREDUCE-2953. Fix a race condition on submission which caused client to 
incorrectly assume application was gone by making submission synchronous for 
RMAppManager. Contributed by Thomas Graves.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166968
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ClientRMService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/RMAppManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ResourceManager.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/MockRM.java


> JobClient fails due to a race in RM, removes staged files and in turn crashes 
> MR AM
> ---
>
> Key: MAPREDUCE-2953
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2953
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2953-v2.patch, MAPREDUCE-2953-v3.patch, 
> MAPREDUCE-2953.patch
>
>
> [~Karams] ran into this multiple times. MR JobClient crashes immediately.
> {code}
> 11/09/08 10:52:35 INFO mapreduce.JobSubmitter: number of splits:2094
> 11/09/08 10:52:36 INFO mapred.YARNRunner: AppMaster capability = memory: 2048,
> 11/09/08 10:52:36 INFO mapred.YARNRunner: Command to launch container for 
> ApplicationMaster is : $JAVA_HOME/bin/java -Dhadoop.root.logger=INFO,console 
> -Xmx1536m org.apache.hadoop.mapreduce.v2.app.MRAppMaster 1315478927026 1 
>  1>/stdout 2>/stderr
> 11/09/08 10:52:36 INFO mapred.ResourceMgrDelegate: Submitted application 
> application_1315478927026_1 to ResourceManager
> 11/09/08 10:52:36 INFO mapreduce.JobSubmitter: Cleaning up the staging area 
> /user/gridperf/.staging/job_1315478927026_0001
> RemoteTrace:
>  at Local Trace:
> org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: 
> failed to run job
> at 
> org.apache.hadoop.yarn.factories.impl.pb.YarnRemoteExceptionFactoryPBImpl.createYarnRemoteException(YarnRemoteExceptionFactoryPBImpl.java:39)
> at 
> org.apache.hadoop.yarn.ipc.RPCUtil.getRemoteException(RPCUtil.java:47)
> at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:250)
> at 
> org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:377)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1072)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1069)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
> at org.apache.hadoop.mapreduce.Job.submit(Job.java:1069)
> at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1089)
> at org.apache.hadoop.examples.RandomWriter.run(RandomWriter.java:283)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> at org.apache.hadoop.examples.RandomWriter.main(RandomWriter.java:294)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at 
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
> at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
> at 
> org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.Delegat

[jira] [Commented] (MAPREDUCE-2937) Errors in Application failures are not shown in the client trace.

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100877#comment-13100877
 ] 

Hudson commented on MAPREDUCE-2937:
---

Integrated in Hadoop-Common-trunk-Commit #856 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/856/])
MAPREDUCE-2937. Ensure reason for application failure is displayed to the 
user. Contributed by Mahadev Konar.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166966
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/YARNRunner.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapreduce/v2/TestYARNRunner.java


> Errors in Application failures are not shown in the client trace.
> -
>
> Key: MAPREDUCE-2937
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2937
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2937.patch, MAPREDUCE-2937.patch, 
> MAPREDUCE-2937.patch
>
>
> The client side does not show enough information on why the job failed. Here 
> is step to reproduce it:
> 1) set the scheduler to be capacity scheduler with queues a, b
> 2) submit a job to a queue that is not a,b
> The job just fails without saying why it failed. We should have enough trace 
> log at the client side to let the user know why it failed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2953) JobClient fails due to a race in RM, removes staged files and in turn crashes MR AM

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2953:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

I just committed this. Thanks Thomas!

> JobClient fails due to a race in RM, removes staged files and in turn crashes 
> MR AM
> ---
>
> Key: MAPREDUCE-2953
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2953
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2, resourcemanager
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2953-v2.patch, MAPREDUCE-2953-v3.patch, 
> MAPREDUCE-2953.patch
>
>
> [~Karams] ran into this multiple times. MR JobClient crashes immediately.
> {code}
> 11/09/08 10:52:35 INFO mapreduce.JobSubmitter: number of splits:2094
> 11/09/08 10:52:36 INFO mapred.YARNRunner: AppMaster capability = memory: 2048,
> 11/09/08 10:52:36 INFO mapred.YARNRunner: Command to launch container for 
> ApplicationMaster is : $JAVA_HOME/bin/java -Dhadoop.root.logger=INFO,console 
> -Xmx1536m org.apache.hadoop.mapreduce.v2.app.MRAppMaster 1315478927026 1 
>  1>/stdout 2>/stderr
> 11/09/08 10:52:36 INFO mapred.ResourceMgrDelegate: Submitted application 
> application_1315478927026_1 to ResourceManager
> 11/09/08 10:52:36 INFO mapreduce.JobSubmitter: Cleaning up the staging area 
> /user/gridperf/.staging/job_1315478927026_0001
> RemoteTrace:
>  at Local Trace:
> org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: 
> failed to run job
> at 
> org.apache.hadoop.yarn.factories.impl.pb.YarnRemoteExceptionFactoryPBImpl.createYarnRemoteException(YarnRemoteExceptionFactoryPBImpl.java:39)
> at 
> org.apache.hadoop.yarn.ipc.RPCUtil.getRemoteException(RPCUtil.java:47)
> at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.java:250)
> at 
> org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.java:377)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1072)
> at org.apache.hadoop.mapreduce.Job$2.run(Job.java:1069)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1135)
> at org.apache.hadoop.mapreduce.Job.submit(Job.java:1069)
> at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1089)
> at org.apache.hadoop.examples.RandomWriter.run(RandomWriter.java:283)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69)
> at org.apache.hadoop.examples.RandomWriter.main(RandomWriter.java:294)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at 
> org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
> at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
> at 
> org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:189)
> }
> {code}
> The client crashes due to a race in RM.
> Because the client fails, it immediately removes the staged files which in 
> turn makes the MR AM itself to crash due to failed localization on the NM.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2690) Construct the web page for default scheduler

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100873#comment-13100873
 ] 

Hudson commented on MAPREDUCE-2690:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #866 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/866/])
MAPREDUCE-2690. Web-page for FifoScheduler. Contributed by Eric Payne.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166958
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/DefaultSchedulerPage.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java


> Construct the web page for default scheduler
> 
>
> Key: MAPREDUCE-2690
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2690
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Ramya Sunil
>Assignee: Eric Payne
> Fix For: 0.23.0
>
> Attachments: DefSchedulerScreenshot.2011-09-08.png.jpg, 
> DefaultSchedulerPage.jpg, MAPREDUCE-2690.2011-09-02.patch, 
> MAPREDUCE-2690.2011-09-05.patch, MAPREDUCE-2690.2011-09-08.patch
>
>
> Currently, the web page for default scheduler reads as "Under construction". 
> This is a long known issue, but could not find a tracking ticket. Hence 
> opening one.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2937) Errors in Application failures are not shown in the client trace.

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2937:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

I just committed this. Thanks Mahadev!

> Errors in Application failures are not shown in the client trace.
> -
>
> Key: MAPREDUCE-2937
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2937
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2937.patch, MAPREDUCE-2937.patch, 
> MAPREDUCE-2937.patch
>
>
> The client side does not show enough information on why the job failed. Here 
> is step to reproduce it:
> 1) set the scheduler to be capacity scheduler with queues a, b
> 2) submit a job to a queue that is not a,b
> The job just fails without saying why it failed. We should have enough trace 
> log at the client side to let the user know why it failed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2690) Construct the web page for default scheduler

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100867#comment-13100867
 ] 

Hudson commented on MAPREDUCE-2690:
---

Integrated in Hadoop-Common-trunk-Commit #855 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/855/])
MAPREDUCE-2690. Web-page for FifoScheduler. Contributed by Eric Payne.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166958
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/DefaultSchedulerPage.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java


> Construct the web page for default scheduler
> 
>
> Key: MAPREDUCE-2690
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2690
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Ramya Sunil
>Assignee: Eric Payne
> Fix For: 0.23.0
>
> Attachments: DefSchedulerScreenshot.2011-09-08.png.jpg, 
> DefaultSchedulerPage.jpg, MAPREDUCE-2690.2011-09-02.patch, 
> MAPREDUCE-2690.2011-09-05.patch, MAPREDUCE-2690.2011-09-08.patch
>
>
> Currently, the web page for default scheduler reads as "Under construction". 
> This is a long known issue, but could not find a tracking ticket. Hence 
> opening one.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2690) Construct the web page for default scheduler

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100869#comment-13100869
 ] 

Hudson commented on MAPREDUCE-2690:
---

Integrated in Hadoop-Hdfs-trunk-Commit #932 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/932/])
MAPREDUCE-2690. Web-page for FifoScheduler. Contributed by Eric Payne.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166958
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/FifoScheduler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/DefaultSchedulerPage.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java


> Construct the web page for default scheduler
> 
>
> Key: MAPREDUCE-2690
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2690
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Ramya Sunil
>Assignee: Eric Payne
> Fix For: 0.23.0
>
> Attachments: DefSchedulerScreenshot.2011-09-08.png.jpg, 
> DefaultSchedulerPage.jpg, MAPREDUCE-2690.2011-09-02.patch, 
> MAPREDUCE-2690.2011-09-05.patch, MAPREDUCE-2690.2011-09-08.patch
>
>
> Currently, the web page for default scheduler reads as "Under construction". 
> This is a long known issue, but could not find a tracking ticket. Hence 
> opening one.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2690) Construct the web page for default scheduler

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2690:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

I just committed this. Thanks Eric!

> Construct the web page for default scheduler
> 
>
> Key: MAPREDUCE-2690
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2690
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Ramya Sunil
>Assignee: Eric Payne
> Fix For: 0.23.0
>
> Attachments: DefSchedulerScreenshot.2011-09-08.png.jpg, 
> DefaultSchedulerPage.jpg, MAPREDUCE-2690.2011-09-02.patch, 
> MAPREDUCE-2690.2011-09-05.patch, MAPREDUCE-2690.2011-09-08.patch
>
>
> Currently, the web page for default scheduler reads as "Under construction". 
> This is a long known issue, but could not find a tracking ticket. Hence 
> opening one.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2958) wordcount succeeds but doesn't output file

2011-09-08 Thread Thomas Graves (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100861#comment-13100861
 ] 

Thomas Graves commented on MAPREDUCE-2958:
--

It looks like the mapred-defaults.xml isn't included in any of the 
mapreduce-client jar files.  It used to be in the client-core jar.  That is 
what was setting the default number of reduces to 1.  The code current defaults 
to 0 if not specified.

job.numReduceTasks = job.conf.getInt(MRJobConfig.NUM_REDUCES, 0);


> wordcount succeeds but doesn't output file
> --
>
> Key: MAPREDUCE-2958
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2958
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Thomas Graves
>Assignee: Thomas Graves
>Priority: Critical
> Fix For: 0.23.0
>
>
> I have been running wordcount out of the 23 examples jar.  It says it 
> succeeds but doesn't actually output a file.
> hadoop jar 
> examples/hadoop-mapreduce-0.23.0-SNAPSHOT/hadoop-mapreduce-examples-0.23.0-SNAPSHOT.jar
>  wordcount input output2
> input file is really basic:
> fdksajl
> dlkfsajlfljda;j
> kldfsjallj
> test
> one
> two
> test

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2864) Renaming of configuration property names in yarn

2011-09-08 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-2864:
-

Resolution: Fixed
Status: Resolved  (was: Patch Available)

This was a biggie, thanks for being patient Robert - I just committed this.

> Renaming of configuration property names in yarn
> 
>
> Key: MAPREDUCE-2864
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2864
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: jobhistoryserver, mrv2, nodemanager, resourcemanager
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Robert Joseph Evans
>Assignee: Robert Joseph Evans
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR-2864-0.23-v10.patch, MR-2864-trunk-v10.patch, 
> MR-2864-v1.patch, MR-2864-v2.patch, MR-2864-v3.patch, MR-2864-v4.patch, 
> MR-2864-v5.patch, MR-2864-v6.patch, MR-2864-v7.patch, MR-2864-v8.patch, 
> MR-2864-v9.patch, update.pl, yarnConfigs.csv
>
>
> Now that YARN has been put in to trunk we should do something similar to 
> MAPREDUCE-849.  We should go back and look at all of the configurations that 
> have been added in and rename them as needed to be consistent and subdivided 
> by component.
> # We should use all lowercase in the config names. e.g., we should use 
> appsmanager instead of appsManager etc.
> # history server config names should be prefixed with mapreduce instead of 
> yarn.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2864) Renaming of configuration property names in yarn

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100858#comment-13100858
 ] 

Hudson commented on MAPREDUCE-2864:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #865 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/865/])
MAPREDUCE-2864. Normalize configuration variable names for YARN. 
Contributed by Robert Evans.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166955
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/INSTALL
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/LocalContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/TaskAttemptListenerImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/WrappedProgressSplitsBlock.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobHistoryEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/AMConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/client/MRClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/job/impl/TaskAttemptImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMCommunicator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerRequestor.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/DefaultSpeculator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/ExponentiallySmoothedTaskRuntimeEstimator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/MRApp.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestRecovery.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/MRConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHAdminConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JobHistoryUtils.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/security/client/ClientHSSecurityInfo.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/test/java/org/apache/hadoop/mapreduce/v2/util/TestMRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main

[jira] [Commented] (MAPREDUCE-2864) Renaming of configuration property names in yarn

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100854#comment-13100854
 ] 

Hudson commented on MAPREDUCE-2864:
---

Integrated in Hadoop-Common-trunk-Commit #854 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/854/])
MAPREDUCE-2864. Normalize configuration variable names for YARN. 
Contributed by Robert Evans.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166955
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/INSTALL
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/LocalContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/TaskAttemptListenerImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/WrappedProgressSplitsBlock.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobHistoryEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/AMConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/client/MRClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/job/impl/TaskAttemptImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMCommunicator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerRequestor.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/DefaultSpeculator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/ExponentiallySmoothedTaskRuntimeEstimator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/MRApp.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestRecovery.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/MRConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHAdminConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JobHistoryUtils.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/security/client/ClientHSSecurityInfo.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/test/java/org/apache/hadoop/mapreduce/v2/util/TestMRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/

[jira] [Commented] (MAPREDUCE-2864) Renaming of configuration property names in yarn

2011-09-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100855#comment-13100855
 ] 

Hudson commented on MAPREDUCE-2864:
---

Integrated in Hadoop-Hdfs-trunk-Commit #931 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/931/])
MAPREDUCE-2864. Normalize configuration variable names for YARN. 
Contributed by Robert Evans.

acmurthy : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1166955
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/INSTALL
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/LocalContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/TaskAttemptListenerImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapred/WrappedProgressSplitsBlock.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobHistoryEventHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/AMConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/client/MRClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/job/impl/TaskAttemptImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMCommunicator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerRequestor.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/DefaultSpeculator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/speculate/ExponentiallySmoothedTaskRuntimeEstimator.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/MRApp.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestRecovery.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/MRConstants.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHAdminConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JHConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/jobhistory/JobHistoryUtils.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/security/client/ClientHSSecurityInfo.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/test/java/org/apache/hadoop/mapreduce/v2/util/TestMRApps.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/

[jira] [Commented] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100845#comment-13100845
 ] 

Hadoop QA commented on MAPREDUCE-2954:
--

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493709/MR2954_1.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/669//console

This message is automatically generated.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(

[jira] [Updated] (MAPREDUCE-2843) [MR-279] Node entries on the RM UI are not sortable

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2843:
-

Status: Open  (was: Patch Available)

@Abhijit,
 Can you please look at the test failure in the patch?

> [MR-279] Node entries on the RM UI are not sortable
> ---
>
> Key: MAPREDUCE-2843
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2843
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Ramya Sunil
>Assignee: Abhijit Suresh Shingate
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2843- scr shot.jpg, MAPREDUCE-2843.patch, 
> MAPREDUCE-2843.patch
>
>
> The nodemanager entries on the RM UI is not sortable unlike the other web 
> pages. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-2954:
--

Attachment: MR2954_1.patch

compareTo and equals should not be synchronized in ApplicationAttemptId

The patch fixes this +
- ads synchronization for other methods in ApplicationId and ContainerId.
- hashCode, equals, toString, compareTo implemented for ApplicationId, 
ApplicationAttemptId and ContainerId - so that the backing ProtoBuf object is 
not serialized for each of these calls.
- Moved these methods up one level (AppId, AppAttemptId, CId changed to 
abstract classes)
The tests are kindof lame. Can be dropped if they're not required.

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-2954:
--

Fix Version/s: 0.24.0
Affects Version/s: 0.24.0
   Status: Patch Available  (was: Open)

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2954_1.patch
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2413) TaskTracker should handle disk failures at both startup and runtime

2011-09-08 Thread Eli Collins (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100835#comment-13100835
 ] 

Eli Collins commented on MAPREDUCE-2413:


Another testing question - was this tested in conjunction with a mapred health 
checker script?

> TaskTracker should handle disk failures at both startup and runtime
> ---
>
> Key: MAPREDUCE-2413
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2413
> Project: Hadoop Map/Reduce
>  Issue Type: Sub-task
>  Components: task-controller, tasktracker
>Affects Versions: 0.20.204.0
>Reporter: Bharath Mundlapudi
>Assignee: Ravi Gummadi
> Fix For: 0.20.204.0
>
> Attachments: MR-2413.v0.1.patch, MR-2413.v0.2.patch, 
> MR-2413.v0.3.patch, MR-2413.v0.patch
>
>
> At present, TaskTracker doesn't handle disk failures properly both at startup 
> and runtime.
> (1) Currently TaskTracker doesn't come up if any of the mapred-local-dirs is 
> on a bad disk. TaskTracker should ignore that particular mapred-local-dir and 
> start up and use only the remaining good mapred-local-dirs.
> (2) If a disk goes bad while TaskTracker is running, currently TaskTracker 
> doesn't do anything special. This results in either
>(a) TaskTracker continues to "try to use that bad disk" and this results 
> in lots of task failures and possibly job failures(because of multiple TTs 
> having bad disks) and eventually these TTs getting graylisted for all jobs. 
> And this needs manual restart of TT with modified configuration of 
> mapred-local-dirs avoiding the bad disk. OR
>(b) Health check script identifying the disk as bad and the TT gets 
> blacklisted. And this also needs manual restart of TT with modified 
> configuration of mapred-local-dirs avoiding the bad disk.
> This JIRA is to make TaskTracker more fault-tolerant to disk failures solving 
> (1) and (2). i.e. TT should start even if at least one of the 
> mapred-local-dirs is on a good disk and TT should adjust its in-memory list 
> of mapred-local-dirs and avoid using bad mapred-local-dirs.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-434) local map-reduce job limited to single reducer

2011-09-08 Thread Forest Tan (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100834#comment-13100834
 ] 

Forest Tan commented on MAPREDUCE-434:
--

Can't wait for this patch to be submitted...

> local map-reduce job limited to single reducer
> --
>
> Key: MAPREDUCE-434
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-434
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
> Environment: local job tracker
>Reporter: Yoram Arnon
>Assignee: Aaron Kimball
>Priority: Minor
> Attachments: MAPREDUCE-434.2.patch, MAPREDUCE-434.3.patch, 
> MAPREDUCE-434.4.patch, MAPREDUCE-434.5.patch, MAPREDUCE-434.6.patch, 
> MAPREDUCE-434.patch
>
>
> when mapred.job.tracker is set to 'local', my setNumReduceTasks call is 
> ignored, and the number of reduce tasks is set at 1.
> This prevents me from locally debugging my partition function, which tries to 
> partition based on the number of reduce tasks.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2749) [MR-279] NM registers with RM even before it starts various servers

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100832#comment-13100832
 ] 

Hadoop QA commented on MAPREDUCE-2749:
--

+1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12493701/MAPREDUCE-2749-v4.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 24 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/668//console

This message is automatically generated.

> [MR-279] NM registers with RM even before it starts various servers
> ---
>
> Key: MAPREDUCE-2749
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2749
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2749-v2.patch, MAPREDUCE-2749-v3.patch, 
> MAPREDUCE-2749-v4.patch, MAPREDUCE-2749.patch
>
>
> In case NM eventually fails to start the ContainerManager server because of 
> say a port clash, RM will have to wait for expiry to detect the NM crash.
> It is desirable to make NM register with RM only after it can start all of 
> its components successfully.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2843) [MR-279] Node entries on the RM UI are not sortable

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100828#comment-13100828
 ] 

Hadoop QA commented on MAPREDUCE-2843:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493702/MAPREDUCE-2843.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 4 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed these unit tests:
  
org.apache.hadoop.yarn.server.resourcemanager.webapp.TestNodesPage

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/667//console

This message is automatically generated.

> [MR-279] Node entries on the RM UI are not sortable
> ---
>
> Key: MAPREDUCE-2843
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2843
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Ramya Sunil
>Assignee: Abhijit Suresh Shingate
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2843- scr shot.jpg, MAPREDUCE-2843.patch, 
> MAPREDUCE-2843.patch
>
>
> The nodemanager entries on the RM UI is not sortable unlike the other web 
> pages. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2676) MR-279: JobHistory Job page needs reformatted

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100818#comment-13100818
 ] 

Hadoop QA commented on MAPREDUCE-2676:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12493619/MR-2676-trunk-V5.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 12 new or modified tests.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/666//console

This message is automatically generated.

> MR-279: JobHistory Job page needs reformatted
> -
>
> Key: MAPREDUCE-2676
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2676
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Robert Joseph Evans
>Assignee: Robert Joseph Evans
> Fix For: 0.23.0
>
> Attachments: MR-2676-trunk-V2.patch, MR-2676-trunk-V3.patch, 
> MR-2676-trunk-V4.patch, MR-2676-trunk-V5.patch, MR-2676-v1.patch
>
>
> The Job page, The Maps page and the Reduces page for the job history server 
> needs to be reformatted.
> The Job Overview needs to add in the User, a link to the Job Conf, and the 
> Job ACLs
> It also needs Submitted at, launched at, and finished at, depending on how 
> they relates to Started and Elapsed.
> In the attempts table we need to remove the new and the running columns
> In the tasks table we need to remove progress, pending, and running columns 
> and add in a failed count column
> We also need to investigate what it would take to add in setup and cleanup 
> statistics.  Perhaps these should be more generally Application Master 
> statistics and links.
> The Maps page and Reduces page should have the progress column removed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2843) [MR-279] Node entries on the RM UI are not sortable

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2843:
-

Attachment: MAPREDUCE-2843.patch

Reuploading Abhijit's patch to kick of hudson  CI on the right patch.

> [MR-279] Node entries on the RM UI are not sortable
> ---
>
> Key: MAPREDUCE-2843
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2843
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Ramya Sunil
>Assignee: Abhijit Suresh Shingate
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2843- scr shot.jpg, MAPREDUCE-2843.patch, 
> MAPREDUCE-2843.patch
>
>
> The nodemanager entries on the RM UI is not sortable unlike the other web 
> pages. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2843) [MR-279] Node entries on the RM UI are not sortable

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2843:
-

Status: Open  (was: Patch Available)

> [MR-279] Node entries on the RM UI are not sortable
> ---
>
> Key: MAPREDUCE-2843
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2843
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Ramya Sunil
>Assignee: Abhijit Suresh Shingate
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2843- scr shot.jpg, MAPREDUCE-2843.patch, 
> MAPREDUCE-2843.patch
>
>
> The nodemanager entries on the RM UI is not sortable unlike the other web 
> pages. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2843) [MR-279] Node entries on the RM UI are not sortable

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2843:
-

Status: Patch Available  (was: Open)

> [MR-279] Node entries on the RM UI are not sortable
> ---
>
> Key: MAPREDUCE-2843
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2843
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Ramya Sunil
>Assignee: Abhijit Suresh Shingate
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2843- scr shot.jpg, MAPREDUCE-2843.patch, 
> MAPREDUCE-2843.patch
>
>
> The nodemanager entries on the RM UI is not sortable unlike the other web 
> pages. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2749) [MR-279] NM registers with RM even before it starts various servers

2011-09-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated MAPREDUCE-2749:
-

Status: Patch Available  (was: Open)

> [MR-279] NM registers with RM even before it starts various servers
> ---
>
> Key: MAPREDUCE-2749
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2749
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2749-v2.patch, MAPREDUCE-2749-v3.patch, 
> MAPREDUCE-2749-v4.patch, MAPREDUCE-2749.patch
>
>
> In case NM eventually fails to start the ContainerManager server because of 
> say a port clash, RM will have to wait for expiry to detect the NM crash.
> It is desirable to make NM register with RM only after it can start all of 
> its components successfully.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2749) [MR-279] NM registers with RM even before it starts various servers

2011-09-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated MAPREDUCE-2749:
-

Attachment: MAPREDUCE-2749-v4.patch

> [MR-279] NM registers with RM even before it starts various servers
> ---
>
> Key: MAPREDUCE-2749
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2749
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Thomas Graves
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2749-v2.patch, MAPREDUCE-2749-v3.patch, 
> MAPREDUCE-2749-v4.patch, MAPREDUCE-2749.patch
>
>
> In case NM eventually fails to start the ContainerManager server because of 
> say a port clash, RM will have to wait for expiry to detect the NM crash.
> It is desirable to make NM register with RM only after it can start all of 
> its components successfully.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2691) Finish up the cleanup of distributed cache file resources and related tests.

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2691:
-

Status: Open  (was: Patch Available)

Cancelling patch so that the issues can be addressed.

> Finish up the cleanup of distributed cache file resources and related tests.
> 
>
> Key: MAPREDUCE-2691
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2691
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Reporter: Amol Kekre
>Assignee: Siddharth Seth
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR2691_1.patch, MR2691_2.patch, MR2691_3.patch
>
>
> Implement cleanup of distributed cache file resources

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Mahadev konar (JIRA)
TestMRJobs hangs waiting to connect to history server.
--

 Key: MAPREDUCE-2963
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Mahadev konar
Priority: Critical
 Fix For: 0.23.0


TestMRJobs is hanging waiting to connect to history server. I will post the 
logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2963) TestMRJobs hangs waiting to connect to history server.

2011-09-08 Thread Mahadev konar (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100807#comment-13100807
 ] 

Mahadev konar commented on MAPREDUCE-2963:
--

{code}
Call: 
protocol=org.apache.hadoop.yarn.proto.MRClientProtocol.MRClientProtocolService.BlockingInterface,
 method=getTaskReports
2011-09-08 17:10:50,808 INFO  mapred.ClientServiceDelegate 
(ClientServiceDelegate.java:invoke(242)) - Failed to contact AM for job 
job_1315526960327_0001  Will retry..
Call: 
protocol=org.apache.hadoop.yarn.proto.ClientRMProtocol.ClientRMProtocolService.BlockingInterface,
 method=getApplicationReport
2011-09-08 17:10:50,808 INFO  mapred.ClientServiceDelegate 
(ClientServiceDelegate.java:getProxy(199)) - Application state is completed. 
Redirecting to job history server null
Call: 
protocol=org.apache.hadoop.yarn.proto.MRClientProtocol.MRClientProtocolService.BlockingInterface,
 method=getTaskReports
2011-09-08 17:10:50,809 INFO  mapred.ClientServiceDelegate 
(ClientServiceDelegate.java:invoke(242)) - Failed to contact AM for job 
job_1315526960327_0001  Will retry..
Call: 
protocol=org.apache.hadoop.yarn.proto.ClientRMProtocol.ClientRMProtocolService.BlockingInterface,
 method=getApplicationReport
2011-09-08 17:10:50,809 INFO  mapred.ClientServiceDelegate 
(ClientServiceDelegate.java:getProxy(199)) - Application state is completed. 
Redirecting to job history server null
Call: 
protocol=org.apache.hadoop.yarn.proto.MRClientProtocol.MRClientProtocolService.BlockingInterface,
 method=getTaskReports
2011-09-08 17:10:50,809 INFO  mapred.ClientServiceDelegate 
(ClientServiceDelegate.java:invoke(242)) - Failed to contact AM for job 
job_1315526960

{code}

This is trace I see when the first job completes in the TestMRJobs.

> TestMRJobs hangs waiting to connect to history server.
> --
>
> Key: MAPREDUCE-2963
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2963
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Priority: Critical
> Fix For: 0.23.0
>
>
> TestMRJobs is hanging waiting to connect to history server. I will post the 
> logs next.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2961) Increase the default threadpool size for container launching in the application master.

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100803#comment-13100803
 ] 

Hadoop QA commented on MAPREDUCE-2961:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493691/MAPREDUCE-2961.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-hs.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-shuffle.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-jobclient.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-core.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-resourcemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-server-nodemanager.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-yarn-api.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/665//console

This message is automatically generated.

> Increase the default threadpool size for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2864) Renaming of configuration property names in yarn

2011-09-08 Thread Mahadev konar (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100802#comment-13100802
 ] 

Mahadev konar commented on MAPREDUCE-2864:
--

Minor nit,  just noticed this in the patch in yarn-default.xml:

{code}
-  nodemanager.aux.service.mapreduce.shuffle.class
-  org.apache.hadoop.mapred.ShuffleHandler
+mapreduce.job.jar
+
+
   
--->
{code}

I think we should remove this property name (mapreduce.job.jar) in 
yarn-default.xml. This can be done when committing, no need to upload a new 
patch.

> Renaming of configuration property names in yarn
> 
>
> Key: MAPREDUCE-2864
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2864
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: jobhistoryserver, mrv2, nodemanager, resourcemanager
>Affects Versions: 0.23.0, 0.24.0
>Reporter: Robert Joseph Evans
>Assignee: Robert Joseph Evans
> Fix For: 0.23.0, 0.24.0
>
> Attachments: MR-2864-0.23-v10.patch, MR-2864-trunk-v10.patch, 
> MR-2864-v1.patch, MR-2864-v2.patch, MR-2864-v3.patch, MR-2864-v4.patch, 
> MR-2864-v5.patch, MR-2864-v6.patch, MR-2864-v7.patch, MR-2864-v8.patch, 
> MR-2864-v9.patch, update.pl, yarnConfigs.csv
>
>
> Now that YARN has been put in to trunk we should do something similar to 
> MAPREDUCE-849.  We should go back and look at all of the configurations that 
> have been added in and rename them as needed to be consistent and subdivided 
> by component.
> # We should use all lowercase in the config names. e.g., we should use 
> appsmanager instead of appsManager etc.
> # history server config names should be prefixed with mapreduce instead of 
> yarn.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (MAPREDUCE-2954) Deadlock in NM with threads racing for ApplicationAttemptId

2011-09-08 Thread Siddharth Seth (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth reassigned MAPREDUCE-2954:
-

Assignee: Siddharth Seth

> Deadlock in NM with threads racing for ApplicationAttemptId
> ---
>
> Key: MAPREDUCE-2954
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2954
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Vinod Kumar Vavilapalli
>Assignee: Siddharth Seth
>Priority: Critical
> Fix For: 0.23.0
>
>
> Found this:
> {code}
> Java stack information for the threads listed above:
> ===
> "Thread-45":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doGet(ConcurrentSkipListMap.java:797)
> at 
> java.util.concurrent.ConcurrentSkipListMap.get(ConcurrentSkipListMap.java:1640)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:360)
> at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl$ContainerEventDispatcher.handle(ContainerManagerImpl.java:355)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher.dispatch(AsyncDispatcher.java:113)
> at 
> org.apache.hadoop.yarn.event.AsyncDispatcher$1.run(AsyncDispatcher.java:75)
> at java.lang.Thread.run(Thread.java:619)
> "Thread-30":
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.getApplicationId(ApplicationAttemptIdPBImpl.java:101)
> - waiting to lock <0xb6a443a0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:144)
> - locked <0xb6a43ba0> (a 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ApplicationAttemptIdPBImpl.compareTo(ApplicationAttemptIdPBImpl.java:31)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:215)
> at 
> org.apache.hadoop.yarn.api.records.impl.pb.ContainerIdPBImpl.compareTo(ContainerIdPBImpl.java:34)
> at 
> java.util.concurrent.ConcurrentSkipListMap.doRemove(ConcurrentSkipListMap.java:1078)
> at 
> java.util.concurrent.ConcurrentSkipListMap.remove(ConcurrentSkipListMap.java:1673)
> at 
> java.util.concurrent.ConcurrentSkipListMap$Iter.remove(ConcurrentSkipListMap.java:2256)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.getNodeStatus(NodeStatusUpdaterImpl.java:223)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl.access$300(NodeStatusUpdaterImpl.java:62)
> at 
> org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl$1.run(NodeStatusUpdaterImpl.java:262)
> Found 1 deadlock.
> {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2961) Increase the default threadpool size for container launching in the application master.

2011-09-08 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated MAPREDUCE-2961:
--

Summary: Increase the default threadpool size for container launching in 
the application master.  (was: Increase the default threadpool fize for 
container launching in the application master.)

> Increase the default threadpool size for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2940) Build fails with ant 1.7.0 but works with 1.8.0

2011-09-08 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100791#comment-13100791
 ] 

Hadoop QA commented on MAPREDUCE-2940:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12493690/MAPREDUCE-2940.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/664//console

This message is automatically generated.

> Build fails with ant 1.7.0 but works with 1.8.0
> ---
>
> Key: MAPREDUCE-2940
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2940
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.22.0
> Environment: Ubuntu 11.04; Sun JDK 1.6.0_26; Ant 1.8.2; Ant 1.7.0
>Reporter: Joep Rottinghuis
>Assignee: Joep Rottinghuis
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-2940.patch
>
>
> contrib builds fail when using Ant 1.7.
> build.xml calls build.xml in contrib, which calls block-forensics build, 
> which in turn uses build-contrib.
> The inheritAll=true overrides the basedir in ant 1.7.0 but not in 1.8.0.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2961) Increase the default threadpool fize for container launching in the application master.

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2961:
-

Status: Patch Available  (was: Open)

> Increase the default threadpool fize for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2961) Increase the default threadpool fize for container launching in the application master.

2011-09-08 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2961:
-

Attachment: MAPREDUCE-2961.patch

Trivial patch to increase the number of threads in the launcher so that we do 
not bottleneck on container launching.

> Increase the default threadpool fize for container launching in the 
> application master.
> ---
>
> Key: MAPREDUCE-2961
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2961
> Project: Hadoop Map/Reduce
>  Issue Type: Improvement
>  Components: mrv2
>Affects Versions: 0.23.0
>Reporter: Mahadev konar
>Assignee: Mahadev konar
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2961.patch
>
>
> Currently the default threadpool size is 10 for launching containers in 
> ContainerLauncherImpl. We should increase that to 100 for a reasonable 
> default, so that container launching is not backed up by a small thread pool 
> size.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-2962) Map percentage complete calculation incorrect

2011-09-08 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100786#comment-13100786
 ] 

Todd Lipcon commented on MAPREDUCE-2962:


There were a couple of "lost map output" errors here, but it's strange that the 
web UI doesn't show the tasks moving back from Complete to Pending/Running

> Map percentage complete calculation incorrect
> -
>
> Key: MAPREDUCE-2962
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2962
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>Affects Versions: 0.20.203.0
>Reporter: Todd Lipcon
>Priority: Minor
>
> For some reason I have a job with 5000 map tasks, all of which are complete, 
> and it's reporting 99.98% complete. The client logs show "map 100%" and then 
> going back down to "map 99%".

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2940) Build fails with ant 1.7.0 but works with 1.8.0

2011-09-08 Thread Joep Rottinghuis (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joep Rottinghuis updated MAPREDUCE-2940:


Status: Patch Available  (was: Open)

> Build fails with ant 1.7.0 but works with 1.8.0
> ---
>
> Key: MAPREDUCE-2940
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2940
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.22.0
> Environment: Ubuntu 11.04; Sun JDK 1.6.0_26; Ant 1.8.2; Ant 1.7.0
>Reporter: Joep Rottinghuis
>Assignee: Joep Rottinghuis
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-2940.patch
>
>
> contrib builds fail when using Ant 1.7.
> build.xml calls build.xml in contrib, which calls block-forensics build, 
> which in turn uses build-contrib.
> The inheritAll=true overrides the basedir in ant 1.7.0 but not in 1.8.0.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-2962) Map percentage complete calculation incorrect

2011-09-08 Thread Todd Lipcon (JIRA)
Map percentage complete calculation incorrect
-

 Key: MAPREDUCE-2962
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2962
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Affects Versions: 0.20.203.0
Reporter: Todd Lipcon
Priority: Minor


For some reason I have a job with 5000 map tasks, all of which are complete, 
and it's reporting 99.98% complete. The client logs show "map 100%" and then 
going back down to "map 99%".

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2940) Build fails with ant 1.7.0 but works with 1.8.0

2011-09-08 Thread Joep Rottinghuis (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joep Rottinghuis updated MAPREDUCE-2940:


Attachment: MAPREDUCE-2940.patch

> Build fails with ant 1.7.0 but works with 1.8.0
> ---
>
> Key: MAPREDUCE-2940
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2940
> Project: Hadoop Map/Reduce
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.22.0
> Environment: Ubuntu 11.04; Sun JDK 1.6.0_26; Ant 1.8.2; Ant 1.7.0
>Reporter: Joep Rottinghuis
>Assignee: Joep Rottinghuis
> Fix For: 0.22.0
>
> Attachments: MAPREDUCE-2940.patch
>
>
> contrib builds fail when using Ant 1.7.
> build.xml calls build.xml in contrib, which calls block-forensics build, 
> which in turn uses build-contrib.
> The inheritAll=true overrides the basedir in ant 1.7.0 but not in 1.8.0.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




  1   2   3   >