[jira] [Commented] (MAPREDUCE-4120) mapred job -list-attempt-ids fails to get attempt ids
[ https://issues.apache.org/jira/browse/MAPREDUCE-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249730#comment-13249730 ] Bhallamudi Venkata Siva Kamesh commented on MAPREDUCE-4120: --- Hi Deva, I think, this is dup of [MAPREDUCE-4019|https://issues.apache.org/jira/browse/MAPREDUCE-4019]. mapred job -list-attempt-ids fails to get attempt ids - Key: MAPREDUCE-4120 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4120 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 Usage: CLI [-list-attempt-ids job-id task-type task-state]. Valid values for task-type are MAP REDUCE JOB_SETUP JOB_CLEANUP TASK_CLEANUP. Valid values for task-state are running, completed {code} \\ \\ In the above command it gives valid task-type are MAP REDUCE JOB_SETUP JOB_CLEANUP TASK_CLEANUP. If we give the task-type as MAP, it says as invalid type. {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 MAP completed 12/04/07 19:51:21 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server Exception in thread main java.lang.IllegalArgumentException: Invalid type: MAP. Valid types for task are: map, reduce, setup, cleanup. at org.apache.hadoop.mapreduce.tools.CLI.displayTasks(CLI.java:564) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:316) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} \\ \\ In the above command it gives valid types for task are: map, reduce, setup, cleanup.. If we give the task-type as map, it fails with the below error. {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 map completed 12/04/07 19:51:42 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server Exception in thread main java.lang.IllegalArgumentException: No enum const class org.apache.hadoop.mapreduce.TaskType.map at java.lang.Enum.valueOf(Enum.java:214) at org.apache.hadoop.mapreduce.TaskType.valueOf(TaskType.java:27) at org.apache.hadoop.mapreduce.tools.CLI.displayTasks(CLI.java:572) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:316) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (MAPREDUCE-4120) mapred job -list-attempt-ids fails to get attempt ids
[ https://issues.apache.org/jira/browse/MAPREDUCE-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K resolved MAPREDUCE-4120. -- Resolution: Duplicate Assignee: (was: Devaraj K) Dup of MAPREDUCE-4019. Thanks Kamesh for notifying me. mapred job -list-attempt-ids fails to get attempt ids - Key: MAPREDUCE-4120 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4120 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Priority: Blocker {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 Usage: CLI [-list-attempt-ids job-id task-type task-state]. Valid values for task-type are MAP REDUCE JOB_SETUP JOB_CLEANUP TASK_CLEANUP. Valid values for task-state are running, completed {code} \\ \\ In the above command it gives valid task-type are MAP REDUCE JOB_SETUP JOB_CLEANUP TASK_CLEANUP. If we give the task-type as MAP, it says as invalid type. {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 MAP completed 12/04/07 19:51:21 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server Exception in thread main java.lang.IllegalArgumentException: Invalid type: MAP. Valid types for task are: map, reduce, setup, cleanup. at org.apache.hadoop.mapreduce.tools.CLI.displayTasks(CLI.java:564) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:316) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} \\ \\ In the above command it gives valid types for task are: map, reduce, setup, cleanup.. If we give the task-type as map, it fails with the below error. {code:xml} dev@ubuntudev-linux:~/hadoop/install/hadoop-3.0.0-SNAPSHOT/bin$ ./mapred job -list-attempt-ids job_1333786831666_0001 map completed 12/04/07 19:51:42 INFO mapred.ClientServiceDelegate: Application state is completed. FinalApplicationStatus=SUCCEEDED. Redirecting to job history server Exception in thread main java.lang.IllegalArgumentException: No enum const class org.apache.hadoop.mapreduce.TaskType.map at java.lang.Enum.valueOf(Enum.java:214) at org.apache.hadoop.mapreduce.TaskType.valueOf(TaskType.java:27) at org.apache.hadoop.mapreduce.tools.CLI.displayTasks(CLI.java:572) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:316) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4122) The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page.
The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page. - Key: MAPREDUCE-4122 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4122 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Ramgopal N Application will retry yarn.resourcemanager.am.max-retries times before the job is failed,if the MRAppmaster process is getting killed continously.This killed count is considered for Pending applications with -ve value on CLuster metrics page. This will mis-interpret the exact number of jobs in the Pending state for the cluster.Even if the MRAppmaster kill count is monitored:should be done at the job level and not at the cluster level. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4122) The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249739#comment-13249739 ] Bhallamudi Venkata Siva Kamesh commented on MAPREDUCE-4122: --- Hi Ram, Is this dup of [MAPREDUCE-3870|https://issues.apache.org/jira/browse/MAPREDUCE-3870]? The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page. - Key: MAPREDUCE-4122 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4122 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Ramgopal N Application will retry yarn.resourcemanager.am.max-retries times before the job is failed,if the MRAppmaster process is getting killed continously.This killed count is considered for Pending applications with -ve value on CLuster metrics page. This will mis-interpret the exact number of jobs in the Pending state for the cluster.Even if the MRAppmaster kill count is monitored:should be done at the job level and not at the cluster level. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4122) The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249751#comment-13249751 ] Ramgopal N commented on MAPREDUCE-4122: --- yes ,This is duplicate of MAPREDUCE-3870 The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page. - Key: MAPREDUCE-4122 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4122 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Ramgopal N Application will retry yarn.resourcemanager.am.max-retries times before the job is failed,if the MRAppmaster process is getting killed continously.This killed count is considered for Pending applications with -ve value on CLuster metrics page. This will mis-interpret the exact number of jobs in the Pending state for the cluster.Even if the MRAppmaster kill count is monitored:should be done at the job level and not at the cluster level. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (MAPREDUCE-4122) The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bhallamudi Venkata Siva Kamesh resolved MAPREDUCE-4122. --- Resolution: Duplicate Thanks Ram for confirming, Closing this as dup of [MAPREDUCE-3870|https://issues.apache.org/jira/browse/MAPREDUCE-3870] The MRAppmaster process killed count is being added to the Apps Pending value(with -ve sign) in Cluster Metrics page. - Key: MAPREDUCE-4122 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4122 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Ramgopal N Application will retry yarn.resourcemanager.am.max-retries times before the job is failed,if the MRAppmaster process is getting killed continously.This killed count is considered for Pending applications with -ve value on CLuster metrics page. This will mis-interpret the exact number of jobs in the Pending state for the cluster.Even if the MRAppmaster kill count is monitored:should be done at the job level and not at the cluster level. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4123) mapred groups gives NoClassDefFoundError
mapred groups gives NoClassDefFoundError Key: MAPREDUCE-4123 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4123 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Nishan Shetty Priority: Critical linux-168:/home/v2/hadoop-3.0.0-SNAPSHOT/bin # ./mapred groups Exception in thread main java.lang.NoClassDefFoundError: org/apache/hadoop/mapred/tools/GetGroups Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.mapred.tools.GetGroups at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) Could not find the main class: org.apache.hadoop.mapred.tools.GetGroups. Program will exit. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4123) ./mapred groups gives NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nishan Shetty updated MAPREDUCE-4123: - Summary: ./mapred groups gives NoClassDefFoundError (was: mapred groups gives NoClassDefFoundError) ./mapred groups gives NoClassDefFoundError -- Key: MAPREDUCE-4123 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4123 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Nishan Shetty Priority: Critical linux-168:/home/v2/hadoop-3.0.0-SNAPSHOT/bin # ./mapred groups Exception in thread main java.lang.NoClassDefFoundError: org/apache/hadoop/mapred/tools/GetGroups Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.mapred.tools.GetGroups at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) Could not find the main class: org.apache.hadoop.mapred.tools.GetGroups. Program will exit. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4019) -list-attempt-ids is not working
[ https://issues.apache.org/jira/browse/MAPREDUCE-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4019: - Priority: Blocker (was: Minor) -list-attempt-ids is not working - Key: MAPREDUCE-4019 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4019 Project: Hadoop Map/Reduce Issue Type: Bug Components: client Affects Versions: 0.23.1, 0.23.2 Reporter: B Anil Kumar Priority: Blocker Attachments: MAPREDUCE-4019.patch while executing {noformat}bin/mapred job -list-attempt-ids job_id map running{noformat}, we are getting IllegalArgumentexception. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4019) -list-attempt-ids is not working
[ https://issues.apache.org/jira/browse/MAPREDUCE-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249777#comment-13249777 ] Devaraj K commented on MAPREDUCE-4019: -- Thanks Anil for the patch. Patch mostly looks good to me except few minor nits/doubts on the patch. - Can we show the states as MAP, REDUCE in the usage info instead of having taskTypes map(likewise before mrv2 changes)? - As of now there is no support for JOB_SETUP, JOB_CLEANUP. Do we need to display these also? -list-attempt-ids is not working - Key: MAPREDUCE-4019 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4019 Project: Hadoop Map/Reduce Issue Type: Bug Components: client Affects Versions: 0.23.1, 0.23.2 Reporter: B Anil Kumar Priority: Blocker Attachments: MAPREDUCE-4019.patch while executing {noformat}bin/mapred job -list-attempt-ids job_id map running{noformat}, we are getting IllegalArgumentexception. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4124) Job Priority is not changing
Job Priority is not changing - Key: MAPREDUCE-4124 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4124 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Nishan Shetty Priority: Critical 1.Submit job 2.Change the job priority using setPriority() or CLI command ./mapred job-set-priority job-id priority Observe that Job priority is not changed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4125) If we are navigating to Nodemanager UI from Resourcemanager,then there is not link to navigate back to Resource manager
If we are navigating to Nodemanager UI from Resourcemanager,then there is not link to navigate back to Resource manager --- Key: MAPREDUCE-4125 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4125 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Reporter: Ramgopal N Priority: Minor If we are navigating to Nodemanager by clicking on the node link in RM,there is no link provided on the NM to navigate back to RM. If there is a link to navigate back to RM it would be good -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4126) This is a fix both for oracle support and managing DB splits in general.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph Doss updated MAPREDUCE-4126: --- Attachment: hadoop.dbinputformat.query.split.patch.patch This is a fix both for oracle support and managing DB splits in general. Key: MAPREDUCE-4126 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4126 Project: Hadoop Map/Reduce Issue Type: Improvement Components: performance Affects Versions: 1.0.0 Environment: Redhat 6.1 x86_64 Reporter: Joseph Doss Labels: hadoop, newbie, patch Fix For: 1.0.0 Attachments: hadoop.dbinputformat.query.split.patch.patch This is and addition to an older patch that addressed oracle support. We found that in large result sets, the map reduce was both splitting the results ( as expected ) and running a duplicate ( full result set ) on one node, resulting in a long runtime that should have been reduced by splitting. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4126) This is a fix both for oracle support and managing DB splits in general.
This is a fix both for oracle support and managing DB splits in general. Key: MAPREDUCE-4126 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4126 Project: Hadoop Map/Reduce Issue Type: Improvement Components: performance Affects Versions: 1.0.0 Environment: Redhat 6.1 x86_64 Reporter: Joseph Doss Fix For: 1.0.0 Attachments: hadoop.dbinputformat.query.split.patch.patch This is and addition to an older patch that addressed oracle support. We found that in large result sets, the map reduce was both splitting the results ( as expected ) and running a duplicate ( full result set ) on one node, resulting in a long runtime that should have been reduced by splitting. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-3921) MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy
[ https://issues.apache.org/jira/browse/MAPREDUCE-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-3921: --- Status: Patch Available (was: Open) Kicking Jenkins. MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy Key: MAPREDUCE-3921 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3921 Project: Hadoop Map/Reduce Issue Type: Bug Components: mr-am, mrv2 Affects Versions: 0.23.0 Reporter: Vinod Kumar Vavilapalli Assignee: Bikas Saha Fix For: 0.23.2 Attachments: MAPREDUCE-3921-1.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-4099: -- Status: Open (was: Patch Available) ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-4099: -- Status: Patch Available (was: Open) Didn't see any actual failures in the Jenkins build, so maybe it was a timeout? Tried running the jobclient tests on trunk with and without this patch, and I didn't see any noticeable time difference. Kicking Jenkins again. ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3921) MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy
[ https://issues.apache.org/jira/browse/MAPREDUCE-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249844#comment-13249844 ] Robert Joseph Evans commented on MAPREDUCE-3921: I did a quick look at the code and it looks good to me. As for the recovery error you discovered could you please file a follow up JIRA for it, as it is a preexisting issue that can be caused by AM recovery with speculative execution. This patch may expose the issue more frequently, but not enough to really worry me that much. You need two nodes going down very close to one another which is possible, but not that often. MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy Key: MAPREDUCE-3921 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3921 Project: Hadoop Map/Reduce Issue Type: Bug Components: mr-am, mrv2 Affects Versions: 0.23.0 Reporter: Vinod Kumar Vavilapalli Assignee: Bikas Saha Fix For: 0.23.2 Attachments: MAPREDUCE-3921-1.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4117: - Attachment: MAPREDUCE-4117.patch mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4117: - Status: Patch Available (was: Open) I have attached patch to fix this issue. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4079) Allow MR AppMaster to limit ephemeral port range.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-4079: --- Attachment: MR-4079-trunk.txt MR-4079-full-branch-0.23.txt Up-merged and addressed some issues pointed out in HADOOP-8227. Allow MR AppMaster to limit ephemeral port range. - Key: MAPREDUCE-4079 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4079 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mr-am, mrv2 Affects Versions: 0.23.2, 2.0.0 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Priority: Blocker Attachments: MR-4079-full-branch-0.23.txt, MR-4079-full-branch-0.23.txt, MR-4079-trunk.txt, MR-4079-trunk.txt Having the MapReduce Application Masters bind to any ephemeral port makes it very difficult to setup ACLs. mapreduce.job.am-access-disabled from MAPREDUCE-3251 is not a practical permanent solution for all jobs. Especially for tools like pig where they are not aware of mapreduce.job.am-access-disabled and may deal with it properly. We should add in a config option that would allow someone to restrict the range of ports that the MR-AM can bind to. It will slow down startup in some cases because we will have to probe for open ports instead of just asking the OS to find one for us. But we can make that conditional on this config so users who do not set this config do not see any performance degradation. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3921) MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy
[ https://issues.apache.org/jira/browse/MAPREDUCE-3921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249867#comment-13249867 ] Robert Joseph Evans commented on MAPREDUCE-3921: Someone pointed out to me that my comment is a bit confusing. When I said two nodes going down very close to one another I meant that for this to happen we would need one node to go down in succession that had the correct processes running on them. But now that I think about it more, I am not even sure if it will expose the issue. MR AM should act on the nodes liveliness information when nodes go up/down/unhealthy Key: MAPREDUCE-3921 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3921 Project: Hadoop Map/Reduce Issue Type: Bug Components: mr-am, mrv2 Affects Versions: 0.23.0 Reporter: Vinod Kumar Vavilapalli Assignee: Bikas Saha Fix For: 0.23.2 Attachments: MAPREDUCE-3921-1.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921-branch-0.23.patch, MAPREDUCE-3921.patch -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249868#comment-13249868 ] Jonathan Eagles commented on MAPREDUCE-4117: Thanks for the patch Devaraj. One case I'm wondering about is when getTaskFailureEventString is called on a Job with only non-TaskCompletionEvent.Status.FAILED events. Seems like lastEvent will still be null in that case. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249869#comment-13249869 ] Hadoop QA commented on MAPREDUCE-4117: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12521957/MAPREDUCE-4117.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 1 new or modified test files. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2173//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2173//console This message is automatically generated. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-4059: --- Status: Open (was: Patch Available) Up-merging patch. The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-4059: --- Status: Patch Available (was: Open) The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-4059: --- Attachment: MR-4059.txt No code changes just upmerged. The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4127) Resource manager UI does not show the Job Priority
Resource manager UI does not show the Job Priority -- Key: MAPREDUCE-4127 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4127 Project: Hadoop Map/Reduce Issue Type: Improvement Reporter: Nishan Shetty In RM UI the priority of job is not displayed -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3932) MR tasks failing and crashing the AM when available-resources/headRoom becomes zero
[ https://issues.apache.org/jira/browse/MAPREDUCE-3932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249877#comment-13249877 ] Robert Joseph Evans commented on MAPREDUCE-3932: @Vinod I know you are swamped is it OK with you if I take this one? MR tasks failing and crashing the AM when available-resources/headRoom becomes zero --- Key: MAPREDUCE-3932 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3932 Project: Hadoop Map/Reduce Issue Type: Bug Components: mr-am, mrv2 Affects Versions: 0.23.0 Reporter: Vinod Kumar Vavilapalli Assignee: Vinod Kumar Vavilapalli Priority: Critical Fix For: 0.23.2 [~karams] reported this offline. One reduce task gets preempted because of zero headRoom and crashes the AM. {code} 2012-02-23 11:30:15,956 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: PendingReduces:377 ScheduledMaps:6 ScheduledReduces:23 AssignedMaps:0 AssignedReduces:0 completedMaps:4 completedReduces:0 containersAllocated:4 containersReleased:0 hostLocalAssigned:0 rackLocalAssigned:4 availableResources(headroom):memory: 44544 2012-02-23 11:30:16,959 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Before Scheduling: PendingReduces:377 ScheduledMaps:6 ScheduledReduces:23 AssignedMaps:0 AssignedReduces:0 completedMaps:4 completedReduces:0 containersAllocated:4 containersReleased:0 hostLocalAssigned:0 rackLocalAssigned:4 availableResources(headroom):memory: 44544 2012-02-23 11:30:16,965 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Scheduling: PendingReduces:377 ScheduledMaps:6 ScheduledReduces:23 AssignedMaps:0 AssignedReduces:0 completedMaps:4 completedReduces:0 containersAllocated:4 containersReleased:0 hostLocalAssigned:0 rackLocalAssigned:4 availableResources(headroom):memory: 0 2012-02-23 11:30:16,965 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Before Assign: PendingReduces:377 ScheduledMaps:6 ScheduledReduces:23 AssignedMaps:0 AssignedReduces:0 completedMaps:4 completedReduces:0 containersAllocated:4 containersReleased:0 hostLocalAssigned:0 rackLocalAssigned:4 availableResources(headroom):memory: 0 2012-02-23 11:30:16,965 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Got allocated containers 3 2012-02-23 11:30:16,965 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned to reduce 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned container container_1329995034628_0983_01_06 to attempt_1329995034628_0983_r_00_0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned to reduce 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned container container_1329995034628_0983_01_07 to attempt_1329995034628_0983_r_01_0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned to reduce 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Assigned container container_1329995034628_0983_01_08 to attempt_1329995034628_0983_r_02_0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: After Assign: PendingReduces:377 ScheduledMaps:6 ScheduledReduces:20 AssignedMaps:0 AssignedReduces:3 completedMaps:4 completedReduces:0 containersAllocated:7 containersReleased:0 hostLocalAssigned:0 rackLocalAssigned:4 availableResources(headroom):memory: 0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Ramping down all scheduled reduces:20 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Going to preempt 2 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Preempting attempt_1329995034628_0983_r_02_0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Preempting attempt_1329995034628_0983_r_01_0 2012-02-23 11:30:16,966 INFO [RMCommunicator Allocator] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator:
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249888#comment-13249888 ] Devaraj K commented on MAPREDUCE-4117: -- Thanks Bobby for taking a look into the patch. When the Job is failed and it doesn't contain any task attempts(i.e Job is failed before launching any tasks itself) then the lastEvent will be null and it will lead to NPE. If the Job fails and job contains some attempts means the job should have failed because of task failures. This case lastEvent will not be null. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249891#comment-13249891 ] Devaraj K commented on MAPREDUCE-4117: -- {quote} -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs {quote} These test failures are not related to the patch. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249895#comment-13249895 ] Hadoop QA commented on MAPREDUCE-4059: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12521964/MR-4059.txt against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 7 new or modified test files. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2174//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2174//console This message is automatically generated. The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249905#comment-13249905 ] Robert Joseph Evans commented on MAPREDUCE-4059: {quote}org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs{quote} These tests have been failing randomly and are not related to the patch. The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4105) Yarn RackResolver ignores rack configurations
[ https://issues.apache.org/jira/browse/MAPREDUCE-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249909#comment-13249909 ] Tom White commented on MAPREDUCE-4105: -- +1 thanks for addressing my comments, Ahmed. Yarn RackResolver ignores rack configurations - Key: MAPREDUCE-4105 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4105 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 0.23.2 Attachments: MAPREDUCE-4105.patch, MAPREDUCE-4105_rev2.patch, MAPREDUCE-4105_rev3.patch Incorrect mappings because the Yarn RackResolver ignores rack configurations. This can be verified by inspecting the resource manager web ui that lists all the nodes, all of them show up with /default-rack regardless of the output from the script specified using net.topology.script.file.name configuration property. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249989#comment-13249989 ] Jonathan Eagles commented on MAPREDUCE-4117: My initial thought was the same as yours. However, in TypeConverter, you will see that both JobState.FAILED and JobState.ERROR are translated to JobStatus.State.FAILED. I believe it to be to much to assume that if JobStatus.State is FAILED, then you will be guaranteed to have either 1)a failed task completion event or 2) no task completion events at all. It is much more clear and simple to me to check for null in this case. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4117: - Attachment: MAPREDUCE-4117-1.patch mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4117: - Status: Patch Available (was: Open) I am sorry Jon. It's my bad, I have mentioned the name wrongly in the above comment. Thanks for looking and identifying all the cases. I have attached the patch as per your suggestion. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj K updated MAPREDUCE-4117: - Status: Open (was: Patch Available) mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250010#comment-13250010 ] Bikas Saha commented on MAPREDUCE-4128: --- In recovery mode, the recovery service assigns a success status to any task that has a FINISHED event reported. The only way that status could be changed is if there is a FAILED event for that task, in which case a failed status is assigned to that task. So once a task is marked with a success status, it remains so even when subsequent events kill the successful task attempt and marks it invalid. Next the recovery service adds all success status tasks into a completedTasks collection. Then it proceeds to enumerate the events and process them. When it hits a TaskEventType.*_KILLED/FAILED/SUCCEEDED then it removes those attempts from the completedTasks. Recovery does not complete until all attempts of all completedTasks are removed. Now the following sequence of events can happen for Tasks A and B. A1 represents task attempt 1 of A. CompletedTasks contains A and B. A1 and A2 are succeeded. A2 was a rerun of A1. B1 is succeeded and B2 was running when AM crashed. A1- container request is processed. It uses the nodeid info from A1 to work. B1- container request is processed. It uses the nodeid info from B1 to work. A1- Succeeded. removes A1 B1- Succeeded. removes B1 A2- container request is processed. It uses the nodeid info from A2 to work B2- container request is processed. It uses the nodeid info from B2 to work. But there is no such info as it is populated on task attempt completion. AM crashed here while trying to resolve the nodeid. If AM had not crashed the following would have happened A2- Succeeded. removes A2 There is no FAILED/KILLED/SUCCEEDED event for B2 since it was running when the AM crashed. So it seems the AM would never move out of recovery. If the above is correct, there seems to be 2 problems 1) While recovery is in process, event handling for task attempts that are not in a completed state. I am not sure if the recovery design allows this and the current crash is simply a case of missing info. 2) Expecting every task attempt of a completedTask to have a KILLED/FAILED/SUCCEEDED entry. This seems to be clearly wrong in the current scenario. AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-3493) Add the default mapreduce.shuffle.port property to mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-3493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] madhukara phatak updated MAPREDUCE-3493: Attachment: MAPREDUCE-3493-1.patch Fixed core tests errors. Add the default mapreduce.shuffle.port property to mapred-default.xml - Key: MAPREDUCE-3493 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3493 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Priority: Minor Labels: newbie Attachments: MAPREDUCE-3493-1.patch, MAPREDUCE-3493.patch I faced this issue when trying to run multiple Hadoop MR2 instances on the same node. The default value for this property is hardcoded in the ShuffleHandler.java class so it results in port conflicts. The issue is resolved if you set the property value in your conf files. But the absence of this property from *-default.xml files is confusing. So It'll be cleaner to move this property to mapred-default.xml, so its default value can be easily identified and changed if needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-4099: -- Status: Open (was: Patch Available) ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-4099: -- Attachment: MAPREDUCE-4099.patch Apparently I need to submit a new patch to convince Jenkins to run again. ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch, MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Lowe updated MAPREDUCE-4099: -- Status: Patch Available (was: Open) ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch, MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250054#comment-13250054 ] Hadoop QA commented on MAPREDUCE-4117: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12521978/MAPREDUCE-4117-1.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 1 new or modified test files. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2175//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2175//console This message is automatically generated. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250061#comment-13250061 ] Jonathan Eagles commented on MAPREDUCE-4117: +1 lgtm np. this will be a great addition to the cli. i have verified that the test fail without the patch and succeed with the patch. mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250073#comment-13250073 ] Jonathan Eagles commented on MAPREDUCE-4117: I should have stated +1 (non-binding). A committer will need to put this in mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4059) The history server should have a separate pluggable storage/query interface
[ https://issues.apache.org/jira/browse/MAPREDUCE-4059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250075#comment-13250075 ] Thomas Graves commented on MAPREDUCE-4059: -- For the most part looks good. can you make the formatting in CachedHistoryStorage.java consistent - some places are missing a space between if and (. I think it would be more extensible to have the getPartialJobs interface take a class rather then all the individual query parameters. If you want to push this off to another Jira I'm fine with that since its marked as InterfaceStability.Unstable. The history server should have a separate pluggable storage/query interface --- Key: MAPREDUCE-4059 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4059 Project: Hadoop Map/Reduce Issue Type: Improvement Components: mrv2 Affects Versions: 0.24.0, 0.23.3 Reporter: Robert Joseph Evans Assignee: Robert Joseph Evans Attachments: MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt, MR-4059.txt The history server currently caches all parsed jobs in RAM. These jobs can be very large because of counters. It would be nice to have a pluggable interface for the cacheing and querying of the cached data so that we can play around with different implementations. Also just for cleanness of the code it would be nice to split the very large JobHistoryServer.java into a few smaller ones that are more understandable and readable. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250093#comment-13250093 ] Hadoop QA commented on MAPREDUCE-4099: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12521985/MAPREDUCE-4099.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 7 new or modified test files. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2177//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2177//console This message is automatically generated. ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch, MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3493) Add the default mapreduce.shuffle.port property to mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-3493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250098#comment-13250098 ] Hadoop QA commented on MAPREDUCE-3493: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12521983/MAPREDUCE-3493-1.patch against trunk revision . +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed unit tests in . +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2176//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2176//console This message is automatically generated. Add the default mapreduce.shuffle.port property to mapred-default.xml - Key: MAPREDUCE-3493 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3493 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Priority: Minor Labels: newbie Attachments: MAPREDUCE-3493-1.patch, MAPREDUCE-3493.patch I faced this issue when trying to run multiple Hadoop MR2 instances on the same node. The default value for this property is hardcoded in the ShuffleHandler.java class so it results in port conflicts. The issue is resolved if you set the property value in your conf files. But the absence of this property from *-default.xml files is confusing. So It'll be cleaner to move this property to mapred-default.xml, so its default value can be easily identified and changed if needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4017) Add jobname to jobsummary log
[ https://issues.apache.org/jira/browse/MAPREDUCE-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated MAPREDUCE-4017: - Target Version/s: 1.1.0, 0.23.3 (was: 0.23.3, 1.1.0) Status: Patch Available (was: Open) Add jobname to jobsummary log - Key: MAPREDUCE-4017 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4017 Project: Hadoop Map/Reduce Issue Type: Improvement Components: jobhistoryserver, jobtracker Affects Versions: 0.23.0, 0.20.205.0 Reporter: Koji Noguchi Assignee: Thomas Graves Priority: Trivial Attachments: MAPREDUCE-4017-v0.20-1.patch, MAPREDUCE-4017.patch We occasionally use jobsummary from the JobTracker to collect users' slot usage on our clusters. It would be useful if the jobname was part of this jobsummary so that I don't need to join with other logs. Same jobsummary. 2012-03-15 16:05:55,919 INFO mapred.JobInProgress$JobSummary: jobId=job_201202160624_1089972,submitTime=1331827523632,launchTime=1331827528197,firstMapTaskLaunchTime=1331827536917,firstReduceTaskLaunchTime=1331827541251,firstJobSetupTaskLaunchTime=1331827528200,firstJobCleanupTaskLaunchTime=1331827551655,finishTime=1331827555919,numMaps=1,numSlotsPerMap=1,numReduces=1,numSlotsPerReduce=1,user=tortuga,queue=queue1,status=SUCCEEDED,mapSlotSeconds=13,reduceSlotsSeconds=10,clusterMapCapacity=,clusterReduceCapacity= I'd like to see jobName added to the end. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4017) Add jobname to jobsummary log
[ https://issues.apache.org/jira/browse/MAPREDUCE-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated MAPREDUCE-4017: - Attachment: MAPREDUCE-4017.patch Thanks for the patch Koji. Here is the corresponding trunk/branch-2 patch. I've tested on both branch1 and trunk on single node clusters and verified the output as well. Add jobname to jobsummary log - Key: MAPREDUCE-4017 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4017 Project: Hadoop Map/Reduce Issue Type: Improvement Components: jobhistoryserver, jobtracker Affects Versions: 0.20.205.0, 0.23.0 Reporter: Koji Noguchi Assignee: Thomas Graves Priority: Trivial Attachments: MAPREDUCE-4017-v0.20-1.patch, MAPREDUCE-4017.patch We occasionally use jobsummary from the JobTracker to collect users' slot usage on our clusters. It would be useful if the jobname was part of this jobsummary so that I don't need to join with other logs. Same jobsummary. 2012-03-15 16:05:55,919 INFO mapred.JobInProgress$JobSummary: jobId=job_201202160624_1089972,submitTime=1331827523632,launchTime=1331827528197,firstMapTaskLaunchTime=1331827536917,firstReduceTaskLaunchTime=1331827541251,firstJobSetupTaskLaunchTime=1331827528200,firstJobCleanupTaskLaunchTime=1331827551655,finishTime=1331827555919,numMaps=1,numSlotsPerMap=1,numReduces=1,numSlotsPerReduce=1,user=tortuga,queue=queue1,status=SUCCEEDED,mapSlotSeconds=13,reduceSlotsSeconds=10,clusterMapCapacity=,clusterReduceCapacity= I'd like to see jobName added to the end. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4017) Add jobname to jobsummary log
[ https://issues.apache.org/jira/browse/MAPREDUCE-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250118#comment-13250118 ] Hadoop QA commented on MAPREDUCE-4017: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12522004/MAPREDUCE-4017.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 1 new or modified test files. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2178//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2178//console This message is automatically generated. Add jobname to jobsummary log - Key: MAPREDUCE-4017 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4017 Project: Hadoop Map/Reduce Issue Type: Improvement Components: jobhistoryserver, jobtracker Affects Versions: 0.20.205.0, 0.23.0 Reporter: Koji Noguchi Assignee: Thomas Graves Priority: Trivial Attachments: MAPREDUCE-4017-v0.20-1.patch, MAPREDUCE-4017.patch We occasionally use jobsummary from the JobTracker to collect users' slot usage on our clusters. It would be useful if the jobname was part of this jobsummary so that I don't need to join with other logs. Same jobsummary. 2012-03-15 16:05:55,919 INFO mapred.JobInProgress$JobSummary: jobId=job_201202160624_1089972,submitTime=1331827523632,launchTime=1331827528197,firstMapTaskLaunchTime=1331827536917,firstReduceTaskLaunchTime=1331827541251,firstJobSetupTaskLaunchTime=1331827528200,firstJobCleanupTaskLaunchTime=1331827551655,finishTime=1331827555919,numMaps=1,numSlotsPerMap=1,numReduces=1,numSlotsPerReduce=1,user=tortuga,queue=queue1,status=SUCCEEDED,mapSlotSeconds=13,reduceSlotsSeconds=10,clusterMapCapacity=,clusterReduceCapacity= I'd like to see jobName added to the end. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3493) Add the default mapreduce.shuffle.port property to mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-3493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250119#comment-13250119 ] Harsh J commented on MAPREDUCE-3493: Looks good. Can you also elaborate a bit more on what the ShuffleHandler is, in the description field? May come handy to some. Add the default mapreduce.shuffle.port property to mapred-default.xml - Key: MAPREDUCE-3493 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3493 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Priority: Minor Labels: newbie Attachments: MAPREDUCE-3493-1.patch, MAPREDUCE-3493.patch I faced this issue when trying to run multiple Hadoop MR2 instances on the same node. The default value for this property is hardcoded in the ShuffleHandler.java class so it results in port conflicts. The issue is resolved if you set the property value in your conf files. But the absence of this property from *-default.xml files is confusing. So It'll be cleaner to move this property to mapred-default.xml, so its default value can be easily identified and changed if needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (MAPREDUCE-1233) Incorrect Waiting maps/reduces in Jobtracker metrics
[ https://issues.apache.org/jira/browse/MAPREDUCE-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved MAPREDUCE-1233. -- Resolution: Fixed Fix Version/s: 0.20.203.0 Incorrect Waiting maps/reduces in Jobtracker metrics - Key: MAPREDUCE-1233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1233 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Reporter: V.Karthikeyan Assignee: Luke Lu Fix For: 0.20.203.0 Attachments: mr-1233-y20s-v1.patch, mr-1233-y20s-v2.patch Waiting Maps/Reduces are incorrect in Jobtracker metrics when a job fails. when a map/reduce fails(during job failure), waiting maps/reduce got incremented and doesn't get decremented even after job cleanup. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4017) Add jobname to jobsummary log
[ https://issues.apache.org/jira/browse/MAPREDUCE-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250128#comment-13250128 ] Thomas Graves commented on MAPREDUCE-4017: -- test failures are known and not caused by this change. Add jobname to jobsummary log - Key: MAPREDUCE-4017 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4017 Project: Hadoop Map/Reduce Issue Type: Improvement Components: jobhistoryserver, jobtracker Affects Versions: 0.20.205.0, 0.23.0 Reporter: Koji Noguchi Assignee: Thomas Graves Priority: Trivial Attachments: MAPREDUCE-4017-v0.20-1.patch, MAPREDUCE-4017.patch We occasionally use jobsummary from the JobTracker to collect users' slot usage on our clusters. It would be useful if the jobname was part of this jobsummary so that I don't need to join with other logs. Same jobsummary. 2012-03-15 16:05:55,919 INFO mapred.JobInProgress$JobSummary: jobId=job_201202160624_1089972,submitTime=1331827523632,launchTime=1331827528197,firstMapTaskLaunchTime=1331827536917,firstReduceTaskLaunchTime=1331827541251,firstJobSetupTaskLaunchTime=1331827528200,firstJobCleanupTaskLaunchTime=1331827551655,finishTime=1331827555919,numMaps=1,numSlotsPerMap=1,numReduces=1,numSlotsPerReduce=1,user=tortuga,queue=queue1,status=SUCCEEDED,mapSlotSeconds=13,reduceSlotsSeconds=10,clusterMapCapacity=,clusterReduceCapacity= I'd like to see jobName added to the end. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-1233) Incorrect Waiting maps/reduces in Jobtracker metrics
[ https://issues.apache.org/jira/browse/MAPREDUCE-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250127#comment-13250127 ] Thomas Graves commented on MAPREDUCE-1233: -- Yes, Koji is right, this patch was integrated into branch-1/20s in revision 1077665 (branches/branch-1.0/src/mapred/org/apache/hadoop/mapred/JobInProgress.java) - which according to changes.txt would have went into 0.20.202.0. Moving this to resolved and can fix issue under MAPREDUCE-1238. Incorrect Waiting maps/reduces in Jobtracker metrics - Key: MAPREDUCE-1233 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1233 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Reporter: V.Karthikeyan Assignee: Luke Lu Fix For: 0.20.203.0 Attachments: mr-1233-y20s-v1.patch, mr-1233-y20s-v2.patch Waiting Maps/Reduces are incorrect in Jobtracker metrics when a job fails. when a map/reduce fails(during job failure), waiting maps/reduce got incremented and doesn't get decremented even after job cleanup. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (MAPREDUCE-1238) mapred metrics shows negative count of waiting maps and reduces
[ https://issues.apache.org/jira/browse/MAPREDUCE-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves reassigned MAPREDUCE-1238: Assignee: Thomas Graves mapred metrics shows negative count of waiting maps and reduces Key: MAPREDUCE-1238 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1238 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Reporter: Ramya Sunil Assignee: Thomas Graves Attachments: MAPREDUCE-1238-v0.20-1.patch Negative waiting_maps and waiting_reduces count is observed in the mapred metrics -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250151#comment-13250151 ] Jason Lowe commented on MAPREDUCE-4099: --- All of the reported test failures appear to be unrelated to the patch. They all fail because a ResourceManager process can't start due to a socket bind problem -- a runaway RM process on the build machine, perhaps? I ran the RM unit tests locally with this patch and they all pass. I also manually tested the patch with a single-node cluster running sleep and wordcount jobs. Also connected the debugger to the ApplicationMaster, causing it to linger artificially in the FINISHING state to verify killing or expiring an application in the FINISHING state behaves properly. ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch, MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250185#comment-13250185 ] Bikas Saha commented on MAPREDUCE-4128: --- I have managed to repro this using a modified TestFetchFailure. So this is unrelated to the specific changes in MAPREDUCE-3921. AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-1238) mapred metrics shows negative count of waiting maps and reduces
[ https://issues.apache.org/jira/browse/MAPREDUCE-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250207#comment-13250207 ] Thomas Graves commented on MAPREDUCE-1238: -- I believe this will still miss a case when it is killed while in progress of initializing the job. If it receives a kill during that initialization it waits until init is done and then throws to do the kill. The tasksInited will not have been set to true because the throw happens right before that is set and thus the metrics won't be properly decremented. Working on a fix for that. mapred metrics shows negative count of waiting maps and reduces Key: MAPREDUCE-1238 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1238 Project: Hadoop Map/Reduce Issue Type: Bug Components: jobtracker Reporter: Ramya Sunil Assignee: Thomas Graves Attachments: MAPREDUCE-1238-v0.20-1.patch Negative waiting_maps and waiting_reduces count is observed in the mapred metrics -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Joseph Evans updated MAPREDUCE-4117: --- Resolution: Fixed Fix Version/s: 2.0.0 0.23.3 Status: Resolved (was: Patch Available) I agree with Jon, +1. I merged this into trunk, branch-2, and branch-0.23 mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Fix For: 0.23.3, 2.0.0 Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250228#comment-13250228 ] Hudson commented on MAPREDUCE-4117: --- Integrated in Hadoop-Common-trunk-Commit #2031 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2031/]) MAPREDUCE-4117. mapred job -status throws NullPointerException (Devaraj K via bobby) (Revision 1311479) Result = SUCCESS bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311479 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/Job.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/test/java/org/apache/hadoop/mapreduce/TestJob.java mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Fix For: 0.23.3, 2.0.0 Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4117) mapred job -status throws NullPointerException
[ https://issues.apache.org/jira/browse/MAPREDUCE-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250242#comment-13250242 ] Hudson commented on MAPREDUCE-4117: --- Integrated in Hadoop-Mapreduce-trunk-Commit #2042 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2042/]) MAPREDUCE-4117. mapred job -status throws NullPointerException (Devaraj K via bobby) (Revision 1311479) Result = SUCCESS bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311479 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/Job.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/test/java/org/apache/hadoop/mapreduce/TestJob.java mapred job -status throws NullPointerException -- Key: MAPREDUCE-4117 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4117 Project: Hadoop Map/Reduce Issue Type: Bug Components: client, mrv2 Affects Versions: 2.0.0, 3.0.0 Reporter: Devaraj K Assignee: Devaraj K Priority: Critical Fix For: 0.23.3, 2.0.0 Attachments: MAPREDUCE-4117-1.patch, MAPREDUCE-4117.patch {code:xml} dev@ubuntudev-linux:~/hadoop/hadoop-trunk/bin$ ./mapred job -status job_1333408894669_0001 Exception in thread main java.lang.NullPointerException at org.apache.hadoop.mapreduce.Job.getTaskFailureEventString(Job.java:512) at org.apache.hadoop.mapreduce.Job.toString(Job.java:463) at java.lang.String.valueOf(String.java:2838) at java.io.PrintStream.println(PrintStream.java:788) at org.apache.hadoop.mapreduce.tools.CLI.run(CLI.java:255) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:69) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:83) at org.apache.hadoop.mapred.JobClient.main(JobClient.java:1244) {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4049) plugin for generic shuffle service
[ https://issues.apache.org/jira/browse/MAPREDUCE-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250275#comment-13250275 ] Mariappan Asokan commented on MAPREDUCE-4049: - Hi Avner, I worked on MAPREDUCE-2454(to make sort pluggable in Hadoop) and posted a patch on top of trunk version 1221902 a while back. The patch was created on top of the trunk since ReduceTask.java was already refactored nicely and I was advised to work on the trunk version. Please take a look at the patch file mapreduce-2454.patch posted in MAPREDUCE-2454. If you want, I can post a patch on top of the latest trunk. The patch decoupled the merge from shuffle by creating ShuffleRunner and ShuffleCallback interfaces. The MergeManager implements the ShuffleCallback and the shuffle itself implements ShuffleRunner interface. Since you are making shuffle as pluggable, I notice some overlapping changes. If I can be of any assistance to reduce the conflict between our patches, please let me know. Meanwhile, I will go over the details of your patch and get back. Do you have a patch created on top of trunk? Also, I would like to hear opinions from other developers who have shown interest in this Jira. plugin for generic shuffle service -- Key: MAPREDUCE-4049 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4049 Project: Hadoop Map/Reduce Issue Type: Improvement Components: performance, task, tasktracker Affects Versions: 0.23.1, 1.0.1 Reporter: Avner BenHanoch Labels: merge, plugin, rdma, shuffle Attachments: HADOOP-1.0.2.patch, HADOOP-1.0.x.patch, Hadoop Shuffle Consumer Plugin TLD.rtf, Hadoop Shuffle Provider Plugin TLD.rtf, MAPREDUCE-4049-branch-1.0.2.patch, mapred-site.xml, mapred.diff, src.tgz, test.diff Support generic shuffle service as set of two plugins: ShuffleProvider ShuffleConsumer. This will satisfy the following needs: # Better shuffle and merge performance. For example: we are working on shuffle plugin that performs shuffle over RDMA in fast networks (10gE, 40gE, or Infiniband) instead of using the current HTTP shuffle. Based on the fast RDMA shuffle, the plugin can also utilize a suitable merge approach during the intermediate merges. Hence, getting much better performance. # Satisfy MAPREDUCE-3060 - generic shuffle service for avoiding hidden dependency of NodeManager with a specific version of mapreduce shuffle (currently targeted to 0.24.0). References: # Hadoop Acceleration through Network Levitated Merging, by Prof. Weikuan Yu from Auburn University with others, [http://pasl.eng.auburn.edu/pubs/sc11-netlev.pdf] # I am attaching 2 documents with suggested Top Level Design for both plugins (currently, based on 1.0 branch) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4105) Yarn RackResolver ignores rack configurations
[ https://issues.apache.org/jira/browse/MAPREDUCE-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom White updated MAPREDUCE-4105: - Resolution: Fixed Fix Version/s: (was: 0.23.2) 2.0.0 Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) I've just committed this. Thanks Ahmed! Yarn RackResolver ignores rack configurations - Key: MAPREDUCE-4105 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4105 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 2.0.0 Attachments: MAPREDUCE-4105.patch, MAPREDUCE-4105_rev2.patch, MAPREDUCE-4105_rev3.patch Incorrect mappings because the Yarn RackResolver ignores rack configurations. This can be verified by inspecting the resource manager web ui that lists all the nodes, all of them show up with /default-rack regardless of the output from the script specified using net.topology.script.file.name configuration property. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (MAPREDUCE-4129) Lots of unneeded counters log messages
Lots of unneeded counters log messages -- Key: MAPREDUCE-4129 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4129 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 0.23.2 Huge number of the same WARN messages are written. We only need to write each distinct message once. The messages are of the form: {code} 2012-04-05 03:55:04,166 WARN mapreduce.Counters: Group {oldGroup} is deprecated. Use {newGroup} instead {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4129) Lots of unneeded counters log messages
[ https://issues.apache.org/jira/browse/MAPREDUCE-4129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed Radwan updated MAPREDUCE-4129: Attachment: MAPREDUCE-4129.patch Attaching a patch addressing this issue. The patch modifies the behavior to only write the log message the first time the oldGroupName is found, instead of every time getGroup(..) is called. I don't think we need a test case for this change as it is a straightforward change in message logging. Lots of unneeded counters log messages -- Key: MAPREDUCE-4129 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4129 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 0.23.2 Attachments: MAPREDUCE-4129.patch Huge number of the same WARN messages are written. We only need to write each distinct message once. The messages are of the form: {code} 2012-04-05 03:55:04,166 WARN mapreduce.Counters: Group {oldGroup} is deprecated. Use {newGroup} instead {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4129) Lots of unneeded counters log messages
[ https://issues.apache.org/jira/browse/MAPREDUCE-4129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ahmed Radwan updated MAPREDUCE-4129: Status: Patch Available (was: Open) Lots of unneeded counters log messages -- Key: MAPREDUCE-4129 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4129 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 0.23.2 Attachments: MAPREDUCE-4129.patch Huge number of the same WARN messages are written. We only need to write each distinct message once. The messages are of the form: {code} 2012-04-05 03:55:04,166 WARN mapreduce.Counters: Group {oldGroup} is deprecated. Use {newGroup} instead {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-3409) Incorrect custom task status when running on MR2
[ https://issues.apache.org/jira/browse/MAPREDUCE-3409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom White updated MAPREDUCE-3409: - Target Version/s: 2.0.0 Incorrect custom task status when running on MR2 Key: MAPREDUCE-3409 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3409 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan To reproduce this problem: 1- In your mapper setup() set: {code} context.setStatus(myStatus) {code} 2- When the job finishes: {code} TaskReport[] reports = job.getTaskReports(TaskType.MAP); assertEquals(myStatus, reports[0].getState()); {code} The returned status from reports[0].getState() is SUCCEEDED as opposed to the expected myStatus value. This exact code work fine on MR1. I saw this issue when tried running the TestTaskContext test cases on MR2. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4105) Yarn RackResolver ignores rack configurations
[ https://issues.apache.org/jira/browse/MAPREDUCE-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250320#comment-13250320 ] Hudson commented on MAPREDUCE-4105: --- Integrated in Hadoop-Mapreduce-trunk-Commit #2043 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2043/]) MAPREDUCE-4105. Yarn RackResolver ignores rack configurations. Contributed by Ahmed Radwan. (Revision 1311520) Result = SUCCESS tomwhite : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311520 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/RackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolverScriptBasedMapping.java Yarn RackResolver ignores rack configurations - Key: MAPREDUCE-4105 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4105 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 2.0.0 Attachments: MAPREDUCE-4105.patch, MAPREDUCE-4105_rev2.patch, MAPREDUCE-4105_rev3.patch Incorrect mappings because the Yarn RackResolver ignores rack configurations. This can be verified by inspecting the resource manager web ui that lists all the nodes, all of them show up with /default-rack regardless of the output from the script specified using net.topology.script.file.name configuration property. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3869) Distributed shell application fails with NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-3869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250322#comment-13250322 ] Siddharth Seth commented on MAPREDUCE-3869: --- +1. Will commit this shortly. Distributed shell application fails with NoClassDefFoundError - Key: MAPREDUCE-3869 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3869 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker Fix For: 0.23.2 Attachments: MAPREDUCE-3869-1.patch, MAPREDUCE-3869-2.patch, MAPREDUCE-3869.patch Distributed shell application always fails to start the application master with the following error. \\ {code:xml} 12/02/16 05:35:25 FATAL distributedshell.ApplicationMaster: Error running ApplicationMaster java.lang.NoClassDefFoundError: org/apache/hadoop/yarn/ipc/YarnRPC at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.init(ApplicationMaster.java:252) at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:195) Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.ipc.YarnRPC at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) ... 2 more {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-3869) Distributed shell application fails with NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-3869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated MAPREDUCE-3869: -- Resolution: Fixed Fix Version/s: (was: 0.23.2) 2.0.0 Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Committed to trunk and branch-2. Thanks Devaraj. Distributed shell application fails with NoClassDefFoundError - Key: MAPREDUCE-3869 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3869 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker Fix For: 2.0.0 Attachments: MAPREDUCE-3869-1.patch, MAPREDUCE-3869-2.patch, MAPREDUCE-3869.patch Distributed shell application always fails to start the application master with the following error. \\ {code:xml} 12/02/16 05:35:25 FATAL distributedshell.ApplicationMaster: Error running ApplicationMaster java.lang.NoClassDefFoundError: org/apache/hadoop/yarn/ipc/YarnRPC at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.init(ApplicationMaster.java:252) at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:195) Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.ipc.YarnRPC at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) ... 2 more {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4105) Yarn RackResolver ignores rack configurations
[ https://issues.apache.org/jira/browse/MAPREDUCE-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250328#comment-13250328 ] Hudson commented on MAPREDUCE-4105: --- Integrated in Hadoop-Common-trunk-Commit #2032 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2032/]) MAPREDUCE-4105. Yarn RackResolver ignores rack configurations. Contributed by Ahmed Radwan. (Revision 1311520) Result = SUCCESS tomwhite : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311520 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/RackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolverScriptBasedMapping.java Yarn RackResolver ignores rack configurations - Key: MAPREDUCE-4105 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4105 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 2.0.0 Attachments: MAPREDUCE-4105.patch, MAPREDUCE-4105_rev2.patch, MAPREDUCE-4105_rev3.patch Incorrect mappings because the Yarn RackResolver ignores rack configurations. This can be verified by inspecting the resource manager web ui that lists all the nodes, all of them show up with /default-rack regardless of the output from the script specified using net.topology.script.file.name configuration property. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3869) Distributed shell application fails with NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-3869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250329#comment-13250329 ] Hudson commented on MAPREDUCE-3869: --- Integrated in Hadoop-Common-trunk-Commit #2032 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2032/]) MAPREDUCE-3869. Fix classpath for DistributedShell application. (Contributed by Devaraj K) (Revision 1311523) Result = SUCCESS sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311523 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/Client.java Distributed shell application fails with NoClassDefFoundError - Key: MAPREDUCE-3869 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3869 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker Fix For: 2.0.0 Attachments: MAPREDUCE-3869-1.patch, MAPREDUCE-3869-2.patch, MAPREDUCE-3869.patch Distributed shell application always fails to start the application master with the following error. \\ {code:xml} 12/02/16 05:35:25 FATAL distributedshell.ApplicationMaster: Error running ApplicationMaster java.lang.NoClassDefFoundError: org/apache/hadoop/yarn/ipc/YarnRPC at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.init(ApplicationMaster.java:252) at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:195) Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.ipc.YarnRPC at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) ... 2 more {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3869) Distributed shell application fails with NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-3869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250334#comment-13250334 ] Hudson commented on MAPREDUCE-3869: --- Integrated in Hadoop-Hdfs-trunk-Commit #2107 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2107/]) MAPREDUCE-3869. Fix classpath for DistributedShell application. (Contributed by Devaraj K) (Revision 1311523) Result = SUCCESS sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311523 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/Client.java Distributed shell application fails with NoClassDefFoundError - Key: MAPREDUCE-3869 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3869 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker Fix For: 2.0.0 Attachments: MAPREDUCE-3869-1.patch, MAPREDUCE-3869-2.patch, MAPREDUCE-3869.patch Distributed shell application always fails to start the application master with the following error. \\ {code:xml} 12/02/16 05:35:25 FATAL distributedshell.ApplicationMaster: Error running ApplicationMaster java.lang.NoClassDefFoundError: org/apache/hadoop/yarn/ipc/YarnRPC at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.init(ApplicationMaster.java:252) at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:195) Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.ipc.YarnRPC at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) ... 2 more {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4105) Yarn RackResolver ignores rack configurations
[ https://issues.apache.org/jira/browse/MAPREDUCE-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250333#comment-13250333 ] Hudson commented on MAPREDUCE-4105: --- Integrated in Hadoop-Hdfs-trunk-Commit #2107 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2107/]) MAPREDUCE-4105. Yarn RackResolver ignores rack configurations. Contributed by Ahmed Radwan. (Revision 1311520) Result = SUCCESS tomwhite : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311520 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/util/RackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolver.java * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/util/TestRackResolverScriptBasedMapping.java Yarn RackResolver ignores rack configurations - Key: MAPREDUCE-4105 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4105 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 2.0.0 Attachments: MAPREDUCE-4105.patch, MAPREDUCE-4105_rev2.patch, MAPREDUCE-4105_rev3.patch Incorrect mappings because the Yarn RackResolver ignores rack configurations. This can be verified by inspecting the resource manager web ui that lists all the nodes, all of them show up with /default-rack regardless of the output from the script specified using net.topology.script.file.name configuration property. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4129) Lots of unneeded counters log messages
[ https://issues.apache.org/jira/browse/MAPREDUCE-4129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250338#comment-13250338 ] Hadoop QA commented on MAPREDUCE-4129: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12522044/MAPREDUCE-4129.patch against trunk revision . +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 eclipse:eclipse. The patch built with eclipse:eclipse. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.yarn.server.resourcemanager.TestClientRMService org.apache.hadoop.yarn.server.resourcemanager.resourcetracker.TestNMExpiry org.apache.hadoop.yarn.server.resourcemanager.TestAMAuthorization org.apache.hadoop.yarn.server.resourcemanager.TestApplicationACLs +1 contrib tests. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2179//testReport/ Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2179//console This message is automatically generated. Lots of unneeded counters log messages -- Key: MAPREDUCE-4129 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4129 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Fix For: 0.23.2 Attachments: MAPREDUCE-4129.patch Huge number of the same WARN messages are written. We only need to write each distinct message once. The messages are of the form: {code} 2012-04-05 03:55:04,166 WARN mapreduce.Counters: Group {oldGroup} is deprecated. Use {newGroup} instead {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bikas Saha updated MAPREDUCE-4128: -- Attachment: MAPREDUCE-4128.patch AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 Attachments: MAPREDUCE-4128.patch The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4057) Compilation error in RAID
[ https://issues.apache.org/jira/browse/MAPREDUCE-4057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250352#comment-13250352 ] Tsz Wo (Nicholas), SZE commented on MAPREDUCE-4057: --- Raid still cannot be compiled after the patch: {noformat} compile-test: [echo] contrib: raid [javac] /Users/szetszwo/hadoop/t-commit/hadoop-mapreduce-project/src/contrib/build-contrib.xml:227: warning: 'includeantruntime' was not set, defaulting to build.sysclasspath=last; set to false for repeatable builds [javac] Compiling 20 source files to /Users/szetszwo/hadoop/t-commit/hadoop-mapreduce-project/build/contrib/raid/test [javac] /Users/szetszwo/hadoop/t-commit/hadoop-mapreduce-project/src/contrib/raid/src/test/org/apache/hadoop/raid/TestBlockFixer.java:42: cannot find symbol [javac] symbol : class MiniMRCluster [javac] location: package org.apache.hadoop.mapred [javac] import org.apache.hadoop.mapred.MiniMRCluster; [javac]^ ... [javac] Note: Some input files use or override a deprecated API. [javac] Note: Recompile with -Xlint:deprecation for details. [javac] 12 errors {noformat} Compilation error in RAID -- Key: MAPREDUCE-4057 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4057 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/raid Reporter: Tsz Wo (Nicholas), SZE Attachments: MAPREDUCE-4057.patch {noformat} [javac] Compiling 33 source files to /Users/szetszwo/hadoop/t2/hadoop-mapreduce-project/build/contrib/raid/classes [javac] /Users/szetszwo/hadoop/t2/hadoop-mapreduce-project/src/contrib/raid/src/java/org/apache/hadoop/hdfs/server/namenode/NameNodeRaidUtil.java:42: unreported exception org.apache.hadoop.ipc.StandbyException; must be caught or declared to be thrown [javac] return namesystem.getFileInfo(src, resolveLink); [javac] ^ {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250355#comment-13250355 ] Bikas Saha commented on MAPREDUCE-4128: --- The current recovery mechanism seems to be designed to recover completed tasks. Hence it assumes that all attempts of such tasks would also be complete. So it loads completed tasks and replays them till the replay completes all attempts of all completed tasks. This will break whenever there is an attempt running after a previously successful completion because the replay will not have info to correctly work on running attempts. Scenario 1 : MAPREDUCE-3921 introduces such an instance because it re-runs successful map tasks if the successful attempts had run on a bad machine. Scenario 2 : Even in the current code, when a successful map is rerun because of too many fetch failures, the above scenario is produced and caused a failure in recovery. The proposed solution in the patch is to make sure that if a task is re-run then it is not marked as completed during recovery. The JobHistoryParser has been changed to remove the SUCCEEDED status on a task if the successful attempt of that task later reports a failure. This fixes the repro case mentioned above. I have improved that testcase to cover Scenario 1. Scenario 2 will be covered in MAPREDUCE-3921. I am expecting the patch to introduce 3 additional warnings because of raw types in event handling (similar to existing warnings). I compiled a broken rumen class and test assuming the new field added to TaskFinishedEvent is not relevant to them. AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 Attachments: MAPREDUCE-4128.patch The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3869) Distributed shell application fails with NoClassDefFoundError
[ https://issues.apache.org/jira/browse/MAPREDUCE-3869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250356#comment-13250356 ] Hudson commented on MAPREDUCE-3869: --- Integrated in Hadoop-Mapreduce-trunk-Commit #2044 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2044/]) MAPREDUCE-3869. Fix classpath for DistributedShell application. (Contributed by Devaraj K) (Revision 1311523) Result = ABORTED sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1311523 Files : * /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt * /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-applications/hadoop-yarn-applications-distributedshell/src/main/java/org/apache/hadoop/yarn/applications/distributedshell/Client.java Distributed shell application fails with NoClassDefFoundError - Key: MAPREDUCE-3869 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3869 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.1 Reporter: Devaraj K Assignee: Devaraj K Priority: Blocker Fix For: 2.0.0 Attachments: MAPREDUCE-3869-1.patch, MAPREDUCE-3869-2.patch, MAPREDUCE-3869.patch Distributed shell application always fails to start the application master with the following error. \\ {code:xml} 12/02/16 05:35:25 FATAL distributedshell.ApplicationMaster: Error running ApplicationMaster java.lang.NoClassDefFoundError: org/apache/hadoop/yarn/ipc/YarnRPC at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.init(ApplicationMaster.java:252) at org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.main(ApplicationMaster.java:195) Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.ipc.YarnRPC at java.net.URLClassLoader$1.run(URLClassLoader.java:200) at java.security.AccessController.doPrivileged(Native Method) at java.net.URLClassLoader.findClass(URLClassLoader.java:188) at java.lang.ClassLoader.loadClass(ClassLoader.java:303) at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) at java.lang.ClassLoader.loadClass(ClassLoader.java:248) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:316) ... 2 more {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4057) Compilation error in RAID
[ https://issues.apache.org/jira/browse/MAPREDUCE-4057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE updated MAPREDUCE-4057: -- Assignee: Devaraj K Devaraj, thanks for posting a patch. Could you also fix the errors in the tests? Compilation error in RAID -- Key: MAPREDUCE-4057 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4057 Project: Hadoop Map/Reduce Issue Type: Bug Components: contrib/raid Reporter: Tsz Wo (Nicholas), SZE Assignee: Devaraj K Attachments: MAPREDUCE-4057.patch {noformat} [javac] Compiling 33 source files to /Users/szetszwo/hadoop/t2/hadoop-mapreduce-project/build/contrib/raid/classes [javac] /Users/szetszwo/hadoop/t2/hadoop-mapreduce-project/src/contrib/raid/src/java/org/apache/hadoop/hdfs/server/namenode/NameNodeRaidUtil.java:42: unreported exception org.apache.hadoop.ipc.StandbyException; must be caught or declared to be thrown [javac] return namesystem.getFileInfo(src, resolveLink); [javac] ^ {noformat} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bikas Saha updated MAPREDUCE-4128: -- Status: Patch Available (was: Open) AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 Attachments: MAPREDUCE-4128.patch The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250360#comment-13250360 ] Bikas Saha commented on MAPREDUCE-4128: --- Correction. bq.This fixes the repro case mentioned above. I have improved that testcase to cover -Scenario 1- Scenario 2. -Scenario 2- Scenario 1 will be covered in MAPREDUCE-3921. AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 Attachments: MAPREDUCE-4128.patch The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4128) AM Recovery expects all attempts of a completed task to also be completed.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250373#comment-13250373 ] Hadoop QA commented on MAPREDUCE-4128: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12522049/MAPREDUCE-4128.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 2 new or modified test files. -1 patch. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2180//console This message is automatically generated. AM Recovery expects all attempts of a completed task to also be completed. -- Key: MAPREDUCE-4128 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4128 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 3.0.0 Reporter: Bikas Saha Assignee: Bikas Saha Fix For: 3.0.0 Attachments: MAPREDUCE-4128.patch The AM seems to assume that all attempts of a completed task (from a previous AM incarnation) would also be completed. There is at least one case in which this does not hold. Case being cancellation of a completed task resulting in a new running attempt. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-4099) ApplicationMaster may fail to remove staging directory
[ https://issues.apache.org/jira/browse/MAPREDUCE-4099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250387#comment-13250387 ] Siddharth Seth commented on MAPREDUCE-4099: --- bq. Initially I thought a quick fix would be to change MRAppMaster so the call to cleanupStagingDir() occurs before stopping all the services (and therefore before the RM tries to kill the AM). However this introduces another problem: if something goes wrong with the AM (killed, crashed, hung) between the time it has removed the staging directory and before it has notified the RM then the RM will think the AM did not complete successfully and it will either report the job as failed (after the AM already told the client it was successful) or worse, the RM will launch another AM attempt and fail because the staging directory has been removed. A similar situation would still exist. During service shutdown, all exceptions are ignored (logged) - and an attempt is made to shutdown all subsequent services. After service stop - the AM would proceed and delete the staging directory. A really bad case is if there was an error talking to the RM during the unregister. The client has already been told that the job is successful - the RM would have no idea, and the AM would eventually delete the staging directory and exit. Deleting the staging directory just before the RMCommunicator is stopped would be a much simpler change. It would however have the same problem in case of a failed unregister. Subsequent services do not matter. The HistoryEventHandler had a similar race - where the AM was being shutdown before the history handler was stopped. That was fixed by ensuring the history service was registered after the container allocator - so that it shuts down first. Handling the situation where the client thinks the job is successful, and the RM has no idea about the job seems like a separate Jira. Some way to have the RM pick up the state of the job if and when it restarts the next AM. ApplicationMaster may fail to remove staging directory -- Key: MAPREDUCE-4099 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4099 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.2 Reporter: Jason Lowe Assignee: Jason Lowe Priority: Critical Attachments: MAPREDUCE-4099.patch, MAPREDUCE-4099.patch When the ApplicationMaster shuts down it's supposed to remove the staging directory, assuming properties weren't set to override this behavior. During shutdown the AM tells the ResourceManager that it has finished before it cleans up the staging directory. However upon hearing the AM has finished, the RM turns right around and kills the AM container. If the AM is too slow, the AM will be killed before the staging directory is removed. We're seeing the AM lose this race fairly consistently on our clusters, and the lack of staging directory cleanup quickly leads to filesystem quota issues for some users. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3493) Add the default mapreduce.shuffle.port property to mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-3493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250431#comment-13250431 ] madhukara phatak commented on MAPREDUCE-3493: - As per my knowledge shuffle handler is used by reducer to copy the intermediate map results from mapper nodes to the node where reducer runs. is that correct ? Add the default mapreduce.shuffle.port property to mapred-default.xml - Key: MAPREDUCE-3493 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3493 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Priority: Minor Labels: newbie Attachments: MAPREDUCE-3493-1.patch, MAPREDUCE-3493.patch I faced this issue when trying to run multiple Hadoop MR2 instances on the same node. The default value for this property is hardcoded in the ShuffleHandler.java class so it results in port conflicts. The issue is resolved if you set the property value in your conf files. But the absence of this property from *-default.xml files is confusing. So It'll be cleaner to move this property to mapred-default.xml, so its default value can be easily identified and changed if needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAPREDUCE-3493) Add the default mapreduce.shuffle.port property to mapred-default.xml
[ https://issues.apache.org/jira/browse/MAPREDUCE-3493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13250449#comment-13250449 ] Harsh J commented on MAPREDUCE-3493: Yes thats right, but it is also a service rather than an instance every reducer runs. So perhaps a better description could be Default port that the ShuffleHandler will run on. ShuffleHandler is a service run at the NodeManager, to facilitate transfers of intermediate Map outputs to requesting Reducers. Add the default mapreduce.shuffle.port property to mapred-default.xml - Key: MAPREDUCE-3493 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3493 Project: Hadoop Map/Reduce Issue Type: Bug Components: mrv2 Affects Versions: 0.23.0 Reporter: Ahmed Radwan Assignee: Ahmed Radwan Priority: Minor Labels: newbie Attachments: MAPREDUCE-3493-1.patch, MAPREDUCE-3493.patch I faced this issue when trying to run multiple Hadoop MR2 instances on the same node. The default value for this property is hardcoded in the ShuffleHandler.java class so it results in port conflicts. The issue is resolved if you set the property value in your conf files. But the absence of this property from *-default.xml files is confusing. So It'll be cleaner to move this property to mapred-default.xml, so its default value can be easily identified and changed if needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira