[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harsh J updated MAPREDUCE-4464: --- Attachment: MAPREDUCE-4464.patch I improved the message in Clint's patch a slight bit to indicate what to look at. And I could also successfully reproduce the issue on a forced bad hostname machine (devel_vm.vm): {code} 12/09/27 21:52:16 INFO mapred.JobClient: Task Id : attempt_201209272149_0001_r_00_2, Status : FAILED Error: java.io.IOException: Invalid hostname found in tracker location: 'http://devel_vm.vm:50060' at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCompletionEvents(ReduceTask.java:2920) at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(ReduceTask.java:2845) {code} Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Bug Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Assignee: Clint Heath Priority: Minor Attachments: MAPREDUCE-4464_new.patch, MAPREDUCE-4464.patch, MAPREDUCE-4464.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harsh J updated MAPREDUCE-4464: --- Issue Type: Improvement (was: Bug) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Improvement Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Assignee: Clint Heath Priority: Minor Fix For: 1.2.0 Attachments: MAPREDUCE-4464_new.patch, MAPREDUCE-4464.patch, MAPREDUCE-4464.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Harsh J updated MAPREDUCE-4464: --- Resolution: Fixed Fix Version/s: 1.2.0 Status: Resolved (was: Patch Available) I've committed this to branch-1. Thanks very much for the report, your keen eye for issues, and your patch contributions Clint! Hope to see more in future! Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Improvement Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Assignee: Clint Heath Priority: Minor Fix For: 1.2.0 Attachments: MAPREDUCE-4464_new.patch, MAPREDUCE-4464.patch, MAPREDUCE-4464.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Clint Heath updated MAPREDUCE-4464: --- Hadoop Flags: Reviewed Status: Patch Available (was: Open) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Bug Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Assignee: Clint Heath Priority: Minor Attachments: MAPREDUCE-4464_new.patch, MAPREDUCE-4464.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Clint Heath updated MAPREDUCE-4464: --- Attachment: MAPREDUCE-4464_new.patch I have just attached the new patch which throws an IO Exception instead. Would it be possible to request that this JIRA be assigned to me? I'd like to get this resolved as quickly as possible. Thank you. Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Bug Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Priority: Minor Attachments: MAPREDUCE-4464.patch, MAPREDUCE-4464_new.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4464) Reduce tasks failing with NullPointerException in ConcurrentHashMap.get()
[ https://issues.apache.org/jira/browse/MAPREDUCE-4464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Clint Heath updated MAPREDUCE-4464: --- Attachment: MAPREDUCE-4464.patch Reduce tasks failing with NullPointerException in ConcurrentHashMap.get() - Key: MAPREDUCE-4464 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4464 Project: Hadoop Map/Reduce Issue Type: Bug Components: task Affects Versions: 1.0.0 Reporter: Clint Heath Priority: Minor Attachments: MAPREDUCE-4464.patch Original Estimate: 1h Remaining Estimate: 1h If DNS does not resolve hostnames properly, reduce tasks can fail with a very misleading exception. as per my peer Ahmed's diagnosis: In ReduceTask, it seems that event.getTaskTrackerHttp() returns a malformed URI, and so host from: {code} String host = u.getHost(); {code} is evaluated to null and the NullPointerException is thrown afterwards in the ConcurrentHashMap. I have written a patch to check for a null hostname condition when getHost is called in the getMapCompletionEvents method and print an intelligible warning message rather than suppressing it until later when it becomes confusing and misleading. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira