[jira] [Updated] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Bhallamudi Venkata Siva Kamesh (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bhallamudi Venkata Siva Kamesh updated MAPREDUCE-3360:
--

Attachment: MAPREDUCE-3360-5.patch

Addressing above comments

 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Bhallamudi Venkata Siva Kamesh (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bhallamudi Venkata Siva Kamesh updated MAPREDUCE-3360:
--

Status: Open  (was: Patch Available)

 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-2640) The maxRunningTasks property of the LimitTasksPerJob scheduler is ambiguous in its description, and must be updated

2012-01-09 Thread Harsh J (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Harsh J updated MAPREDUCE-2640:
---

   Resolution: Not A Problem
Fix Version/s: (was: 0.24.0)
   Status: Resolved  (was: Patch Available)

No longer a problem in MR2. I guess we can close this.

 The maxRunningTasks property of the LimitTasksPerJob scheduler is ambiguous 
 in its description, and must be updated
 ---

 Key: MAPREDUCE-2640
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2640
 Project: Hadoop Map/Reduce
  Issue Type: Task
  Components: documentation
Affects Versions: 0.20.2
Reporter: Harsh J
Assignee: Harsh J
  Labels: doc
 Attachments: 0.22-MAPREDUCE-2640.r1.diff, 
 0.23-MAPREDUCE-2640.r1.diff, MAPREDUCE-2640.r1.diff


 The property's entry in mapred-default.xml is like so:
 {code}
 property
   namemapreduce.jobtracker.taskscheduler.maxrunningtasks.perjob/name
   value/value
   descriptionThe maximum number of running tasks for a job before
   it gets preempted. No limits if undefined.
   /description
 /property
 {code}
 There is no mention that this is a property exclusive to the LimitTasksPerJob 
 scheduler alone. The doc ought to be updated to note that unless there's a 
 plan of reusing such a property (I do not see fair or capacity schedulers 
 utilizing this, and they use their own configs).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Bhallamudi Venkata Siva Kamesh (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182469#comment-13182469
 ] 

Bhallamudi Venkata Siva Kamesh commented on MAPREDUCE-3354:
---

Jonathan,
minor comments from the patch. 

{noformat}usage=Usage: mapreduce-daemon.sh [--config conf-dir] [--hosts 
hostlistfile] (start|stop) yarn-command {noformat}

I *think*, instead of mapreduce-daemon.sh, it should be mapred-daemon.sh in 
mapred-daemon.sh. Moreover after inclusion of this patch, it will be  nomore a 
yarn-command, it will be a mapred-command.

{noformat}$ $YARN_HOME/bin/mapred stop historyserver --config $HADOOP_CONF_DIR 
{noformat}
I *think*, we should replace HADOOP_CONF_DIR by YARN_CONF_DIR and mapred by 
mapred-daemon.sh in ClusterSetup.apt.vm.

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-3619) Change streaming code to use new mapreduce api.

2012-01-09 Thread Liyin Liang (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liyin Liang resolved MAPREDUCE-3619.


Resolution: Duplicate

 Change streaming code to use new mapreduce api.
 ---

 Key: MAPREDUCE-3619
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3619
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/streaming, mrv2
Affects Versions: 0.23.1
Reporter: Liyin Liang

 If we run a streaming job with following python script as mapper or reducer, 
 the job will throws NullPointerException.
 {code:}
 #!/usr/bin/python
 import sys,os
 class MyTask:
   def __init__(self, file=sys.stdin):
 self.file = file
 print sys.stderr, reporter:counter:spam,disp_flag_record,0
 print sys.stderr, reporter:counter:spam,spam_record,0
   def process(self):
 while True:
   line = self.file.readline()
   if not line:
 break;
   print line
 if __name__ == __main__:
   task = MyTask()
   task.process()
 {code}
 Here is the NPE related log:
 2011-12-22 14:14:06,310 WARN org.apache.hadoop.streaming.PipeMapRed: 
 java.lang.NullPointerException
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.incrCounter(PipeMapRed.java:502)
   at 
 org.apache.hadoop.streaming.PipeMapRed$MRErrorThread.run(PipeMapRed.java:444)
 This is because the above script's print sys.stderr will invoke 
 reporter.incrCounter() during PipeMapper|PipeReducer.configure(). While we 
 can not get reporter in configure() function. 
 To fix this problem, we should change streaming code to use new-api. Then we 
 can call context.getCounter() in Mapper|Reducer.setup() function.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Jason Lowe (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182544#comment-13182544
 ] 

Jason Lowe commented on MAPREDUCE-3360:
---

+1, looks good to me.

 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Jason Lowe (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3360:
--

Target Version/s: 0.23.1, 0.24.0  (was: 0.23.1)
  Status: Patch Available  (was: Open)

 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182549#comment-13182549
 ] 

Thomas Graves commented on MAPREDUCE-3299:
--

Hey Jon, 

It looks like there is a pre-existing bug (that I introduced) in the HS 
attempts code that you picked up when you copied it over.  The nodeId has the 
wrong port.  It has the http port instead of the normal node manager port 
(should be using getNodeManagerPort()).  Would you mind fixing both of those 
(app master and history server) here?  Otherwise we can file a separate jira 
for the HS stuff.

Everything else looks good.

Thanks.

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182573#comment-13182573
 ] 

Hadoop QA commented on MAPREDUCE-3360:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12509882/MAPREDUCE-3360-5.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 24 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1568//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1568//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1568//console

This message is automatically generated.

 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Robert Joseph Evans (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182592#comment-13182592
 ] 

Robert Joseph Evans commented on MAPREDUCE-3511:


I have a few comments about the patch.  Overall it looks very good.
 
If the old counter API is going to be the long term fix then perhaps we should 
not mark is as deprecated any more.  If not then can we file a new JIRA to 
update the new Counters to be as space efficient as the old ones. 

This is a minor performance improvement, but in many of the HistoryEvents there 
is a datum, that has had most of its fields replicated inside the event itself. 
 I guess this is because they are in slightly different formats now.  But we 
still create an instance of datum in the constructor, and always populate its 
fields inside getDatum.  These event objects are imutable, outside of the 
setDatum method that is only to be used when de-serializing the event.  
getDatum, however, tends to be called repeatedly to pull out individual fields 
from the datum.  I would prefer to see the datum start out as null, and only 
have its fields set if it is not null, inside getDatum.

Why was TokenCache.java modified at all? It does not seem to be related to this 
JIRA.

You added a TODO in CompletedTask.java and CompletedJob.java {code}// TODO: 
Make sure.{code} Did you make sure yet? if so please delete the TODO.

Also good catch on TestHsWebServicesTasks.java, TestAMWebServicesAttempts.java, 
TestHsWebServicesTasks.java and the others.  I also don't think we need any 
more tests because, all we are doing is reducing memory usage, which is very 
hard to write a unit test for. 

Inside JobHistoryEventHandler.java you added in {code}// TODO: Only 
job-counters is enough? How about the myriad clones in this code-path.{code} is 
this TODO still needed?

Like I said before, overall the patch looks really good and I only have some 
minor comments, thanks for doing this.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3299:
---

Status: Open  (was: Patch Available)

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3299:
---

Attachment: MAPREDUCE-3299.patch

Thanks, Tom. Fixed nodeid for AM and Hs with latest patch.

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3299:
---

Status: Patch Available  (was: Open)

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Arun Prakash (Created) (JIRA)
Reducer Job always goes to pending state.
-

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor


I am using hadoop streaming command for map-reduce.In my job there are 4 
streaming commands,i have no issues with first 3 steps but step4 job once 
mapper got  100% completed  directly going to pending state,even there was no 
error in the job tracker.

In the step4,i am merging the output(i.e step2 and step 3 output) as single 
output using identity reducer.
I have ran this job several times there was no issues but i am facing the 
problem right now.

Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Robert Joseph Evans (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182608#comment-13182608
 ] 

Robert Joseph Evans commented on MAPREDUCE-3643:


What version is this filed against?  Can you update the affects version field?

 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor

 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Arun Prakash (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182613#comment-13182613
 ] 

Arun Prakash commented on MAPREDUCE-3643:
-

I am using the hadoop version 0.20,i couldn't find the exact version in the 
field.

 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor

 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3354:
---

Attachment: MAPREDUCE-3354.patch

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3354:
---

Status: Open  (was: Patch Available)

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Jonathan Eagles (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Eagles updated MAPREDUCE-3354:
---

Status: Patch Available  (was: Open)

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Robert Joseph Evans (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182616#comment-13182616
 ] 

Robert Joseph Evans commented on MAPREDUCE-3643:


Is there any more information you can give on this?  How reproducible is it?  
It sounds like it is kind of sporadic.  So the reducer is the identity reducer, 
what about the mapper.  Do you any logs that you can upload.  The JT log for 
the time of the error would be very helpful.

 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor

 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Jonathan Eagles (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182615#comment-13182615
 ] 

Jonathan Eagles commented on MAPREDUCE-3354:


Thanks for the comments. I have fixed the issues you have addressed. I have 
left historyserver being started by mapred since i think that better reflects 
the recommended usage.

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182618#comment-13182618
 ] 

Hadoop QA commented on MAPREDUCE-3354:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509929/MAPREDUCE-3354.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1570//console

This message is automatically generated.

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3354) JobHistoryServer should be started by bin/mapred and not by bin/yarn

2012-01-09 Thread Jonathan Eagles (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182627#comment-13182627
 ] 

Jonathan Eagles commented on MAPREDUCE-3354:


Patch modifies hadoop-assemblies so patch apply failure is expected.

 JobHistoryServer should be started by bin/mapred and not by bin/yarn
 

 Key: MAPREDUCE-3354
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3354
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobhistoryserver, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Jonathan Eagles
Priority: Blocker
 Attachments: MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, MAPREDUCE-3354.patch, 
 MAPREDUCE-3354.patch, MAPREDUCE-3354.patch


 JobHistoryServer belongs to mapreduce land.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Arun Prakash (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun Prakash updated MAPREDUCE-3643:


Attachment: snap.JPG

 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor
 Attachments: snap.JPG


 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Arun Prakash (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182631#comment-13182631
 ] 

Arun Prakash commented on MAPREDUCE-3643:
-

I have attached the snapshot for the job tracker and below i mentioned the 
console output.

12/01/09 11:36:29 INFO streaming.StreamJob:  map 0%  reduce 0%
12/01/09 11:36:34 INFO streaming.StreamJob:  map 100%  reduce 0%

Streaming command for step4

hadoop fs -rmr $OUTPUT
hadoop jar $HADOOP_HOME/contrib/streaming/hadoop-streaming-*.jar \
  -D mapred.job.name=$NAME \
  -D mapred.reduce.tasks=1 \
  -D mapred.task.timeout=$TIMEOUT \
  -input $INPUT1 \
  -input $INPUT2 \
  -mapper /bin/awk 'gsub(/\t/, \,\)' \
  -reducer /bin/cat \
  -output $OUTPUT


i have used the /bin/cat instead of identity reducer and mapper


 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor
 Attachments: snap.JPG


 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3643) Reducer Job always goes to pending state.

2012-01-09 Thread Robert Joseph Evans (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182633#comment-13182633
 ] 

Robert Joseph Evans commented on MAPREDUCE-3643:


This is unrelated, but for your information having {code}-reducer NONE{code} or 
not including a -reducer command is going to be faster then using 
{code}-reducer /bin/cat{code} as the streaming code will not bother to stream 
the data through cat which will reduce the IO and processing needed. 

 Reducer Job always goes to pending state.
 -

 Key: MAPREDUCE-3643
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3643
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Arun Prakash
Priority: Minor
 Attachments: snap.JPG


 I am using hadoop streaming command for map-reduce.In my job there are 4 
 streaming commands,i have no issues with first 3 steps but step4 job once 
 mapper got  100% completed  directly going to pending state,even there was no 
 error in the job tracker.
 In the step4,i am merging the output(i.e step2 and step 3 output) as single 
 output using identity reducer.
 I have ran this job several times there was no issues but i am facing the 
 problem right now.
 Any suggestions!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182641#comment-13182641
 ] 

Hadoop QA commented on MAPREDUCE-3299:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509926/MAPREDUCE-3299.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1569//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1569//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1569//console

This message is automatically generated.

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3360) Provide information about lost nodes in the UI.

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182656#comment-13182656
 ] 

Thomas Graves commented on MAPREDUCE-3360:
--

A couple of comments:

RMWebServices.java
- minor nit: I would prefer the web services to output empty string rather then 
N/A for the nodeHttpAddress. It seems more programmatic and I tried to make 
that change consistent throughout.
- I think you can use the nid.getHost() instead of doing the split on : to 
get the hostName.

MetricsOverviewTable.java
- I think we need to change ClusterMetricsInfo.java to match if this field is 
Active instead of Total.  It has a field totalNodes which appears to really be 
active nodes. So I think we should add a new activeNodes field and then make 
totalNodes the true total nodes of the cluster - active + rebooted + lost + 
decommissioned.

ClusterMetrics.java
- do we want to change the heading output to match active nodes?
  @Metric(# of NMs) MutableGaugeInt numNMs;




 Provide information about lost nodes in the UI.
 ---

 Key: MAPREDUCE-3360
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3360
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
 Environment: NA
Reporter: Bhallamudi Venkata Siva Kamesh
Assignee: Bhallamudi Venkata Siva Kamesh
Priority: Critical
 Attachments: LostNodes.png, MAPREDUCE-3360-1.patch, 
 MAPREDUCE-3360-2.patch, MAPREDUCE-3360-3.patch, MAPREDUCE-3360-4.patch, 
 MAPREDUCE-3360-5.patch, MAPREDUCE-3360.patch, lostNodes.png


 Currently there is no information provided about *lost nodes*. Provide 
 information in the UI. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Jonathan Eagles (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182661#comment-13182661
 ] 

Jonathan Eagles commented on MAPREDUCE-3299:


Findbugs ContainerRemoteLaunchEvent defines equals but not hashCode is 
pre-existing and not related to this patch

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3299) Add AMInfo table to the AM job page

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182662#comment-13182662
 ] 

Thomas Graves commented on MAPREDUCE-3299:
--

+1 lgtm

 Add AMInfo table to the AM job page
 ---

 Key: MAPREDUCE-3299
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3299
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Jonathan Eagles
Priority: Minor
 Attachments: MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, MAPREDUCE-3299.patch, 
 MAPREDUCE-3299.patch, MAPREDUCE-3299.patch


 JobHistory has a table to list all AMs. A similar table can be added to the 
 AM for info on past failed AMs and the current running one.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3427) streaming tests fail with MR2

2012-01-09 Thread Alejandro Abdelnur (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182691#comment-13182691
 ] 

Alejandro Abdelnur commented on MAPREDUCE-3427:
---

Mahadev,

the tests you are see failing, those were tweaked and passing with HADOOP-7590. 
Would you check if any of those changes have been mistakenly reverted?

 streaming tests fail with MR2
 -

 Key: MAPREDUCE-3427
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3427
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: contrib/streaming, mrv2
Affects Versions: 0.23.1, 0.24.0
Reporter: Alejandro Abdelnur
Priority: Blocker
 Fix For: 0.23.1, 0.24.0


 After Mavenizing streaming and getting its testcases to use the MiniMRCluster 
 wrapper (MAPREDUCE-3169), 4 testcases fail to pass.
 Following is an assessment of those failures. Note that the testcases have 
 been tweaked only to set the streaming JAR and yarn as the  framework.
  
 (If these issues are unrelated we should create sub-tasks for each one of 
 them).
 *TestStreamingCombiner*, fails because returned counters don't match 
 assertion. However, counters printed in the test output indicate values that 
 would satisfy the assertion. As Tom has indicated it seems MR/YARN are not 
 passing back counter information to the client API.
 *TestStreamingBadRecords*, the job is failing with the following exception
 {code}
 Application application_1321575850006_0001 failed 1 times due to AM Container 
 for 
 appattempt_1321575850006_0001_01 exited with  exitCode: 127 due to: 
 .Failing this attempt.. Failing the application.
 {code}
 Difficult to troubleshoot because there are not task logs from Mini MR/YARN  
 run.
 *TestStreamingStatus* fails in validateTaskStatus() in the following assertion
 {code}
 expected:[before consuming input  sort] but was:[SUCCEEDED]
 {code}
 *TestUlimit* fails with
 {code}
 org.junit.ComparisonFailure: output is wrong expected:[786432] but 
 was:[unlimited]
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3511:
---

Status: Open  (was: Patch Available)

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3505) yarn APPLICATION_CLASSPATH needs to be overridable

2012-01-09 Thread Commented

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182747#comment-13182747
 ] 

Bruno Mahé commented on MAPREDUCE-3505:
---

+1 to the patch.
It solves a big issue I have had to hack around. This patch makes it way better 
than the current situation and we can always improve it later on.

 yarn APPLICATION_CLASSPATH needs to be overridable
 --

 Key: MAPREDUCE-3505
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3505
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Bruno Mahé
Assignee: Ahmed Radwan
  Labels: bigtop
 Attachments: MAPREDUCE-3505.patch


 Right now MRApps sets the classpath to just being mrapp-generated-classpath, 
 its content and a hardcoded list of directories.
 If I understand correctly mrapp-generated-classpath is only there for testing 
 and may change or disappear at any time
 The list of hardcoded directories is defined in 
 hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-api/src/main/java/org/apache/hadoop/yarn/api/ApplicationConstants.java
  at line 92.
 For convenience, here is its current content:
 {noformat}
   /**
* Classpath for typical applications.
*/
   public static final String[] APPLICATION_CLASSPATH =
   new String[] {
 $HADOOP_CONF_DIR,
 $HADOOP_COMMON_HOME/share/hadoop/common/*,
 $HADOOP_COMMON_HOME/share/hadoop/common/lib/*,
 $HADOOP_HDFS_HOME/share/hadoop/hdfs/*,
 $HADOOP_HDFS_HOME/share/hadoop/hdfs/lib/*,
 $YARN_HOME/modules/*,
 $YARN_HOME/lib/*
   };
 {noformat}
 Not all deployment scenarii fit in this layout and therefore we need a 
 standardized way to customize this class path.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3511:
---

Attachment: MAPREDUCE-3511-20120109.txt

Thanks for looking into it, Robert!

bq. If the old counter API is going to be the long term fix then perhaps we 
should not mark is as deprecated any more.
The counters that I am now using are from {{mapreduce.Counters}} which aren't 
deprecated. Irrespective of this, it makes sense to undeprecate other old v1 
stuff (mapped.*), I'll propose and merge MAPREDUCE-1735 into trunk/23 also.

bq. [...] a minor performance improvement [...] I would prefer to see the datum 
start out as null, and only have its fields set if it is not null, inside 
getDatum.
Not sure if {{getDatum()}} will be called multiple times as each event will be 
logged only once. Makes sense to implement your proposal anyways just to be 
sure. Done.

bq. Why was TokenCache.java modified at all? It does not seem to be related to 
this JIRA.
It isn't. But without that, due to HADOOP-7963/MAPREDUCE-3639 the patch 
couldn't be tested on cluster. I'll revert those changes from the patch.

bq. You added a TODO in CompletedTask.java and CompletedJob.java
Done, added that to remind myself to avoid any clones :) Removing the clones 
now.

bq. good catch on TestHsWebServicesTasks.java, TestAMWebServicesAttempts.java, 
TestHsWebServicesTasks.java 
Am surprised Jenkins didn't catch these.

bq.  I also don't think we need any more tests because, all we are doing is 
reducing memory usage, which is very hard to write a unit test for.
Yes, +1 :)

bq. Inside JobHistoryEventHandler.java you added in // TODO: Only job-counters 
is enough? How about the myriad clones in this code-path. is this TODO still 
needed?
Avoided that extra clone for a finished Job.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3505) yarn APPLICATION_CLASSPATH needs to be overridable

2012-01-09 Thread Roman Shaposhnik (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182753#comment-13182753
 ] 

Roman Shaposhnik commented on MAPREDUCE-3505:
-

+1 on the current patch. Can we, please, have it committed? I understand the 
interest in a more general solution, but committing this would be an 
incremental positive step.

 yarn APPLICATION_CLASSPATH needs to be overridable
 --

 Key: MAPREDUCE-3505
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3505
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Bruno Mahé
Assignee: Ahmed Radwan
  Labels: bigtop
 Attachments: MAPREDUCE-3505.patch


 Right now MRApps sets the classpath to just being mrapp-generated-classpath, 
 its content and a hardcoded list of directories.
 If I understand correctly mrapp-generated-classpath is only there for testing 
 and may change or disappear at any time
 The list of hardcoded directories is defined in 
 hadoop-mapreduce-project/hadoop-yarn/hadoop-yarn-api/src/main/java/org/apache/hadoop/yarn/api/ApplicationConstants.java
  at line 92.
 For convenience, here is its current content:
 {noformat}
   /**
* Classpath for typical applications.
*/
   public static final String[] APPLICATION_CLASSPATH =
   new String[] {
 $HADOOP_CONF_DIR,
 $HADOOP_COMMON_HOME/share/hadoop/common/*,
 $HADOOP_COMMON_HOME/share/hadoop/common/lib/*,
 $HADOOP_HDFS_HOME/share/hadoop/hdfs/*,
 $HADOOP_HDFS_HOME/share/hadoop/hdfs/lib/*,
 $YARN_HOME/modules/*,
 $YARN_HOME/lib/*
   };
 {noformat}
 Not all deployment scenarii fit in this layout and therefore we need a 
 standardized way to customize this class path.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3511:
---

Status: Patch Available  (was: Open)

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Robert Joseph Evans (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182755#comment-13182755
 ] 

Robert Joseph Evans commented on MAPREDUCE-3511:


LGTM +1 (Non-binding)

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3644) Snapshot builds have confusing jar file names in share/hadoop/mapreduce in tarball

2012-01-09 Thread Andrew Bayer (Created) (JIRA)
Snapshot builds have confusing jar file names in share/hadoop/mapreduce in 
tarball
--

 Key: MAPREDUCE-3644
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3644
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build, mrv2
Affects Versions: 0.23.0
Reporter: Andrew Bayer


If you build a Hadoop tarball with a non-release version, the moduleSet used in 
hadoop-assemblies/src/main/resources/assemblies/hadoop-mapreduce-dist.xml 
results in jar files going into share/hadoop/mapreduce with unique snapshot 
versions - i.e., the timestamp they were built. This isn't an issue in release 
builds. It can be fixed by adding  
outputFileNameMapping${module.artifactId}-${project.version}${dashClassifier?}.${module.extension}/outputFileNameMapping
 to the binaries tag of the moduleSet.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3641) CapacityScheduler should be more conservative assigning off-switch requests

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3641:
---

Issue Type: Sub-task  (was: Bug)
Parent: MAPREDUCE-3561

 CapacityScheduler should be more conservative assigning off-switch requests
 ---

 Key: MAPREDUCE-3641
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3641
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mrv2, scheduler
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Arun C Murthy
Priority: Blocker
 Attachments: MAPREDUCE-3641.patch


 In hadoop-1, the CS is very conservative handing out off-switch assignments, 
 we need to do the same in YARN.
 We noticed performance regressions due to this, particularly for reduces.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3614) finalState UNDEFINED if AM is killed by hand

2012-01-09 Thread Ravi Prakash (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182760#comment-13182760
 ] 

Ravi Prakash commented on MAPREDUCE-3614:
-

The client would fail with this exception 

{noformat}
RemoteTrace: 
 at Local Trace: 
org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: 
Unknown job job_1326140392720_0001
at 
org.apache.hadoop.yarn.ipc.ProtoOverHadoopRpcEngine$Invoker.invoke(ProtoOverHadoopRpcEngine.java:151)
at $Proxy8.getTaskAttemptCompletionEvents(Unknown Source)
at 
org.apache.hadoop.mapreduce.v2.api.impl.pb.client.MRClientProtocolPBClientImpl.getTaskAttemptCompletionEvents(MRClientProtocolPBClientImpl.java:172)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.mapred.ClientServiceDelegate.invoke(ClientServiceDelegate.java:328)
at 
org.apache.hadoop.mapred.ClientServiceDelegate.getTaskCompletionEvents(ClientServiceDelegate.java:372)
at 
org.apache.hadoop.mapred.YARNRunner.getTaskCompletionEvents(YARNRunner.java:438)
at org.apache.hadoop.mapreduce.Job$5.run(Job.java:656)
at org.apache.hadoop.mapreduce.Job$5.run(Job.java:653)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1157)
at org.apache.hadoop.mapreduce.Job.getTaskCompletionEvents(Job.java:653)
at org.apache.hadoop.mapreduce.Job.monitorAndPrintJob(Job.java:1295)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:1235)
at org.apache.hadoop.examples.WordCount.main(WordCount.java:84)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:144)
at org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:68)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:200)
{noformat}



 finalState UNDEFINED if AM is killed by hand
 

 Key: MAPREDUCE-3614
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3614
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ravi Prakash
Assignee: Ravi Prakash

 Courtesy [~dcapwell]
 {quote}
 If the AM is running and you kill the process (sudo kill #pid), the State in 
 Yarn would be FINISHED and FinalStatus is UNDEFINED.  The Tracking UI would 
 say History and point to the proxy url (which will redirect to the history 
 server).
 The state should be more descriptive that the job failed and the tracker url 
 shouldn't point to the history server.
 {quote}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3312) Make MR AM not send a stopContainer w/o corresponding start container

2012-01-09 Thread Vinod Kumar Vavilapalli (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182764#comment-13182764
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3312:


Looking at the patch now...

 Make MR AM not send a stopContainer w/o corresponding start container
 -

 Key: MAPREDUCE-3312
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3312
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0, 0.24.0
Reporter: Robert Joseph Evans
Assignee: Robert Joseph Evans
 Fix For: 0.23.1, 0.24.0

 Attachments: MR-3312.txt, MR-3312.txt, MR-3312.txt


 This is a follow on to MAPREDUCE-3274.  It is possible, although rare, for 
 the MR AM to send a stop container before it sends a start container.  This 
 needs to stop that from happening.  If a stop is found first it should 
 prevent the start from being sent.  It tries to do this, but only if the stop 
 is currently pending.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3614) finalState UNDEFINED if AM is killed by hand

2012-01-09 Thread Ravi Prakash (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ravi Prakash updated MAPREDUCE-3614:


Attachment: MAPREDUCE-3614.branch-0.23.patch

I identified the problem to RMCommunicator.java:unregister() . The call

{noformat}
scheduler.finishApplicationMaster(request);
{noformat}
is what is throwing the Exception. I'm attaching a patch which prevents the 
exception from being thrown and the output on the UI and everywhere else seems 
to be what is desired, but the approach to me seems to be dicey. I'm 
essentially not making the call when finishState == JobState.FAILED. I doubt 
that is the right approach. I'll dig deeper into code to see what the right 
approach would be


 finalState UNDEFINED if AM is killed by hand
 

 Key: MAPREDUCE-3614
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3614
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Ravi Prakash
Assignee: Ravi Prakash
 Attachments: MAPREDUCE-3614.branch-0.23.patch


 Courtesy [~dcapwell]
 {quote}
 If the AM is running and you kill the process (sudo kill #pid), the State in 
 Yarn would be FINISHED and FinalStatus is UNDEFINED.  The Tracking UI would 
 say History and point to the proxy url (which will redirect to the history 
 server).
 The state should be more descriptive that the job failed and the tracker url 
 shouldn't point to the history server.
 {quote}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Siddharth Seth (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182773#comment-13182773
 ] 

Siddharth Seth commented on MAPREDUCE-3511:
---

Also, for jobs which use custom counters (non-optimized) - it would be good to 
document that the AM heap may need to be adjusted.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3596) Sort benchmark got hang after completion of 99% map phase

2012-01-09 Thread Siddharth Seth (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182780#comment-13182780
 ] 

Siddharth Seth commented on MAPREDUCE-3596:
---

From another set of logs, sequence of events.
1. AM calls a startContainer
2. NM receives this, starts processing but takes about 1minutes20 seconds to 
finish processing it.
3. Meanwhile, the AM times out the call after 1 minute - and sends a release 
container to the RM
4. RM ends up removing references to the container
5. The NM sends a containerStarted event to the RM - which ends up causing the 
NPE.

From a quick look at the code - if the AM release event had gone out after the 
NM containerStarted, things would've been handled.

 Sort benchmark got hang after completion of 99% map phase
 -

 Key: MAPREDUCE-3596
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3596
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Ravi Prakash
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Attachments: logs.tar.bz2, logs.tar.bz2


 Courtesy [~vinaythota]
 {quote}
 Ran sort benchmark couple of times and every time the job got hang after 
 completion 99% map phase. There are some map tasks failed. Also it's not 
 scheduled some of the pending map tasks.
 Cluster size is 350 nodes.
 Build Details:
 ==
 Compiled:   Fri Dec 9 16:25:27 PST 2011 by someone from 
 branches/branch-0.23/hadoop-common-project/hadoop-common 
 ResourceManager version:revision 1212681 by someone source checksum 
 on Fri Dec 9 16:52:07 PST 2011
 Hadoop version: revision 1212592 by someone Fri Dec 9 16:25:27 PST 
 2011
 {quote}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182791#comment-13182791
 ] 

Hadoop QA commented on MAPREDUCE-3511:
--

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12509947/MAPREDUCE-3511-20120109.txt
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 18 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1571//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1571//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1571//console

This message is automatically generated.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Jason Lowe (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3625:
--

Attachment: CapacityScheduler.png

Based on discussions with Arun, we decided to rework the UI a bit.  Attached is 
a sample screenshot of new capacity scheduler UI.

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Work started] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Jason Lowe (Work started) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on MAPREDUCE-3625 started by Jason Lowe.

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Vinod Kumar Vavilapalli (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182793#comment-13182793
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3511:


bq. Also, for jobs which use custom counters (non-optimized) - it would be good 
to document that the AM heap may need to be adjusted.
That's a good to have, I'll open a separate ticket to add a best-practices doc.

I am pushing this in.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3511:
---

  Resolution: Fixed
Release Note: Removed a multitude of cloned/duplicate counters in the AM 
thereby reducing the AM heap size and preventing full GCs.
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

Just committed this to trunk and branch-0.23.

 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Jason Lowe (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3625:
--

Attachment: MAPREDUCE-3625.patch

Patch reworks capacity scheduler UI to fix the reported problem and make the UI 
a bit easier to read in general.

Green now represents used capacity instead of spare capacity and always grows 
from left to right.  The color of the bar changes to orange if the used 
capacity goes over 100%, and it doesn't streak off the right end of the page as 
it did previously.

Also added a legend so it's clear what the colors indicate and added a used 
capacity column for quick reference.

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png, MAPREDUCE-3625.patch


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Jason Lowe (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe updated MAPREDUCE-3625:
--

Target Version/s: 0.23.1, 0.24.0  (was: 0.23.1)
  Status: Patch Available  (was: In Progress)

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png, MAPREDUCE-3625.patch


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/MapAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/ReduceAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTask.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTaskAttempt.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/PartialJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesAttempts.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesTasks.java


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3639:
--

Attachment: MR3639.txt

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Priority: Critical
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3639:
--

Assignee: Siddharth Seth
  Status: Patch Available  (was: Open)

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Critical
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3639:
--

Priority: Blocker  (was: Critical)

MR + viewFs will fail without this. Marking as blocker.

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/data_join
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/eclipse-plugin
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/index
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/vaidya
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/examples
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/fs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/hdfs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/FileBench.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/TestSequenceFileMergeProgress.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/ipc
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/security/authorize/TestServiceLevelAuthorization.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/test/MapredTestDriver.java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/webapps/job
* /hadoop/common/branches/branch-0.23/hadoop-project
* /hadoop/common/branches/branch-0.23/hadoop-project/src/site


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/data_join
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/eclipse-plugin
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/index
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/vaidya
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/examples
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/fs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/hdfs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/FileBench.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/TestSequenceFileMergeProgress.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/ipc
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/security/authorize/TestServiceLevelAuthorization.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/test/MapredTestDriver.java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/webapps/job
* /hadoop/common/branches/branch-0.23/hadoop-project
* /hadoop/common/branches/branch-0.23/hadoop-project/src/site


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/MapAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/ReduceAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTask.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTaskAttempt.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/PartialJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesAttempts.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesTasks.java


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3528:
--

Attachment: MR3528.txt

Trivial patch - introducing a new config param for the timeout check. Not 
including a unit test.

 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3528:
--

Assignee: Siddharth Seth
  Status: Patch Available  (was: Open)

 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/data_join
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/eclipse-plugin
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/index
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/contrib/vaidya
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/examples
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/fs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/hdfs
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/FileBench.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/io/TestSequenceFileMergeProgress.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/ipc
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/security/authorize/TestServiceLevelAuthorization.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/test/mapred/org/apache/hadoop/test/MapredTestDriver.java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/src/webapps/job
* /hadoop/common/branches/branch-0.23/hadoop-project
* /hadoop/common/branches/branch-0.23/hadoop-project/src/site


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3645) TestJobHistory fails

2012-01-09 Thread Thomas Graves (Created) (JIRA)
TestJobHistory fails


 Key: MAPREDUCE-3645
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3645
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Thomas Graves
Priority: Blocker
 Fix For: 0.23.0


TestJobHistory fails.

 org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS
 org.apache.hadoop.mapred.TestJobHistory.testDoneFolderNotOnDefaultFileSystem
 
 org.apache.hadoop.mapred.TestJobHistory.testHistoryFolderOnHDFS 
 org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile 

It looks like this was introduced by MAPREDUCE-3349 and the issue is that the 
test expects the hostname to be in the format rackname/hostname, but with 3349 
it split those apart into 2 different fields.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3645) TestJobHistory fails

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182825#comment-13182825
 ] 

Thomas Graves commented on MAPREDUCE-3645:
--

One of the errors with added debug into to print hostname and expected hostname

Host name : host1.foo.com of task attempt 
attempt_20120109210634113_0001_m_02_0 obtained from history file did not 
match the expected value 
/default-rack/host1.foo.comjunit.framework.AssertionFailedError: Host name : 
host1.foo.com of task attempt attempt_20120109210634113_0001_m_02_0 
obtained from history file did not match the expected value 
/default-rack/host1.foo.comat 
org.apache.hadoop.mapred.TestJobHistory.validateTaskAttemptLevelKeyValues(TestJobHistory.java:527)
at 
org.apache.hadoop.mapred.TestJobHistory.validateJobHistoryFileContent(TestJobHistory.java:595)
at 
org.apache.hadoop.mapred.TestJobHistory.runDoneFolderTest(TestJobHistory.java:751)
at 
org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS(TestJobHistory.java:621)

 TestJobHistory fails
 

 Key: MAPREDUCE-3645
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3645
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Thomas Graves
Priority: Blocker
 Fix For: 0.23.0


 TestJobHistory fails.
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS  
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderNotOnDefaultFileSystem

  org.apache.hadoop.mapred.TestJobHistory.testHistoryFolderOnHDFS   
  org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile 
 It looks like this was introduced by MAPREDUCE-3349 and the issue is that the 
 test expects the hostname to be in the format rackname/hostname, but with 
 3349 it split those apart into 2 different fields.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3511) Counters occupy a good part of AM heap

2012-01-09 Thread Hudson (Commented) (JIRA)
-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/JobFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/MapAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/ReduceAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskAttemptFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/jobhistory/TaskFinishedEvent.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTask.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/CompletedTaskAttempt.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/HistoryClientService.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/main/java/org/apache/hadoop/mapreduce/v2/hs/PartialJob.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesAttempts.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/src/test/java/org/apache/hadoop/mapreduce/v2/hs/webapp/TestHsWebServicesTasks.java


 Counters occupy a good part of AM heap
 --

 Key: MAPREDUCE-3511
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3511
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3511-20120107.1.txt, 
 MAPREDUCE-3511-20120109.txt


 Per task counters seem to be occupying a good part of an AMs heap. Looks like 
 more than 50% of what's used by a TaskAttemptImpl object.
 This could be optimized by interning strings or possibly using mrv1 counters 
 which are optimized. Currently counters are converted from mrv1 to mrv2 
 format for in memory storage. The conversion could be delayed till it's 
 actually required for RPC transfers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Vinod Kumar Vavilapalli (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182840#comment-13182840
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3639:


Trivial patch. +1. Waiting for Jenkins.

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182841#comment-13182841
 ] 

Hadoop QA commented on MAPREDUCE-3528:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509964/MR3528.txt
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed these unit tests:
  
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.TestContainersMonitor

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1574//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1574//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1574//console

This message is automatically generated.

 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182844#comment-13182844
 ] 

Hadoop QA commented on MAPREDUCE-3639:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509961/MR3639.txt
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1572//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1572//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1572//console

This message is automatically generated.

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182846#comment-13182846
 ] 

Hadoop QA commented on MAPREDUCE-3625:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509960/MAPREDUCE-3625.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1573//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1573//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1573//console

This message is automatically generated.

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png, MAPREDUCE-3625.patch


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182853#comment-13182853
 ] 

Thomas Graves commented on MAPREDUCE-3639:
--

this fixes the following tests then?

org.apache.hadoop.mapreduce.security.TestTokenCache.testGetTokensForHftpFS 
org.apache.hadoop.mapreduce.security.TestTokenCache.testGetTokensForViewFS 

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3625) CapacityScheduler web-ui display of queue's used capacity is broken

2012-01-09 Thread Jason Lowe (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182857#comment-13182857
 ] 

Jason Lowe commented on MAPREDUCE-3625:
---

Findbugs warnings are unrelated to this patch.

 CapacityScheduler web-ui display of queue's used capacity is broken
 ---

 Key: MAPREDUCE-3625
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3625
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Arun C Murthy
Assignee: Jason Lowe
Priority: Critical
 Attachments: CapacityScheduler.png, MAPREDUCE-3625.patch


 The display of the queue's used capacity at runtime is broken because it 
 display's 'used' relative to the queue's capacity and not the parent's 
 capacity as shown in the above attachment.
 The display should be relative to parent's capacity and not leaf queues as 
 everything else in the display is relative to parent's capacity.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Daryn Sharp (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182859#comment-13182859
 ] 

Daryn Sharp commented on MAPREDUCE-3639:


Hang on with the commit.  I don't think this patch is complete, need to cross 
check with 205.

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Siddharth Seth (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siddharth Seth updated MAPREDUCE-3616:
--

  Resolution: Fixed
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

+1. Committed to trunk and branch-0.23. Thanks Vinod.

 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3646) Remove redundant URL info from mapred job output

2012-01-09 Thread Ramya Sunil (Created) (JIRA)
Remove redundant URL info from mapred job output
--

 Key: MAPREDUCE-3646
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3646
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 0.23.1
Reporter: Ramya Sunil
Priority: Minor
 Fix For: 0.23.1


The URL information to track the job is printed for all the mapred jobmrv2 
commands. This information is redundant and has to be removed.

E.g:
{noformat}
-bash-3.2$ mapred job -list 

Total jobs:3
JobId   State   StartTime   UserNameQueue   PriorityMaps
Reduces UsedContainers  RsvdContainers  UsedMem RsvdMem NeededMem   AM info
12/01/09 22:20:15 INFO mapred.ClientServiceDelegate: The url to track the job: 
RM host:8088/proxy/application ID 1/
job ID 1  RUNNING 1326147596446   ramya  default NORMAL  10  10  21   
   0   22528M  0M  22528M  RM host:8088/proxy/application ID 1/
12/01/09 22:20:15 INFO mapred.ClientServiceDelegate: The url to track the job: 
RM host:8088/proxy/application ID 2/
job ID 2  RUNNING 1326147603726   ramya  default NORMAL  10  10  11   
   0   12288M  0M  12288M  RM host:8088/proxy/application ID 2/
12/01/09 22:20:16 INFO mapred.ClientServiceDelegate: The url to track the job: 
RM host:8088/proxy/application ID 3/
job ID 3  RUNNING 1326147520126   ramya  default NORMAL  10  10  21   
   0   22528M  0M  22528M  RM host:8088/proxy/application ID 3/
{noformat}


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182868#comment-13182868
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Common-trunk-Commit #1515 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/1515/])
MAPREDUCE-3616. Thread pool for launching containers in MR AM not expanding 
as expected. (Contributed by Vinod Kumar Vavilapalli)

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229394
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (MAPREDUCE-3647) Pipes job fails with Illegal text protocol

2012-01-09 Thread Ramya Sunil (Created) (JIRA)
Pipes job fails with Illegal text protocol


 Key: MAPREDUCE-3647
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3647
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: pipes
Affects Versions: 0.23.1
Reporter: Ramya Sunil
 Fix For: 0.23.1


Pipes job fail with Hadoop Pipes Exception: Illegal text protocol command

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Assigned] (MAPREDUCE-3645) TestJobHistory fails

2012-01-09 Thread Thomas Graves (Assigned) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves reassigned MAPREDUCE-3645:


Assignee: Thomas Graves

 TestJobHistory fails
 

 Key: MAPREDUCE-3645
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3645
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Thomas Graves
Assignee: Thomas Graves
Priority: Blocker
 Fix For: 0.23.0

 Attachments: MAPREDUCE-3645.patch


 TestJobHistory fails.
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS  
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderNotOnDefaultFileSystem

  org.apache.hadoop.mapred.TestJobHistory.testHistoryFolderOnHDFS   
  org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile 
 It looks like this was introduced by MAPREDUCE-3349 and the issue is that the 
 test expects the hostname to be in the format rackname/hostname, but with 
 3349 it split those apart into 2 different fields.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Daryn Sharp (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182871#comment-13182871
 ] 

Daryn Sharp commented on MAPREDUCE-3639:


{{TokenCache#obtainTokensForNamenodesInternal}} needs the same null check on 
{{getCanonicalServiceName}} as 205 does.

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182873#comment-13182873
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Hdfs-trunk-Commit #1588 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/1588/])
MAPREDUCE-3616. Thread pool for launching containers in MR AM not expanding 
as expected. (Contributed by Vinod Kumar Vavilapalli)

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229394
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3645) TestJobHistory fails

2012-01-09 Thread Thomas Graves (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated MAPREDUCE-3645:
-

Attachment: MAPREDUCE-3645.patch

 TestJobHistory fails
 

 Key: MAPREDUCE-3645
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3645
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Thomas Graves
Assignee: Thomas Graves
Priority: Blocker
 Fix For: 0.23.0

 Attachments: MAPREDUCE-3645.patch


 TestJobHistory fails.
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS  
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderNotOnDefaultFileSystem

  org.apache.hadoop.mapred.TestJobHistory.testHistoryFolderOnHDFS   
  org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile 
 It looks like this was introduced by MAPREDUCE-3349 and the issue is that the 
 test expects the hostname to be in the format rackname/hostname, but with 
 3349 it split those apart into 2 different fields.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Vinod Kumar Vavilapalli (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182874#comment-13182874
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-3528:


Neither the findBugs warning, nor the test-failure is related to this patch.

Trivial patch replacing the interval configuration. So, no tests.

+1. Pushing this in.

 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182878#comment-13182878
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Hdfs-0.23-Commit #345 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-0.23-Commit/345/])
merge MAPREDUCE-3616 from trunk

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229395
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182877#comment-13182877
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Common-0.23-Commit #355 (See 
[https://builds.apache.org/job/Hadoop-Common-0.23-Commit/355/])
merge MAPREDUCE-3616 from trunk

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229395
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Vinod Kumar Vavilapalli (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3528:
---

   Resolution: Fixed
Fix Version/s: 0.23.1
 Release Note: Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property.
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

I just committed this to trunk and branch 0.23. Thanks Sid!

 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182888#comment-13182888
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Hdfs-0.23-Commit #346 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-0.23-Commit/346/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
Contributed by Siddharth Seth.
svn merge --ignore-ancestry -c 1229403 ../../trunk/

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229405
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3646) Remove redundant URL info from mapred job output

2012-01-09 Thread Jonathan Eagles (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182887#comment-13182887
 ] 

Jonathan Eagles commented on MAPREDUCE-3646:


This information is printed for both map tasks and reduces tasks per job.

path is 
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/tools/CLI.java
 (displayJobList-getTaskReports)


hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/main/java/org/apache/hadoop/mapred/ClientServiceDelegate.java
 (getTaskReports-invoke-getProxy)

getProxy always print tracking url to screen.

 Remove redundant URL info from mapred job output
 --

 Key: MAPREDUCE-3646
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3646
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: client
Affects Versions: 0.23.1
Reporter: Ramya Sunil
Priority: Minor
 Fix For: 0.23.1


 The URL information to track the job is printed for all the mapred jobmrv2 
 commands. This information is redundant and has to be removed.
 E.g:
 {noformat}
 -bash-3.2$ mapred job -list 
 Total jobs:3
 JobId   State   StartTime   UserNameQueue   PriorityMaps  
   Reduces UsedContainers  RsvdContainers  UsedMem RsvdMem NeededMem   AM 
 info
 12/01/09 22:20:15 INFO mapred.ClientServiceDelegate: The url to track the 
 job: RM host:8088/proxy/application ID 1/
 job ID 1  RUNNING 1326147596446   ramya  default NORMAL  10  10  21 
  0   22528M  0M  22528M  RM host:8088/proxy/application ID 1/
 12/01/09 22:20:15 INFO mapred.ClientServiceDelegate: The url to track the 
 job: RM host:8088/proxy/application ID 2/
 job ID 2  RUNNING 1326147603726   ramya  default NORMAL  10  10  11 
  0   12288M  0M  12288M  RM host:8088/proxy/application ID 2/
 12/01/09 22:20:16 INFO mapred.ClientServiceDelegate: The url to track the 
 job: RM host:8088/proxy/application ID 3/
 job ID 3  RUNNING 1326147520126   ramya  default NORMAL  10  10  21 
  0   22528M  0M  22528M  RM host:8088/proxy/application ID 3/
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182890#comment-13182890
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Common-trunk-Commit #1516 (See 
[https://builds.apache.org/job/Hadoop-Common-trunk-Commit/1516/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
(Siddharth Seth via vinodkv)

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229403
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Siddharth Seth (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182892#comment-13182892
 ] 

Siddharth Seth commented on MAPREDUCE-3639:
---

@Daryn, That check should not be present. getCanonicalServiceName will return 
null for filesystems like viewfs. With that additional check - TokenCache will 
not get tokens for the underlying filesystems.

@Thomas - didn't run the ant tests earlier. Looks like testGetTokensForViewFS 
is fixed, but HftpFS is not. Do you know when these tests started failing ?

 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182897#comment-13182897
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Mapreduce-0.23-Commit #366 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-0.23-Commit/366/])
merge MAPREDUCE-3616 from trunk

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229395
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182894#comment-13182894
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Hdfs-trunk-Commit #1589 (See 
[https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/1589/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
(Siddharth Seth via vinodkv)

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229403
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182896#comment-13182896
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Common-0.23-Commit #356 (See 
[https://builds.apache.org/job/Hadoop-Common-0.23-Commit/356/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
Contributed by Siddharth Seth.
svn merge --ignore-ancestry -c 1229403 ../../trunk/

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229405
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3644) Snapshot builds have confusing jar file names in share/hadoop/mapreduce in tarball

2012-01-09 Thread Eli Collins (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eli Collins updated MAPREDUCE-3644:
---

Labels: bigtop  (was: )

 Snapshot builds have confusing jar file names in share/hadoop/mapreduce in 
 tarball
 --

 Key: MAPREDUCE-3644
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3644
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build, mrv2
Affects Versions: 0.23.0
Reporter: Andrew Bayer
  Labels: bigtop

 If you build a Hadoop tarball with a non-release version, the moduleSet used 
 in hadoop-assemblies/src/main/resources/assemblies/hadoop-mapreduce-dist.xml 
 results in jar files going into share/hadoop/mapreduce with unique snapshot 
 versions - i.e., the timestamp they were built. This isn't an issue in 
 release builds. It can be fixed by adding  
 outputFileNameMapping${module.artifactId}-${project.version}${dashClassifier?}.${module.extension}/outputFileNameMapping
  to the binaries tag of the moduleSet.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Thomas Graves (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182903#comment-13182903
 ] 

Thomas Graves commented on MAPREDUCE-3639:
--

Looks like the Jan 5th or 6th. I saw this and was hoping it fixed them.  If you 
think the other is different issue we can file separate jira.



 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3596) Sort benchmark got hang after completion of 99% map phase

2012-01-09 Thread Siddharth Seth (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182904#comment-13182904
 ] 

Siddharth Seth commented on MAPREDUCE-3596:
---

For the NM taking a long time to process a startContainer call - would be 
interesting to see if changing RPC thread priorities will make a difference, or 
if RPC traffic can be prioritized over shuffle. Also, trying to reduce what the 
startContainer call does in the NM.

 Sort benchmark got hang after completion of 99% map phase
 -

 Key: MAPREDUCE-3596
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3596
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Ravi Prakash
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Attachments: logs.tar.bz2, logs.tar.bz2


 Courtesy [~vinaythota]
 {quote}
 Ran sort benchmark couple of times and every time the job got hang after 
 completion 99% map phase. There are some map tasks failed. Also it's not 
 scheduled some of the pending map tasks.
 Cluster size is 350 nodes.
 Build Details:
 ==
 Compiled:   Fri Dec 9 16:25:27 PST 2011 by someone from 
 branches/branch-0.23/hadoop-common-project/hadoop-common 
 ResourceManager version:revision 1212681 by someone source checksum 
 on Fri Dec 9 16:52:07 PST 2011
 Hadoop version: revision 1212592 by someone Fri Dec 9 16:25:27 PST 
 2011
 {quote}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3639) TokenCache likely broken for FileSystems which don't issue delegation tokens

2012-01-09 Thread Siddharth Seth (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182905#comment-13182905
 ] 

Siddharth Seth commented on MAPREDUCE-3639:
---

From running ant -Dtestcase=TestTokenCache. I would lean towards fixing that 
via another jira.
{{nofromat}}
Testcase: testTokenCache took 12.438 sec
Testcase: testLocalJobTokenCache took 13.397 sec
Testcase: testGetTokensForNamenodes took 0.002 sec
Testcase: testGetTokensForHftpFS took 0.16 sec
  Caused an ERROR
java.net.UnknownHostException: host 
java.lang.IllegalArgumentException: java.net.UnknownHostException: host 
  at 
org.apache.hadoop.security.SecurityUtil.buildTokenService(SecurityUtil.java:425)
  at 
org.apache.hadoop.security.SecurityUtil.buildDTServiceName(SecurityUtil.java:308)
  at 
org.apache.hadoop.mapreduce.security.TestTokenCache.testGetTokensForHftpFS(TestTokenCache.java:324)
Caused by: java.net.UnknownHostException: host 

Testcase: testGetJTPrincipal took 0.011 sec
Testcase: testGetTokensForViewFS took 0.01 sec
{{nofromat}}


 TokenCache likely broken for FileSystems which don't issue delegation tokens
 

 Key: MAPREDUCE-3639
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3639
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Priority: Blocker
 Attachments: MR3639.txt


 Ref HADOOP-7963.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (MAPREDUCE-3647) Pipes job fails with Illegal text protocol

2012-01-09 Thread Mahadev konar (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-3647:
-

Component/s: mrv2
   Priority: Critical  (was: Major)

 Pipes job fails with Illegal text protocol
 

 Key: MAPREDUCE-3647
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3647
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv2, pipes
Affects Versions: 0.23.1
Reporter: Ramya Sunil
Priority: Critical
 Fix For: 0.23.1


 Pipes job fail with Hadoop Pipes Exception: Illegal text protocol command

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182915#comment-13182915
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Mapreduce-0.23-Commit #367 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-0.23-Commit/367/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
Contributed by Siddharth Seth.
svn merge --ignore-ancestry -c 1229403 ../../trunk/

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229405
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3645) TestJobHistory fails

2012-01-09 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182919#comment-13182919
 ] 

Hadoop QA commented on MAPREDUCE-3645:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12509969/MAPREDUCE-3645.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 eclipse:eclipse.  The patch built with eclipse:eclipse.

-1 findbugs.  The patch appears to introduce 1 new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1575//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1575//artifact/trunk/hadoop-mapreduce-project/patchprocess/newPatchFindbugsWarningshadoop-mapreduce-client-app.html
Console output: 
https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/1575//console

This message is automatically generated.

 TestJobHistory fails
 

 Key: MAPREDUCE-3645
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3645
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mrv1
Reporter: Thomas Graves
Assignee: Thomas Graves
Priority: Blocker
 Fix For: 0.23.0

 Attachments: MAPREDUCE-3645.patch


 TestJobHistory fails.
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderOnHDFS  
  org.apache.hadoop.mapred.TestJobHistory.testDoneFolderNotOnDefaultFileSystem

  org.apache.hadoop.mapred.TestJobHistory.testHistoryFolderOnHDFS   
  org.apache.hadoop.mapred.TestJobHistory.testJobHistoryFile 
 It looks like this was introduced by MAPREDUCE-3349 and the issue is that the 
 test expects the hostname to be in the format rackname/hostname, but with 
 3349 it split those apart into 2 different fields.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3528) The task timeout check interval should be configurable independent of mapreduce.task.timeout

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182921#comment-13182921
 ] 

Hudson commented on MAPREDUCE-3528:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #1535 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/1535/])
MAPREDUCE-3528. Fixed TaskHeartBeatHandler to use a new configuration for 
the thread loop interval separate from task-timeout configuration property. 
(Siddharth Seth via vinodkv)

vinodkv : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229403
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/TaskHeartbeatHandler.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/java/org/apache/hadoop/mapreduce/MRJobConfig.java


 The task timeout check interval should be configurable independent of 
 mapreduce.task.timeout
 

 Key: MAPREDUCE-3528
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3528
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: mr-am, mrv2
Affects Versions: 0.23.0
Reporter: Siddharth Seth
Assignee: Siddharth Seth
 Fix For: 0.23.1

 Attachments: MR3528.txt


 TaskHeartbeatHandler sleeps for 'mapreduce.task.timeout' - between each 
 check. If a task/NM goes bad immediately after starting a task - the timeout 
 is detected in ~2x the configured timeout interval.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (MAPREDUCE-3616) Thread pool for launching containers in MR AM not expanding as expected

2012-01-09 Thread Hudson (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13182922#comment-13182922
 ] 

Hudson commented on MAPREDUCE-3616:
---

Integrated in Hadoop-Mapreduce-trunk-Commit #1535 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/1535/])
MAPREDUCE-3616. Thread pool for launching containers in MR AM not expanding 
as expected. (Contributed by Vinod Kumar Vavilapalli)

sseth : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1229394
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/launcher/ContainerLauncherImpl.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/test/java/org/apache/hadoop/mapreduce/v2/app/launcher/TestContainerLauncher.java
* 
/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-common/src/main/java/org/apache/hadoop/mapreduce/v2/util/MRApps.java


 Thread pool for launching containers in MR AM not expanding as expected
 ---

 Key: MAPREDUCE-3616
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3616
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: mr-am, performance
Affects Versions: 0.23.1
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
 Fix For: 0.23.1

 Attachments: MAPREDUCE-3616-20120105.txt


 Found this while running some benchmarks on 350 nodes. The thread pool stays 
 at 60 for a long time and only expands to 350 towards the fag end of the job.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Resolved] (MAPREDUCE-3402) AMScalability test of Sleep job with 100K 1-sec maps regressed into running very slowly

2012-01-09 Thread Vinod Kumar Vavilapalli (Resolved) (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-3402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli resolved MAPREDUCE-3402.


Resolution: Fixed

Fixed after MAPREDUCE-3511.

 AMScalability test of Sleep job with 100K 1-sec maps regressed into running 
 very slowly
 ---

 Key: MAPREDUCE-3402
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3402
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
  Components: applicationmaster, mrv2
Affects Versions: 0.23.0
Reporter: Vinod Kumar Vavilapalli
Assignee: Vinod Kumar Vavilapalli
Priority: Blocker
 Fix For: 0.23.1


 The world was rosier before October 19-25, [~karams] says.
 The 100K 1 second sleep job used to take around 800mins or 13-14 mins. It now 
 runs till 45 mins and still manages to complete only about 45K tasks.
 One/more of the flurry of commits for 0.23.0 deserve(s) the blame.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




  1   2   >