[jira] Commented: (MAPREDUCE-828) Provide a mechanism to pause the jobtracker

2009-08-06 Thread Hemanth Yamijala (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12739914#action_12739914
 ] 

Hemanth Yamijala commented on MAPREDUCE-828:


Some initial thoughts on implementation:
- From the time the pause command is issued, the JT will process heartbeats by 
sending back a special response to the TTs indicating its state.
- This special command will cause the TTs to replay their message as if the 
original message was not received by the JT.
- The above is similar to what happens today if TTs fail to communicate with 
the JT.
- The JT will not process any data sent by the TTs during a paused state. IOW 
job state will not change. Since this data will be replayed until resuming, it 
will not be lost and can be picked up once the JT is resumed.
- The ExpireLaunchingTasks thread will pause as well, since the status of tasks 
last launched before the pause will not be updated.
- The CleanupQueue thread on the JT which deletes files from the DFS will also 
be paused as it might fail DFS deletes.

Thoughts on the requirements / proposal ?

 Provide a mechanism to pause the jobtracker
 ---

 Key: MAPREDUCE-828
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-828
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: jobtracker
Reporter: Hemanth Yamijala

 We've seen scenarios when we have needed to stop the namenode for a 
 maintenance activity. In such scenarios, if the jobtracker (JT) continues to 
 run, jobs would fail due to initialization or task failures (due to DFS). We 
 could restart the JT enabling job recovery, during such scenarios. But 
 restart has proved to be a very intrusive activity, particularly if the JT is 
 not at fault itself and does not require a restart. The ask is for a 
 admin-controlled feature to pause the JT which would take it to a state 
 somewhat analogous to the safe mode of DFS.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-227) Ability to pause/resume jobs

2009-08-06 Thread Alejandro Abdelnur (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12739913#action_12739913
 ] 

Alejandro Abdelnur commented on MAPREDUCE-227:
--

Between AUG03 and AUG10 I'll be on vacations and I will not checking email.

I'll reply to your message at my return.

Alejandro


 Ability to pause/resume jobs
 

 Key: MAPREDUCE-227
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-227
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
Reporter: Amar Kamat
Assignee: Amar Kamat
 Attachments: HADOOP-4350-v1.2.patch, HADOOP-4350-v1.3.patch


 Consider a case where the user job depends on some external entity/service 
 like a database or a web service. If the service needs restart or encounters 
 a failure, the user should be able to pause the job and resume only when the 
 service is up. This will be better than re-executing the whole job. Hence 
 there should be some way to pause/resume jobs (from web-ui/command line) etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-659) gridmix2 not compiling under mapred module trunk/src/benchmarks/gridmix2

2009-08-06 Thread Giridharan Kesavan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Giridharan Kesavan updated MAPREDUCE-659:
-

   Resolution: Fixed
Fix Version/s: 0.21.0
   Status: Resolved  (was: Patch Available)

I just committed this.
Thanks for the comments.

 gridmix2 not compiling under mapred module trunk/src/benchmarks/gridmix2 
 -

 Key: MAPREDUCE-659
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-659
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: build
 Environment: latest trunk
Reporter: Iyappan Srinivasan
Assignee: Giridharan Kesavan
Priority: Critical
 Fix For: 0.21.0

 Attachments: 659-1.patch, MAPREDUCE-659-V2.PATCH, MAPREDUCE-659.PATCH


 When build is tried in gridmix2, it fails
 trunk/src/benchmarks/gridmix2 $ ant
 Buildfile: build.xml
 init:
 compile:
 [javac] Compiling 3 source files to 
 /home/iyappans/new_trunk1/mapreduce/trunk/src/benchmarks/gridmix2/build
 [javac] 
 /home/iyappans/new_trunk1/mapreduce/trunk/src/benchmarks/gridmix2/src/java/org/apache/hadoop/mapred/GridMixRunner.java:40:
  package org.apache.hadoop.streaming does not exist
 [javac] import org.apache.hadoop.streaming.StreamJob;
 [javac]   ^
 [javac] 
 /home/iyappans/new_trunk1/mapreduce/trunk/src/benchmarks/gridmix2/src/java/org/apache/hadoop/mapred/GridMixRunner.java:123:
  cannot find symbol
 [javac] symbol: variable StreamJob
 [javac] JobConf jobconf = StreamJob.createJob(args);
 [javac]   ^
 [javac] Note: Some input files use or override a deprecated API.
 [javac] Note: Recompile with -Xlint:deprecation for details.
 [javac] 2 errors
 BUILD FAILED
 /home/iyappans/new_trunk1/mapreduce/trunk/src/benchmarks/gridmix2/build.xml:27:
  Compile failed; see the compiler error output for details.
 Total time: 1 second

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-478) separate jvm param for mapper and reducer

2009-08-06 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-478:


Fix Version/s: 0.21.0
   Status: Patch Available  (was: Open)

 separate jvm param for mapper and reducer
 -

 Key: MAPREDUCE-478
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-478
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Koji Noguchi
Assignee: Arun C Murthy
Priority: Minor
 Fix For: 0.21.0

 Attachments: HADOOP-5684_0_20090420.patch, 
 MAPREDUCE-478_0_20090804.patch, MAPREDUCE-478_0_20090804_yhadoop20.patch


 Memory footprint of mapper and reducer can differ. 
 It would be nice if we can pass different jvm param (mapred.child.java.opts) 
 for mappers and reducers.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-706) Support for FIFO pools in the fair scheduler

2009-08-06 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12739939#action_12739939
 ] 

Hadoop QA commented on MAPREDUCE-706:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12415552/mapreduce-706.v3.patch
  against trunk revision 800693.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 9 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/443/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/443/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/443/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/443/console

This message is automatically generated.

 Support for FIFO pools in the fair scheduler
 

 Key: MAPREDUCE-706
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-706
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: contrib/fair-share
Reporter: Matei Zaharia
Assignee: Matei Zaharia
 Attachments: fsdesigndoc.pdf, fsdesigndoc.tex, mapreduce-706.patch, 
 mapreduce-706.v1.patch, mapreduce-706.v2.patch, mapreduce-706.v3.patch


 The fair scheduler should support making the internal scheduling algorithm 
 for some pools be FIFO instead of fair sharing in order to work better for 
 batch workloads. FIFO pools will behave exactly like the current default 
 scheduler, sorting jobs by priority and then submission time. Pools will have 
 their scheduling algorithm set through the pools config file, and it will be 
 changeable at runtime.
 To support this feature, I'm also changing the internal logic of the fair 
 scheduler to no longer use deficits. Instead, for fair sharing, we will 
 assign tasks to the job farthest below its share as a ratio of its share. 
 This is easier to combine with other scheduling algorithms and leads to a 
 more stable sharing situation, avoiding unfairness issues brought up in 
 MAPREDUCE-543 and MAPREDUCE-544 that happen when some jobs have long tasks. 
 The new preemption (MAPREDUCE-551) will ensure that critical jobs can gain 
 their fair share within a bounded amount of time.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (MAPREDUCE-829) When localizing job configuration from FS, TTs configuration on local disk is not loaded at all

2009-08-06 Thread Vinod K V (JIRA)
When localizing job configuration from FS, TTs configuration on local disk is 
not loaded at all
---

 Key: MAPREDUCE-829
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-829
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: tasktracker
Reporter: Vinod K V


We should first load the local configuration fConf, over which the job.xml from 
the JobTracker's file system should be loaded. This is needed so as to enforce 
settings specific to the TaskTracker if it has some.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-814) Move completed Job history files to HDFS

2009-08-06 Thread Sharad Agarwal (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sharad Agarwal updated MAPREDUCE-814:
-

Attachment: 814_v4.patch

Changes from last patch: Refactored JobTracker constructor to fix an issue 
related to Jobtracker recovery requiring history to be inited. 

 Move completed Job history files to HDFS
 

 Key: MAPREDUCE-814
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-814
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: jobtracker
Reporter: Sharad Agarwal
Assignee: Sharad Agarwal
 Attachments: 814_v1.patch, 814_v2.patch, 814_v3.patch, 814_v4.patch


 Currently completed job history files remain on the jobtracker node. Having 
 the files available on HDFS will enable clients to access these files more 
 easily.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-824) Support a hierarchy of queues in the capacity scheduler

2009-08-06 Thread rahul k singh (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

rahul k singh updated MAPREDUCE-824:


Attachment: HADOOP-824-1.patch

Uploaded the patch:
Patch implements the model discussed above. This is first cut of the 
implementation.

Some Class level details:
==
The hierarchial queue follows composite pattern.

Component

AbstractQueue :

- This is abstraction for all the queues.
- implements default behavior for some of the abstraction.
- declares interfaces to access the children and its components.
- defines an interface for accessing a queue's parent in the recursive 
structure.

Composite
-
ContainerQueue:

- This provides functionality mentioned as the part of Sub-Queues.
- consists of composition of AbstractQueue.
- provides functionality to manipulate children
- delegates respective children component calls.

Leaf
--
JobQueue:

- This provides functionality similar to Leaf Queue mentioned in the document 
above.
- Jobs would be submitted only to this queue.
- Consists of list of jobs and acts on the JobInProgressListener events.


JobQueuesManager no more maintains list of runningJob and waitingJobs , they 
are maintained by JobQueue. It simply delegates
the call to JobQueue.


AbstractQueue have QueueSchedulingContext ,defined , it consists of all the 
queue related information required for scheduling decision.

Some refactoring has been done in terms of moving out some of the inner classes 
from CapacityTaskScheduler , esp the QueueSchedulingInfo and TaskSchedulingInfo.


I ll be uploading patch with testcases.

 Support a hierarchy of queues in the capacity scheduler
 ---

 Key: MAPREDUCE-824
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-824
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: contrib/capacity-sched
Reporter: Hemanth Yamijala
 Attachments: HADOOP-824-1.patch


 Currently in Capacity Scheduler, cluster capacity is divided among the queues 
 based on the queue capacity. These queues typically represent an organization 
 and the capacity of the queue represents the capacity the organization is 
 entitled to. Most organizations are large and need to divide their capacity 
 among sub-organizations they have. Or they may want to divide the capacity 
 based on a category or type of jobs they run. This JIRA covers the 
 requirements and other details to provide the above feature.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-479) Add reduce ID to shuffle clienttrace

2009-08-06 Thread Jiaqi Tan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jiaqi Tan updated MAPREDUCE-479:


Release Note: Adds Reduce Attempt ID to ClientTrace log messages, and adds 
Reduce Attempt ID to HTTP query string sent to mapOutputServlet. Extracts 
partition number from attempt ID.   (was: Adds Reduce Attempt ID to ClientTrace 
log messages, and adds Reduce Attempt ID to HTTP query string sent to 
mapOutputServlet.)
  Status: Patch Available  (was: Open)

Did microbenchmark of shuffle durations with and without added reduce attempt 
ID transmission and reduce partition number extraction; shuffle times before 
and after this patch are statistically comparable (chi-squared test for 
distribution similarity of shuffle times, p-value 0.23 = null-hypothesis of 
statistically different distributions not rejected); thus this patch does not 
cause any performance impact.

 Add reduce ID to shuffle clienttrace
 

 Key: MAPREDUCE-479
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-479
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Affects Versions: 0.21.0
Reporter: Jiaqi Tan
Assignee: Jiaqi Tan
Priority: Minor
 Fix For: 0.21.0

 Attachments: HADOOP-6013.patch, MAPREDUCE-479-1.patch, 
 MAPREDUCE-479-2.patch, MAPREDUCE-479-3.patch, MAPREDUCE-479-4.patch, 
 MAPREDUCE-479.patch


 Current clienttrace messages from shuffles note only the destination map ID 
 but not the source reduce ID. Having both source and destination ID of each 
 shuffle enables full tracing of execution. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-805) Deadlock in Jobtracker

2009-08-06 Thread Amar Kamat (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740074#action_12740074
 ] 

Amar Kamat commented on MAPREDUCE-805:
--

Test failed with 2 errors TestReduceFetch FAILED (timeout) and 
TestTaskTrackerMemoryManager FAILED. Doesnt seem related but will debug.


 Deadlock in Jobtracker
 --

 Key: MAPREDUCE-805
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-805
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Michael Tamm
 Attachments: MAPREDUCE-805-v1.1.patch, MAPREDUCE-805-v1.2.patch, 
 MAPREDUCE-805-v1.3.patch, MAPREDUCE-805-v1.6.patch


 We are running a hadoop cluster (version 0.20.0) and have detected the 
 following deadlock on our jobtracker:
 {code}
 IPC Server handler 51 on 9001:
   at 
 org.apache.hadoop.mapred.JobInProgress.getCounters(JobInProgress.java:943)
   - waiting to lock 0x7f2b6fb46130 (a 
 org.apache.hadoop.mapred.JobInProgress)
   at 
 org.apache.hadoop.mapred.JobTracker.getJobCounters(JobTracker.java:3102)
   - locked 0x7f2b5f026000 (a org.apache.hadoop.mapred.JobTracker)
   at sun.reflect.GeneratedMethodAccessor21.invoke(Unknown Source)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:396)
   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)
  pool-1-thread-2:
   at org.apache.hadoop.mapred.JobTracker.finalizeJob(JobTracker.java:2017)
   - waiting to lock 0x7f2b5f026000 (a 
 org.apache.hadoop.mapred.JobTracker)
   at 
 org.apache.hadoop.mapred.JobInProgress.garbageCollect(JobInProgress.java:2483)
   - locked 0x7f2b6fb46130 (a org.apache.hadoop.mapred.JobInProgress)
   at 
 org.apache.hadoop.mapred.JobInProgress.terminateJob(JobInProgress.java:2152)
   - locked 0x7f2b6fb46130 (a org.apache.hadoop.mapred.JobInProgress)
   at 
 org.apache.hadoop.mapred.JobInProgress.terminate(JobInProgress.java:2169)
   - locked 0x7f2b6fb46130 (a org.apache.hadoop.mapred.JobInProgress)
   at org.apache.hadoop.mapred.JobInProgress.fail(JobInProgress.java:2245)
   - locked 0x7f2b6fb46130 (a org.apache.hadoop.mapred.JobInProgress)
   at 
 org.apache.hadoop.mapred.EagerTaskInitializationListener$InitJob.run(EagerTaskInitializationListener.java:86)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
   at java.lang.Thread.run(Thread.java:619)
 {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-825) JobClient completion poll interval of 5s causes slow tests in local mode

2009-08-06 Thread Aaron Kimball (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-825:


Status: Open  (was: Patch Available)

 JobClient completion poll interval of 5s causes slow tests in local mode
 

 Key: MAPREDUCE-825
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-825
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Aaron Kimball
Assignee: Aaron Kimball
Priority: Minor
 Attachments: completion-poll-interval.patch


 The JobClient.NetworkedJob.waitForCompletion() method polls for job 
 completion every 5 seconds. When running a set of short tests in 
 pseudo-distributed mode, this is unnecessarily slow and causes lots of wasted 
 time. When bandwidth is not scarce, setting the poll interval to 100 ms 
 results in a 4x speedup in some tests.  This interval should be parametrized 
 to allow users to control the interval for testing purposes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-825) JobClient completion poll interval of 5s causes slow tests in local mode

2009-08-06 Thread Aaron Kimball (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron Kimball updated MAPREDUCE-825:


Attachment: MAPREDUCE-825.2.patch

Attaching a new patch that also includes a new 
jobclient.progress.monitor.poll.interval setting; default is 1000 ms. 
Modified TestTaskFail to set the completion poll interval to 50 ms.

With default (5000) ms timeout, test runtime was 3 minutes 15 seconds. Setting 
the timeout to 50 ms reduced test runtime to 3 minutes 8 seconds. If we expect 
an average of 2500 milliseconds wasted per job in the default case, then this 
is 2500*3 = 7500 ms expected to be wasted, so the observed speedup seems 
correct. To be sure, I also set the timeout to 2 ms; test runtime went up 
to 3 minutes 52 seconds. So there's definitely a correlation.

 JobClient completion poll interval of 5s causes slow tests in local mode
 

 Key: MAPREDUCE-825
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-825
 Project: Hadoop Map/Reduce
  Issue Type: Bug
Reporter: Aaron Kimball
Assignee: Aaron Kimball
Priority: Minor
 Attachments: completion-poll-interval.patch, MAPREDUCE-825.2.patch


 The JobClient.NetworkedJob.waitForCompletion() method polls for job 
 completion every 5 seconds. When running a set of short tests in 
 pseudo-distributed mode, this is unnecessarily slow and causes lots of wasted 
 time. When bandwidth is not scarce, setting the poll interval to 100 ms 
 results in a 4x speedup in some tests.  This interval should be parametrized 
 to allow users to control the interval for testing purposes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-706) Support for FIFO pools in the fair scheduler

2009-08-06 Thread Matei Zaharia (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matei Zaharia updated MAPREDUCE-706:


Status: Open  (was: Patch Available)

 Support for FIFO pools in the fair scheduler
 

 Key: MAPREDUCE-706
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-706
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: contrib/fair-share
Reporter: Matei Zaharia
Assignee: Matei Zaharia
 Attachments: fsdesigndoc.pdf, fsdesigndoc.tex, mapreduce-706.patch, 
 mapreduce-706.v1.patch, mapreduce-706.v2.patch, mapreduce-706.v3.patch, 
 mapreduce-706.v4.patch


 The fair scheduler should support making the internal scheduling algorithm 
 for some pools be FIFO instead of fair sharing in order to work better for 
 batch workloads. FIFO pools will behave exactly like the current default 
 scheduler, sorting jobs by priority and then submission time. Pools will have 
 their scheduling algorithm set through the pools config file, and it will be 
 changeable at runtime.
 To support this feature, I'm also changing the internal logic of the fair 
 scheduler to no longer use deficits. Instead, for fair sharing, we will 
 assign tasks to the job farthest below its share as a ratio of its share. 
 This is easier to combine with other scheduling algorithms and leads to a 
 more stable sharing situation, avoiding unfairness issues brought up in 
 MAPREDUCE-543 and MAPREDUCE-544 that happen when some jobs have long tasks. 
 The new preemption (MAPREDUCE-551) will ensure that critical jobs can gain 
 their fair share within a bounded amount of time.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-706) Support for FIFO pools in the fair scheduler

2009-08-06 Thread Matei Zaharia (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matei Zaharia updated MAPREDUCE-706:


Attachment: mapreduce-706.v4.patch

I've made a few changes to the Web UI to show more / better info:
- Pools now have their scheduling mode and fair shares displayed.
- Jobs in a FIFO pool have their fair share displayed as NA instead of 0.

Attaching a patch with these changes.

 Support for FIFO pools in the fair scheduler
 

 Key: MAPREDUCE-706
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-706
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: contrib/fair-share
Reporter: Matei Zaharia
Assignee: Matei Zaharia
 Attachments: fsdesigndoc.pdf, fsdesigndoc.tex, mapreduce-706.patch, 
 mapreduce-706.v1.patch, mapreduce-706.v2.patch, mapreduce-706.v3.patch, 
 mapreduce-706.v4.patch


 The fair scheduler should support making the internal scheduling algorithm 
 for some pools be FIFO instead of fair sharing in order to work better for 
 batch workloads. FIFO pools will behave exactly like the current default 
 scheduler, sorting jobs by priority and then submission time. Pools will have 
 their scheduling algorithm set through the pools config file, and it will be 
 changeable at runtime.
 To support this feature, I'm also changing the internal logic of the fair 
 scheduler to no longer use deficits. Instead, for fair sharing, we will 
 assign tasks to the job farthest below its share as a ratio of its share. 
 This is easier to combine with other scheduling algorithms and leads to a 
 more stable sharing situation, avoiding unfairness issues brought up in 
 MAPREDUCE-543 and MAPREDUCE-544 that happen when some jobs have long tasks. 
 The new preemption (MAPREDUCE-551) will ensure that critical jobs can gain 
 their fair share within a bounded amount of time.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-749) Make Sqoop unit tests more Hudson-friendly

2009-08-06 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740344#action_12740344
 ] 

Hadoop QA commented on MAPREDUCE-749:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12415424/MAPREDUCE-749.3.patch
  against trunk revision 801517.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 5 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/447/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/447/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/447/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/447/console

This message is automatically generated.

 Make Sqoop unit tests more Hudson-friendly
 --

 Key: MAPREDUCE-749
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-749
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: Aaron Kimball
 Attachments: MAPREDUCE-749.2.patch, MAPREDUCE-749.3.patch, 
 MAPREDUCE-749.patch


 Hudson servers (other than Apache's) need to be able to run the sqoop unit 
 tests which depend on thirdparty JDBC drivers / database implementations. The 
 build.xml needs some refactoring to make this happen.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-749) Make Sqoop unit tests more Hudson-friendly

2009-08-06 Thread Aaron Kimball (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740354#action_12740354
 ] 

Aaron Kimball commented on MAPREDUCE-749:
-

Streaming test failures are unrelated

 Make Sqoop unit tests more Hudson-friendly
 --

 Key: MAPREDUCE-749
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-749
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: contrib/sqoop
Reporter: Aaron Kimball
Assignee: Aaron Kimball
 Attachments: MAPREDUCE-749.2.patch, MAPREDUCE-749.3.patch, 
 MAPREDUCE-749.patch


 Hudson servers (other than Apache's) need to be able to run the sqoop unit 
 tests which depend on thirdparty JDBC drivers / database implementations. The 
 build.xml needs some refactoring to make this happen.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-478) separate jvm param for mapper and reducer

2009-08-06 Thread Arun C Murthy (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated MAPREDUCE-478:


Status: Patch Available  (was: Open)

 separate jvm param for mapper and reducer
 -

 Key: MAPREDUCE-478
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-478
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
Reporter: Koji Noguchi
Assignee: Arun C Murthy
Priority: Minor
 Fix For: 0.21.0

 Attachments: HADOOP-5684_0_20090420.patch, 
 MAPREDUCE-478_0_20090804.patch, MAPREDUCE-478_0_20090804_yhadoop20.patch, 
 MAPREDUCE-478_1_20090806.patch, MAPREDUCE-478_1_20090806_yhadoop20.patch


 Memory footprint of mapper and reducer can differ. 
 It would be nice if we can pass different jvm param (mapred.child.java.opts) 
 for mappers and reducers.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-372) Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.

2009-08-06 Thread Amareshwari Sriramadasu (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740386#action_12740386
 ] 

Amareshwari Sriramadasu commented on MAPREDUCE-372:
---

Test failure TestTaskTrackerMemoryManager is not related to the patch. It is 
due to HADOOP-6075

 Change org.apache.hadoop.mapred.lib.ChainMapper/Reducer to use new api.
 ---

 Key: MAPREDUCE-372
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-372
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: patch-372-1.txt, patch-372.txt




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-779) Add node health failures into JobTrackerStatistics

2009-08-06 Thread Hemanth Yamijala (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740387#action_12740387
 ] 

Hemanth Yamijala commented on MAPREDUCE-779:


I looked at the patch. Overall, looking good. 

Sharad and I feel that the method 
JobTrackerStatistics.taskTrackerHealthCheckFailed() is not required. I think it 
is assumed that the tasktracker stat will be created and available. Further, it 
seems like the class JobTrackerStatistics manages the TasktrackerStatistics 
instances and nothing more. So, the method seems out of place. Hence, wherever 
we need to update the health check failure count, we could simply call 
JobTrackerStatistics.getTaskTrackerStat().incrHealthCheckFailed(). Sreekanth, 
any reason you thought the TaskTrackerStat for a TT will not be available and 
hence were implicitly managing the creation of it ?

One more minor nit is we can rename the label Blacklisted due to health 
Failures to Failed Health Checks just to reduce the verbosity.

Other than this +1

 Add node health failures into JobTrackerStatistics
 --

 Key: MAPREDUCE-779
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-779
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker
Reporter: Sreekanth Ramakrishnan
Assignee: Sreekanth Ramakrishnan
 Attachments: mapreduce-779-1.patch, mapreduce-779-2.patch, 
 mapreduce-779-3.patch


 Add the node health failure counts into {{JobTrackerStatistics}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-757) JobConf will not be deleted from the logs folder if job retires from finalizeJob()

2009-08-06 Thread Amar Kamat (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740399#action_12740399
 ] 

Amar Kamat commented on MAPREDUCE-757:
--

ant tests and contrib tests passed. 

 JobConf will not be deleted from the logs folder if job retires from 
 finalizeJob()
 --

 Key: MAPREDUCE-757
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-757
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: jobtracker
Reporter: Amar Kamat
Assignee: Amar Kamat
 Attachments: MAPREDUCE-757-v1.0.patch, MAPREDUCE-757-v2.0.patch


 MAPREDUCE-130 fixed the case where the job is retired from the retire jobs 
 thread. But jobs can also retire when the num-job-per-user limit is exceeded. 
 In such cases the conf file will not be deleted.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (MAPREDUCE-779) Add node health failures into JobTrackerStatistics

2009-08-06 Thread Sreekanth Ramakrishnan (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sreekanth Ramakrishnan updated MAPREDUCE-779:
-

Attachment: mapreduce-779-4.patch

Had added that method assuming that TaskTrackerStat is not created until first 
task is run/failed in a lazy manner, now rechecked that the stat object is 
created when tracker is added. So I have removed the method.

Renamed the heading in JSP.

Also added {{TestTaskTrackerBlacklisting}} to commit-tests list. 

 Add node health failures into JobTrackerStatistics
 --

 Key: MAPREDUCE-779
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-779
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: jobtracker
Reporter: Sreekanth Ramakrishnan
Assignee: Sreekanth Ramakrishnan
 Attachments: mapreduce-779-1.patch, mapreduce-779-2.patch, 
 mapreduce-779-3.patch, mapreduce-779-4.patch


 Add the node health failure counts into {{JobTrackerStatistics}}.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-375) Change org.apache.hadoop.mapred.lib.NLineInputFormat and org.apache.hadoop.mapred.MapFileOutputFormat to use new api.

2009-08-06 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12740409#action_12740409
 ] 

Hadoop QA commented on MAPREDUCE-375:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12415117/patch-375-2.txt
  against trunk revision 801517.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 6 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

Test results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/448/testReport/
Findbugs warnings: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/448/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/448/artifact/trunk/build/test/checkstyle-errors.html
Console output: 
http://hudson.zones.apache.org/hudson/job/Mapreduce-Patch-vesta.apache.org/448/console

This message is automatically generated.

  Change org.apache.hadoop.mapred.lib.NLineInputFormat and 
 org.apache.hadoop.mapred.MapFileOutputFormat to use new api.
 --

 Key: MAPREDUCE-375
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-375
 Project: Hadoop Map/Reduce
  Issue Type: Sub-task
Reporter: Amareshwari Sriramadasu
Assignee: Amareshwari Sriramadasu
 Fix For: 0.21.0

 Attachments: patch-375-1.txt, patch-375-2.txt, patch-375.txt




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.