[jira] [Commented] (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2011-09-14 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13105119#comment-13105119
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

Arun: Thanks for the comments. You are right. I guess this is not an issue 
since we have MRv2. Closing this now.


> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2011-02-26 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12999876#comment-12999876
 ] 

Hadoop QA commented on MAPREDUCE-2198:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12464776/MAPREDUCE-2198-v2.txt
  against trunk revision 1074251.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

-1 javac.  The patch appears to cause tar ant target to fail.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

-1 contrib tests.  The patch failed contrib unit tests.

+1 system test framework.  The patch passed system test framework compile.

Test results: 
https://hudson.apache.org/hudson/job/PreCommit-MAPREDUCE-Build/51//testReport/
Findbugs warnings: 
https://hudson.apache.org/hudson/job/PreCommit-MAPREDUCE-Build/51//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
https://hudson.apache.org/hudson/job/PreCommit-MAPREDUCE-Build/51//console

This message is automatically generated.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-12-07 Thread Joydeep Sen Sarma (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968862#action_12968862
 ] 

Joydeep Sen Sarma commented on MAPREDUCE-2198:
--

+1 on Matei's previous comments about needing to wait until slots are actually 
released. We should have another api to request the actual number of 
[map/reduce] slots in use on any given tracker and only claim slots when they 
are actually confirmed to be released.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-30 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965478#action_12965478
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

bq. However, I think this can already be done through the MRAdmin shell command 
/ API.
I also prefer using the existing interface and keep this one simpler.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-27 Thread dhruba borthakur (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12964503#action_12964503
 ] 

dhruba borthakur commented on MAPREDUCE-2198:
-

> Either have the process that's scaling down the cluster watch the running 
> tasks before giving the slots to someone else

I like this idea (instead of having callbacks, keeps the design simple)



> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-27 Thread Matei Zaharia (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12964477#action_12964477
 ] 

Matei Zaharia commented on MAPREDUCE-2198:
--

The changes look good, but I thought about one other issue: What should we do 
when we are asked to lower the slots on a node to below the number of running 
tasks on it? In the current version, the scheduler won't launch tasks on that 
node until its running task count falls below its slot count. However, if we 
wanted to use this for rollover, we'd probably want to wait until enough of 
those tasks are done before giving a slot to the new JobTracker. There are two 
ways we can do this: Either have the process that's scaling down the cluster 
watch the running tasks before giving the slots to someone else, or include an 
API that somehow makes a callback when the number of running tasks has 
decreased below the target slot count. What are your thoughts on this?

One other thing we may want to support is killing tasks after a timeout if the 
cluster hasn't scaled down. However, I think this can already be done through 
the MRAdmin shell command / API.

In either case, we probably need some API to see what's running on the cluster. 
Some of the commands in MRAdmin might be enough, but we may want to add 
something there. However, this can be a different JIRA.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198-v2.txt, MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-26 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12936147#action_12936147
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

Hey Matei, Thanks for the review.

{quote}
How can you run the FairSchedulerShell from the command line? It doesn't seem 
to have a main method (so just using bin/hadoop 
org.apache.hadoop.mapred.FairSchedulerShell doesn't work), and I don't see it 
registered as a tool anywhere.
{quote}
It's my bad. I forgot to put a main method in FairSchedulerShell. I will update 
it.

{quote}
Will the slot counts set by the scheduler be visible in the JobTracker web UI? 
It looks like jobtracker.jsp looks at ClusterMetrics and machines.jsp looks at 
TaskTrackerStatus objects.
{quote}
For the ClusterMetircs, it's OK because we use TaskScheduler.getMaxSlots() to 
calculate the total slots in JobTracker. But you are right about machines.jsp. 
I should change it so that it also pulls the information from scheduler.


> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-26 Thread Matei Zaharia (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12936124#action_12936124
 ] 

Matei Zaharia commented on MAPREDUCE-2198:
--

The approach in the patch looks good, but I have two questions:

- How can you run the FairSchedulerShell from the command line? It doesn't seem 
to have a main method (so just using bin/hadoop 
org.apache.hadoop.mapred.FairSchedulerShell doesn't work), and I don't see it 
registered as a tool anywhere.

- Will the slot counts set by the scheduler be visible in the JobTracker web 
UI? It looks like jobtracker.jsp looks at ClusterMetrics and machines.jsp looks 
at TaskTrackerStatus objects.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-25 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935935#action_12935935
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

I tried using review board. But I always get 500 error.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.23.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.23.0
>
> Attachments: MAPREDUCE-2198.txt
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-24 Thread Matei Zaharia (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935577#action_12935577
 ] 

Matei Zaharia commented on MAPREDUCE-2198:
--

One other minor comment: if the fair scheduler is over-scheduling task 
trackers, maybe we should consider that a bug. I don't think it was intended to 
do that, although in trunk at least, it looks like it may do it if mapAssignCap 
and reduceAssignCap are set to something less than infinity. (Otherwise, it 
looks at the number of slots free on the TT and does not assign more than 
that.) To deal with any sort of race condition that occurs if you lower a slot 
count while a heartbeat is in progress, I'd suggest making the TT report 
over-scheduled tasks as killed and drop them.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-24 Thread Matei Zaharia (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935568#action_12935568
 ] 

Matei Zaharia commented on MAPREDUCE-2198:
--

Do setup and cleanup tasks do a significant amount of work? I can see two 
simple ways of dealing with them other than moving control to the scheduler: 
Either always allow them to run (even if the TT is full on map and reduce 
slots), or allow them to run but limit the number of such tasks per node to 1 
(i.e. have a "setup slot"). If they do need to do CPU-intensive work, then it 
makes sense to give control of them to the scheduler.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-23 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935119#action_12935119
 ] 

Arun C Murthy commented on MAPREDUCE-2198:
--

Right. Also, you need to worry about task-cleanup-tasks i.e. responses to move 
tasks from COMMIT_PENDING to SUCCESS/KILLED.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-23 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935052#action_12935052
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

Hey Arun,

bq. You also have to worry about piggy-backing of task-cleanup tasks done by 
the JT...
I read some codes. Now I see your point.
JobTracker.getSetupAndCleanupTasks() is not controlled by the Scheduler.
So we need to put some logic there to make it aware of this task limit. Or we 
might be over-scheduled.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-23 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935001#action_12935001
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

Hey Arun and Joydeep,

bq. How do you handle heterogeneous clusters? Or will your CLI command be per 
TT?
Yes, the CLI command will be per TT.

bq. in addition - one bug/feature that we need to fix as part of this where the 
JT overschedules TTs (when configured to schedule multiple tasks per hbt). this 
is benign today (TT puts those tasks in unassigned state) - but in this world 
will not be so benign.
Yes, we should never let JT submit more tasks than TT's limit. Scheduler should 
incorporate this logic.

bq. You also have to worry about piggy-backing of task-cleanup tasks done by 
the JT...
I am not very clear about this part. Will read more codes.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-23 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934939#action_12934939
 ] 

Arun C Murthy commented on MAPREDUCE-2198:
--

bq. Also, as Joydeep pointed out there are several issues with unassigned slots 
in TTs...

You also have to worry about piggy-backing of task-cleanup tasks done by the 
JT...

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-23 Thread Arun C Murthy (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934931#action_12934931
 ] 

Arun C Murthy commented on MAPREDUCE-2198:
--

bq. Right now if you want to change the number of slots, you have to change the 
conf on every TT and restart.

How do you handle heterogeneous clusters? Or will your CLI command be per TT?

Also, as Joydeep pointed out there are several issues with unassigned slots in 
TTs...

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-22 Thread Joydeep Sen Sarma (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934753#action_12934753
 ] 

Joydeep Sen Sarma commented on MAPREDUCE-2198:
--

what is fairscheduler specific about this? could we not make this a change in 
the JT directly to scale the slots advertised by TT?

in addition - one bug/feature that we need to fix as part of this where the JT 
overschedules TTs (when configured to schedule multiple tasks per hbt). this is 
benign today (TT puts those tasks in unassigned state) - but in this world will 
not be so benign.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-22 Thread Scott Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934739#action_12934739
 ] 

Scott Chen commented on MAPREDUCE-2198:
---

Hey M.C.

Yes, we can set a higher slot limit on TT and let scheduler manage the slots.

bq.  Are we running two TT's on each node, that talk to different JT's, and 
"migrating" slots from one to the other?
Yes. The motivation here is that when deploying new JT and TT. We need to 
restart the cluster and we lose all the running jobs.
This can be solved by the way you described.

Other use case is that people can experiment with the best slot settings by 
using the CLI without restarting the cluster.
Right now if you want to change the number of slots, you have to change the 
conf on every TT and restart.

Scott

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-22 Thread dhruba borthakur (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934738#action_12934738
 ] 

dhruba borthakur commented on MAPREDUCE-2198:
-

>  Are we running two TT's on each node, that talk to different JT's, and 
> "migrating" slots from one to the other?

Precisely. We have a solution that is used to deploy new software to the JT-TT. 
Earlier, the JT/TT have to be shutdown, new code deployed, and then the cluster 
is restarted. This means that the cluster was unavailable for a while and 
currently running jobs all fails when the cluster is shutdown.

The modified approach is to direct all new jobs to the newly created JT 
instance, and then slowly (and proportionally) migrate slots from the old JT 
instance to the new JT instance. This allows JT software upgrades without 
incurring any cluster downtime.

> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (MAPREDUCE-2198) Allow FairScheduler to control the number of slots on each TaskTracker

2010-11-22 Thread M. C. Srivas (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934732#action_12934732
 ] 

M. C. Srivas commented on MAPREDUCE-2198:
-

The scheduler can easily choose to ignore the slot information while 
scheduling, correct? Then why is this needed? Are we running two TT's on each 
node, that talk to different JT's, and "migrating" slots from one to the other?


> Allow FairScheduler to control the number of slots on each TaskTracker
> --
>
> Key: MAPREDUCE-2198
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-2198
> Project: Hadoop Map/Reduce
>  Issue Type: New Feature
>  Components: contrib/fair-share
>Affects Versions: 0.22.0
>Reporter: Scott Chen
>Assignee: Scott Chen
> Fix For: 0.22.0
>
>
> We can set the number of slots on the TaskTracker to be high and let 
> FairScheduler handles the slots.
> This approach allows us to change the number of slots on each node 
> dynamically.
> The administrator can change the number of slots with a CLI tool.
> One use case of this is for upgrading the MapReduce.
> Instead of restarting the cluster, we can run the new MapReduce on the same 
> cluster.
> And use the CLI tool to gradually migrate the slots.
> This way we don't lost the progress fo the jobs that's already executed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.