Thanks, Sunil. It makes a lot of sense. I will try it out.

From: Sunil Govind [mailto:sunil.gov...@gmail.com]
Sent: Wednesday, October 12, 2016 9:21 AM
To: Frank Luo <j...@merkleinc.com>; user@hadoop.apache.org
Subject: Re: how to add a shareable node label?

Hi Frank

Thanks for sharing more details. Let me try this combination and I might be 
wrong, so pls correct me. And I think sharable node-label could help here.



Labels:

Node1-4 = default label

Node8-9 = "special" label



Queues:

"ProdQ" accessable-labels  is ""  (only default label)

"TestQ" accessable-labels  is ""  (only default label)



"LabeledQ" accessable-labels "special"



Capacity Per Queue:

"ProdQ"

capacity=50%

max-capacity=100%



"TestQ"

capacity=50%

max-capacity=50%



"LabeledQ"

special.capacity=100%

special.max-capacity=100%



Various Choices:

* Jobs in ProdQ is assured with 50% of default label resources and it can go to 
100% if there are no resource running in TestQ

* Jobs in TestQ can only get 50% of default label resources.

* If jobs in ProdQ or TestQ needed to make use of "special" label machines, it 
is only possible when there are resource available in "special" label. 
"special" is a non-exclusive label which can share its resource with "default" 
label.

* Any job submitted in "LabeledQ"  is assured with 100% of special resources 
and can use 100% if nothing is there. I think preemption could be made optional 
here.

If Inter queue preemption is enabled, we can enforce a normalization faster for 
default label, else apps might need to wait.  We could also try another 
approach as I shared in an earlier mail. But it ensure some % of resources for 
ProdQ and TestQ in LabeledQ which may not be suitable.



Thanks

Sunil

On Tue, Oct 11, 2016 at 10:38 PM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:







Hah, how so? I am confused as I was under impression that I needed sharing but 
not preemption.



Let’s model this out.



Assuming I got 4 “normal” machines node1-4, and two special node8 and node9 
where JobA can be executed on.



And I need two queues, ProdQ and TestQ equally sharing Node1-4, and a 
“LabeledQ” with node8/9.



When ProdQ is full, it can overflow to TestQ and further to LabeledQ. If TestQ 
is full, the tasks stay in TestQ, or optionally overflow to LabeledQ (either way
is fine as long as it doesn’t go to ProdQ). And when JobA is running, it can 
only go to LabelledQ. If something else is on LabelledQ, JobA waits.



Do you mind to illustrate how to config the queues to achieve what I am looking 
for?



Thank you Sunil.



From: Sunil Govind 
[mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Tuesday, October 11, 2016 11:44 AM


To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>; 
user@hadoop.apache.org<mailto:user@hadoop.apache.org>

Subject: Re: how to add a shareable node label?




Hi Frank






Extremely sorry for the delay..







Yes, you are correct. Sharing feature of node label is not needed in your case.



Existing node labels and a queue model could solve the problem.







Thanks



Sunil







On Fri, Oct 7, 2016 at 11:59 PM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





That is correct, Sunil.



Just to confirm,  the Node Labeling feature on 2.8 or 3.0 alpha won’t satisfy
my need, right?



From:
Sunil Govind [mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Friday, October 07, 2016 12:09 PM







To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>;
user@hadoop.apache.org<mailto:user@hadoop.apache.org>

Subject: Re: how to add a shareable node label?








HI Frank






In that case, preemption may not be needed. So over-utilizing resources of 
queueB will be running till it completes. Since queueA is under served, then 
any next free container could
go to queueA which is for Job_A.







Thanks



Sunil







On Fri, Oct 7, 2016 at 9:58 PM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





Sunil,



Your description pretty much matches my understanding. Except for “Job_A
will have to run as per its schedule w/o any delay”. My situation is that Job_A 
can be delayed. As long as it runs in queueA, I am happy.



Just as you said, processes normally running in queueB might not be preemptable.
So if they overflow to queueA then got preempted, then that is not good.



From:
Sunil Govind [mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Friday, October 07, 2016 10:50 AM







To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>;
user@hadoop.apache.org<mailto:user@hadoop.apache.org>





Subject:
Re: how to add a shareable node label?









HI Frank






Thanks for the details.







I am not quite sure if I understood you problem correctly. I think you are 
looking for a solution to ensure that Job_A will have to run as per its 
schedule w/o any delay. Meantime
you also do not want to waste resources on those high end machine where Job_A 
is running.







I think you still need node label exclusivity here since there is h/w 
dependency. But if you have 2 queues' which are shared to use "labelA" here, 
then always "Job_A" can be planned
to run in that queue, say "queueA". Other jobs could be run in "queueB" here. 
So if you tune capacities and if preemption is enabled per queue level, 
overutilized resources used by "queueB" could be preempted for "Job_A".







But if your sharable jobs are like some linux jobs which should not be 
preempted, then this may be only a half solution.







Thanks





Sunil







On Fri, Oct 7, 2016 at 7:36 AM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





Sunil,



You confirmed my understanding. I got the understanding by reading the docs
and haven’t really tried 2.8 or 3.0-alphal.



My situation is that I am in a multi-tenant env, and  got several very powerful
machines with expensive licenses to run a particular linux job, let’s say 
Job_A. But the job is executed infrequently, so I want to let other jobs to use 
the machines when Job_A is not running. In the meaning time, I am not powerful 
enough to force all other
jobs to be preemptable. As matter of fact, I know they have Hadoop jobs 
inserting into sql-server, or just pure linux jobs that are not preemptable in 
nature. So preempt jobs is not an option for me.



I hope it makes sense.



Frank



From:
Sunil Govind [mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Thursday, October 06, 2016 2:15 PM







To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>;
user@hadoop.apache.org<mailto:user@hadoop.apache.org>

Subject: Re: how to add a shareable node label?








HI Frank






Ideally those containers will be preempted if there are unsatisfied demand for 
"configured label".







I could explain this:



"labelA" has few empty resources.  All nodes under "default" label is used. 
Hence a new application which is submitted to "default" label has to wait. But 
if "labelA" is non-exclusive
and there are some free resources, this new application can run on "labelA".



However if there are some more new apps submitted to "labelA", and if there are 
no more resources available in "labelA", then it may preempt containers from 
the app which was sharing
containers earlier.







May be you could share some more information so tht it may become more clear. 
Also I suppose you are running this in hadoop 3 alpha1 release. please correct 
me if I m wrong.







Thanks



Sunil







On Thu, Oct 6, 2016 at 9:44 PM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





Thanks Sunil.




>
3. If there is any future ask for those resources , we will preempt the non 
labeled apps and give them back to labeled apps.



Unfortunately, I am still not able to use it, because of the preemptive 
behavior.
The jobs that steals labelled resources are not preemptable, and I’d rather 
waiting instead of killing.



From:
Sunil Govind [mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Thursday, October 06, 2016 1:59 AM







To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>;
user@hadoop.apache.org<mailto:user@hadoop.apache.org>





Subject:
Re: how to add a shareable node label?








Hi Frank


I think as of today this is not possible. You could try and experience the 
"non-exlusive" feature of node-label which will officially come in 2.8 soon. Or 
you can try it in "Hadoop
3 alpha1" release too if its fine to check. 
YARN-3214<https://issues.apache.org/jira/browse/YARN-3214> has
the details for the nodelabel sharing concept.







Thanks



Sunil







On Wed, Oct 5, 2016 at 8:14 PM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





Sunil, thanks for responding.



So is there any way to dedicate one kind of jobs to certain machines, then 
having
those machines be shared if no dedicated job running?



From:
Sunil Govind [mailto:sunil.gov...@gmail.com<mailto:sunil.gov...@gmail.com>]


Sent: Wednesday, October 05, 2016 12:50 AM

To: Frank Luo <j...@merkleinc.com<mailto:j...@merkleinc.com>>;
user@hadoop.apache.org<mailto:user@hadoop.apache.org>;
u...@yarn.apache.org<mailto:u...@yarn.apache.org>







Subject: Re: how to add a shareable node label?








Hi Frank,












As far as I checked, all labels are "exclusive" in 2.7. In upcoming 2.8 
release, we can get "non-exclusive" or sharable node labels.







Thanks



Sunil











On Wed, Oct 5, 2016 at 8:40 AM Frank Luo 
<j...@merkleinc.com<mailto:j...@merkleinc.com>> wrote:





I am using Hadoop 2.7.3, when I run:


$ yarn rmadmin -addToClusterNodeLabels "Label1(exclusive=false)"



I got an error as:


… addToClusterNodeLabels: java.io.IOException: label name should only contains 
{0-9, a-z, A-Z, -, _} and should not started with {-,_}



If I just use “Label1”, it will work fine, but I want a shareable one.




Anyone knows a better way to do it?

































Access the Q2 2016 Digital Marketing Report for a fresh set of trends and 
benchmarks in digital 
marketing<http://www2.merkleinc.com/l/47252/2016-07-26/47gt7c>


Download our latest report titled “The Case for Change: Exploring the Myths of 
Customer-Centric Transformation”
<http://www2.merkleinc.com/l/47252/2016-08-04/4b9p7c>


This email and any attachments transmitted with it are intended for use by the 
intended recipient(s) only. If you have received this email in error, please 
notify the sender
immediately and then delete it. If you are not the intended recipient, you must 
not keep, use, disclose, copy or distribute this email without the author’s 
prior permission. We take precautions to minimize the risk of transmitting 
software viruses, but we
advise you to perform your own virus checks on any attachment to this message. 
We cannot accept liability for any loss or damage caused by software viruses. 
The information contained in this communication may be confidential and may be 
subject to the attorney-client
privilege.


Access the Q2 2016 Digital Marketing Report for a fresh set of trends and 
benchmarks in digital 
marketing<http://www2.merkleinc.com/l/47252/2016-07-26/47gt7c>

Download our latest report titled “The Case for Change: Exploring the Myths of 
Customer-Centric Transformation” 
<http://www2.merkleinc.com/l/47252/2016-08-04/4b9p7c>

This email and any attachments transmitted with it are intended for use by the 
intended recipient(s) only. If you have received this email in error, please 
notify the sender immediately and then delete it. If you are not the intended 
recipient, you must not keep, use, disclose, copy or distribute this email 
without the author’s prior permission. We take precautions to minimize the risk 
of transmitting software viruses, but we advise you to perform your own virus 
checks on any attachment to this message. We cannot accept liability for any 
loss or damage caused by software viruses. The information contained in this 
communication may be confidential and may be subject to the attorney-client 
privilege.

Reply via email to