[ 
https://issues.apache.org/jira/browse/YARN-392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13622544#comment-13622544
 ] 

Bikas Saha commented on YARN-392:
---------------------------------

I don't think it's possible for location-specific and non-location-specific 
requests to live at the same priority. This is mainly because of the way 
current schedulers are implemented in the RM (requests are grouped together 
and keyed by location and priority). Such requests have to be separated by 
priority, and that may not be a bad thing IMO.
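
To illustrate the keying issue, here is a minimal sketch of a request table 
keyed by priority and then location. It is a simplified model, not the RM's 
actual classes: RequestTable, add and outstanding are hypothetical names, and 
locations are plain strings. It assumes that requests sharing a key are merged 
into a single count.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical, simplified model of scheduler bookkeeping; not YARN code. */
public final class RequestTable {
    // priority -> (location -> number of outstanding containers)
    private final Map<Integer, Map<String, Integer>> requests = new HashMap<>();

    /** Records a request; requests with the same (priority, location) are merged. */
    public void add(int priority, String location, int containers) {
        requests.computeIfAbsent(priority, p -> new HashMap<>())
                .merge(location, containers, Integer::sum);
    }

    /** Returns the merged container count for a (priority, location) key. */
    public int outstanding(int priority, String location) {
        Map<String, Integer> byLocation = requests.get(priority);
        if (byLocation == null) {
            return 0;
        }
        Integer count = byLocation.get(location);
        return count == null ? 0 : count;
    }
}
```

In this model, adding 2 containers for "node1" meant as node-only and 3 more 
for "node1" meant as relaxable, both at priority 1, leaves one entry of 5 with 
no way to tell them apart. Putting them at different priorities keeps them in 
separate entries.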
I discussed this offline with [~vinodkv] and I would like to suggest an 
extension to the approach. Instead of a flag, how about specifying a time 
interval that tells the RM how long to wait before dropping locality? An 
infinite time interval would be the same as a boolean flag, so this approach 
covers the other one. Additionally, it lets a large map job be more 
conservative about trading locality for lower latency, and a short job more 
aggressive. Currently the value of this interval comes from config and maps to 
the number of scheduling attempts missed by this request. This is done by 
keeping a count of node heartbeats. Given the number of nodes and the 
heartbeat interval, the user-specified time interval can easily be mapped to a 
count that matches the current implementation. So this will be neither a perf 
hit nor a change in logic compared to existing code.
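
As a rough sketch of that mapping: if every node heartbeats once per heartbeat 
interval, the RM sees about numNodes heartbeats per interval, so a delay of T 
corresponds to roughly T * numNodes / heartbeatInterval missed scheduling 
opportunities. The class, method and constant names below are hypothetical 
and not part of YARN, and using Long.MAX_VALUE to stand for an infinite 
interval is an assumption made for illustration.

```java
/** Hypothetical helper; not part of YARN. */
public final class LocalityDelay {
    /** Assumed sentinel meaning "never relax locality" (the boolean-flag case). */
    public static final long INFINITE = Long.MAX_VALUE;

    private LocalityDelay() {}

    /**
     * Converts a locality delay into an approximate count of node heartbeats,
     * assuming each of numNodes nodes heartbeats once every heartbeatIntervalMs.
     */
    public static long toMissedOpportunities(long delayMs, int numNodes,
                                             long heartbeatIntervalMs) {
        if (delayMs < 0 || numNodes <= 0 || heartbeatIntervalMs <= 0) {
            throw new IllegalArgumentException(
                "delay must be >= 0; node count and heartbeat interval must be > 0");
        }
        if (delayMs == INFINITE) {
            return INFINITE;
        }
        // Treat delays whose product would overflow a long as infinite.
        if (delayMs > Long.MAX_VALUE / numNodes) {
            return INFINITE;
        }
        return (delayMs * numNodes) / heartbeatIntervalMs;
    }
}
```

For example, with 100 nodes and a 1000 ms heartbeat interval, a 5000 ms delay 
maps to LocalityDelay.toMissedOpportunities(5000, 100, 1000), which is 500 
heartbeats.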
Another thing to consider is allowing users to say "I want to be scheduled 
only on these racks". Again, I don't think we can mix node-specific and 
rack-specific scheduling at the same priority.
                
> Make it possible to schedule to specific nodes without dropping locality
> ------------------------------------------------------------------------
>
>                 Key: YARN-392
>                 URL: https://issues.apache.org/jira/browse/YARN-392
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>            Assignee: Sandy Ryza
>         Attachments: YARN-392-1.patch, YARN-392.patch
>
>
> Currently it's not possible to specify scheduling requests for specific nodes 
> and nowhere else. The RM automatically relaxes locality to rack and * and 
> assigns non-specified machines to the app.
