Hi everyone

Let me summarize the community discussion about rack awareness so far. First, thanks for all the comments, advice, and challenges! :)

#1. Stick with attributes for rack awareness

For compatibility with existing frameworks, I'm OK with using attributes to convey the rack information, with the goals of setting them automatically, keeping them easy to maintain, and having a well-defined attribute schema. This brings up the question below, where the controversy starts.

#2. Scripts vs programmatic way

Both can be used to set attributes. I've made my arguments in the JIRA ticket and the design doc, so I won't argue further here. But please first take a look at the discussion in MESOS-3366, which proposes resources/attributes discovery.

A module implementing the *slaveAttributesDecorator* hook would work like
a charm here for the static case; updating attributes still needs to be justified separately.
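To make the static case concrete, here is a minimal sketch (Python, purely illustrative; the field names and the helper are hypothetical, not Mesos APIs) of how a discovery step could map LLDP neighbor data to a `rack` attribute:

```python
def rack_attribute(lldp_neighbor):
    """Build a Mesos-style attribute string like 'rack:tor-sw-12' from
    the chassis name of the top-of-rack switch seen via LLDP.

    'lldp_neighbor' is an assumed record shape, e.g. as a discovery
    script might collect from the local LLDP daemon."""
    chassis = lldp_neighbor.get("chassis_name")
    if not chassis:
        return None  # no valid LLDP info yet
    return "rack:%s" % chassis

# Example neighbor record (hypothetical data).
neighbor = {"chassis_name": "tor-sw-12", "port": "Ethernet1/3"}
print(rack_attribute(neighbor))  # rack:tor-sw-12
```

A decorator-style module would compute something like this once at agent startup, which is exactly why the update cases below matter.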

#3. Allow updating attributes

Several cases need to be covered here:

a). Mesos runs inside VMs or containers, where live migration can happen, so the rack information needs to be updated.

b). LLDP packets are broadcast at a vendor-specific interval, typically 10s~30s, and rack information is usually stored in the LLDP daemon to be queried. In the worst cases (a fresh node reboot, or a daemon restart), the Mesos slave would have to wait 10s~30s for valid rack information before registering with the master. Allowing attribute updates would mitigate this problem.
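The waiting behavior in case b) can be sketched as a simple poll-with-deadline loop (Python, illustrative only; `query_lldp` is a hypothetical callable that returns None until the daemon has data):

```python
import time

def wait_for_rack_info(query_lldp, timeout=30.0, poll=1.0,
                       clock=time.monotonic, sleep=time.sleep):
    """Poll the LLDP daemon until it returns rack info or the timeout
    expires. Returns the rack string, or None on timeout."""
    deadline = clock() + timeout
    while True:
        rack = query_lldp()
        if rack is not None:
            return rack
        if clock() >= deadline:
            return None
        sleep(poll)

# Simulated daemon that only has data on the third query.
answers = iter([None, None, "rack:tor-sw-12"])
print(wait_for_rack_info(lambda: next(answers),
                         timeout=5.0, sleep=lambda s: None))
# rack:tor-sw-12
```

With attribute updates allowed, the agent could instead register immediately and fill in the rack attribute once the daemon answers, avoiding this startup stall.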

c). Framework affinity

Framework X prefers to run on the same nodes as another framework Y.
For example, it's desirable for Shark or Spark SQL to reside on the
*worker* nodes where Alluxio (formerly Tachyon) runs to gain a performance boost; see the SPARK-6707 ticket, e.g. {tachyon=true;us-east-1=false}.

If frameworks could advertise desired agent attributes in the resource offer process, awesome!
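A framework-side sketch of the affinity idea, using the {tachyon=true} style constraint from SPARK-6707 (Python, illustrative; the offer/attribute shapes are assumptions, not the Mesos API):

```python
def offer_matches(offer_attributes, required):
    """Check whether an offer's attribute map satisfies a framework's
    required attribute values, e.g. {'tachyon': 'true'}."""
    return all(offer_attributes.get(k) == v for k, v in required.items())

offers = [
    {"id": "o1", "attributes": {"tachyon": "true", "rack": "r1"}},
    {"id": "o2", "attributes": {"tachyon": "false", "rack": "r2"}},
]
wanted = {"tachyon": "true"}
print([o["id"] for o in offers if offer_matches(o["attributes"], wanted)])
# ['o1']
```

The framework accepts only offers from agents co-located with Alluxio; everything else is declined.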


#4. Rearrange agents in a more scalable manner, like per rack basis

Randomly offering agent resources to frameworks does not improve data locality: imagine the likelihood of a framework getting resources from underneath the same rack at a scale of 30000+ nodes. Moreover, the time to randomly shuffle the agents also grows with scale.

How about rearranging the agents on a per-rack basis? A minor change to the way resources are allocated would fix this.
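The per-rack arrangement could look like the following (Python, a minimal illustration of the grouping, not the allocator's actual data structures):

```python
from collections import defaultdict

def group_agents_by_rack(agents):
    """Group agents by their 'rack' attribute; agents without one fall
    into an 'unknown' bucket."""
    racks = defaultdict(list)
    for agent in agents:
        rack = agent.get("attributes", {}).get("rack", "unknown")
        racks[rack].append(agent["id"])
    return dict(racks)

agents = [
    {"id": "agent-1", "attributes": {"rack": "r1"}},
    {"id": "agent-2", "attributes": {"rack": "r2"}},
    {"id": "agent-3", "attributes": {"rack": "r1"}},
]
print(group_agents_by_rack(agents))
# {'r1': ['agent-1', 'agent-3'], 'r2': ['agent-2']}
```

With agents bucketed like this, the allocator could offer a whole rack's worth of resources to one framework at a time, instead of shuffling the flat agent list.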


I might not see the whole picture here, so comments are welcome!


On 2016/6/6 17:17, Du, Fan wrote:
Hi, Mesos folks

I've been thinking about Mesos rack awareness support for a while; it's a common interest for lots of data center applications to provide data locality, fault tolerance, and better task placement. I've created MESOS-5545 to track the story, and here is the initial design doc [1] to support rack awareness in Mesos.

Looking forward to hearing comments from end users and other developers.

Thanks!

[1]:
https://docs.google.com/document/d/1rql_LZSwtQzBPALnk0qCLsmxcT3-zB7X7aJp-H3xxyE/edit?usp=sharing
