Re: Review Request 19383: Implemented the Registry-backed Master.

Ben Mahler Sun, 23 Mar 2014 23:29:27 -0700


> On March 20, 2014, 9:21 p.m., Vinod Kone wrote:
> > src/master/master.hpp, line 421
> > <https://reviews.apache.org/r/19383/diff/1/?file=526626#file526626line421>
> >
> >     Why cache instead of a circular buffer?
> 
> Ben Mahler wrote:
>     What's the issue here?
>     
>     Circular buffer does not support lookup. In the past we've used circular 
> buffer because we've wanted two properties:
>     
>     1. Set semantics.
>     2. LRI semantics.
>     
>     In this case, we want LRU semantics and cache<X, Nothing> provides the 
> desired set semantics.
> 
> Vinod Kone wrote:
>     I see. Maybe we need a cache like thing that holds just keys instead of 
> keys and values. Using a map for a set seems a bit weird. Maybe a TODO?


Sounds good.


> On March 20, 2014, 9:21 p.m., Vinod Kone wrote:
> > src/master/master.cpp, line 2072
> > <https://reviews.apache.org/r/19383/diff/1/?file=526627#file526627line2072>
> >
> >     s/reply/send(from,/ ?
> 
> Ben Mahler wrote:
>     Why change this, the method is using 'reply' elsewhere.
> 
> Vinod Kone wrote:
>     I was confused because you changed reply() to send() in registerSlave(). 
> Was that intentional? If not, please revert that.

Not sure how that got in this patch! Will revert.


> On March 20, 2014, 9:21 p.m., Vinod Kone wrote:
> > src/master/master.hpp, line 410
> > <https://reviews.apache.org/r/19383/diff/1/?file=526626#file526626line410>
> >
> >     s/Registrar/registrar/

Whoops, I dropped this one but I actually fixed it. ;)


> On March 20, 2014, 9:21 p.m., Vinod Kone wrote:
> > src/master/master.cpp, lines 609-610
> > <https://reviews.apache.org/r/19383/diff/1/?file=526627#file526627line609>
> >
> >     But they could still fire if they are already dispatched and are in the 
> > queue right? Would that cause problems?
> 
> Ben Mahler wrote:
>     It's a race of course, but it's likely that the dispatches would be 
> dropped before the new Master started. Also, the timing required for these to 
> fire but be in the queue when we're destructing is pretty precise, do you 
> have any suggestions here?
> 
> Vinod Kone wrote:
>     I suggest to update the comment to reflect what you just said for one :)
>     
>     I'm concerned because I had a similar case when implementing 
> authentication (see a few lines above where i had to do a discard) which 
> caused unexpected issues in subsequent tests that were hard to debug. How 
> about passing something to __recoverTimeout() that is unique to each 
> invocation. MasterInfo perhaps?

This is why I added the timer cancellation (because of what we saw with 
authentication). Added a comment similar to the one you have for the discard.

Are you suggesting that even with the authentication discard you're seeing 
flaky tests? My understanding was that we saw flaky tests, then added 
discard(), and fixed the issue.
We're not passing any unique information to _authenticate at the moment, right?


> On March 20, 2014, 9:21 p.m., Vinod Kone wrote:
> > src/master/master.cpp, lines 3267-3270
> > <https://reviews.apache.org/r/19383/diff/1/?file=526627#file526627line3267>
> >
> >     Why do it this way? Why not do the bulk of this method in 
> > _removeSlave() ?
> 
> Ben Mahler wrote:
>     Have you thought through the implications of that approach?
>     
>     If we keep the slave around in-memory when we decided to remove it, we 
> need to treat this slave in a special manner to keep things consistent for 
> frameworks.
>     
>     We're likely to end up with bugs / consistency problems if we do this. 
> Say we decide to remove the slave but we keep it in-memory. If a status 
> update arrives from this slave before the write completes, we better drop the 
> update or else we're potentially going to lie about a TASK_LOST in 
> _removeSlave.
>     
>     The only way the Registrar can deny a removal operation is if the slave 
> is already removed, so removing it from the in-memory state first is safe. We 
> also don't have to think about all the cases where a slave might send a 
> message between removeSlave and _removeSlave.
> 
> Vinod Kone wrote:
>     I am not convinced this is the right approach. The way I see using a 
> commit log is to persist the intent first and then changing the internal 
> state.
>     
>     For example, the way you did adding a slave seems natural to me. You 
> stored the intent locally (added to activated) and then persisted it in the 
> registrar. After registrar persisted you created the slave structure. It 
> would be nice if the logic is symmetric for removing a slave. I recommend 
> just adding the slave to 'removing', removing it from 'activated' and passing 
> the Slave* to the continuation.

I agree with you but I think it's more important to think in terms of "intent" 
and "effect", rather than "intent" and "internal state" (since we have to write 
the "intent" to the "internal state", it's not clear how they are different). 
The requirement being that we can only induce the effect of an operation after 
it is applied.

The "effect" that we *must* postpone in this case is telling the framework that 
the slave/tasks are LOST.

In your suggestion, if we only remove from "activated", when do we update the 
allocator to be aware of this? Before or after?

1. If it's after, we'll be getting invalid calls to Master::offer from the 
allocator. We probably don't want this.
   Also, frameworks will not have the slave's offer(s) rescinded, and 
consequently they can launch on an invalid offer getting back a response of 
TASK_LOST. Also seems like poor behavior, no?

2. If it's before, we're doing most of the work that I'm currently doing in 
removeSlave(). Except for pulling out status updates and deleting the slave 
object.
   If we split the logic like this, then we need a second-level continuation 
__removeSlave() that only does the SlaveLostMessage sending so that 
__recoverSlaveTimeout can re-use this logic.

Because of the downsides of updating accounting afterwards, it seems like we 
should do that before. Do you agree?

If so then it would be:

removeSlave(Slave*) // Update maps, rescind offers, update allocator, delete 
observer?
_removeSlave(Slave*) // Send *task* status updates, destruct Slave object.
__removeSlave(const SlaveID&) // Send "slave lost" messages.

Which still seems like an unnecessary splitting of the removeSlave logic.


- Ben


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/19383/#review37918
-----------------------------------------------------------


On March 21, 2014, 12:56 a.m., Ben Mahler wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/19383/
> -----------------------------------------------------------
> 
> (Updated March 21, 2014, 12:56 a.m.)
> 
> 
> Review request for mesos, Benjamin Hindman and Vinod Kone.
> 
> 
> Bugs: MESOS-764
>     https://issues.apache.org/jira/browse/MESOS-764
> 
> 
> Repository: mesos-git
> 
> 
> Description
> -------
> 
> This implements the Registry-backed Master, with the following exceptions 
> that will be addressed in follow up changes:
> 
> -Note that the --registry_strict flag is enforced to be false in 
> master/main.cpp.
> -Reconciliation remains unimplemented as before.
> -Improvements can be made to killTask, specifically we should add SlaveID to 
> the message in order to drop fewer requests for unknown slaves.
> -Orthogonally, this does not address MESOS-682.
> 
> I've updated 'deactivated' slaves to be a cache of SlaveIDs rather than UPIDs 
> as this was the intent originally (we were concerned about the unbounded 
> growth of the set, but cache<SlaveID, Nothing> keeps a fixed capacity).
> 
> 
> Diffs
> -----
> 
>   src/master/constants.hpp cdaaad060d4ee777f8b0838b63c0fd031da861ea 
>   src/master/constants.cpp 18548834468243bef8ae090f70363e2b9f571ac5 
>   src/master/master.hpp a37a2a2fa3713b8251eb318dfb45e0793cb344ff 
>   src/master/master.cpp 3b28f72be2016997e30649d2b12ed0e8f1a57dd5 
>   src/messages/messages.proto c26a3d0e69bbbd447c859cf175c139ab8948fde2 
>   src/slave/slave.cpp d8d3e0fa54972201d72b2650ec0ba922a4912d54 
> 
> Diff: https://reviews.apache.org/r/19383/diff/
> 
> 
> Testing
> -------
> 
> This change preserves the previous semantics and so all existing tests pass.
> 
> This is because the Registrar can only operate in a "non-strict" manner.
> 
> An unfortunate effect of this change is that many tests run slower due to the 
> fact that messages are dropped while we're recovering, an alternative 
> approach here would be to re-enqueue *all* incoming messages through 
> recover(). However, this adds queuing delay to each message processed in the 
> Master and the performance implications of this are not well understood for 
> large clusters.
> 
> 
> Thanks,
> 
> Ben Mahler
> 
>

Re: Review Request 19383: Implemented the Registry-backed Master.

Reply via email to