Re: Dedup mesos agent status updates at framework

2018-10-28 Thread Varun Gupta
in terms of de-duplication. Can you > explain how the scheduler's status update processing works? Does it use > explicit acknowledgements and process batches asynchronously? Aurora > example: https://reviews.apache.org/r/33689/ > > On Sun, Oct 28, 2018 at 8:58 PM Varun Gupta > wro

Re: Dedup mesos agent status updates at framework

2018-10-28 Thread Varun Gupta
ow cost, so that's rather surprising. Do you have any data related to > this? > > You can also tune the backoff rate on the agents, if the defaults are too > noisy in your setup. > > Ben > > On Sun, Oct 28, 2018 at 4:51 PM Varun Gupta wrote: > > > >

Re: Dedup mesos agent status updates at framework

2018-10-28 Thread Varun Gupta
> Hi, > > Mesos agent will send status updates with exponential backoff until ack is > received. > > If processing of events at framework and sending ack to Master is running > slow then it builds a back pressure at framework due to duplicate updates > for same status. > > Has someone explored the

Dedup mesos agent status updates at framework

2018-10-23 Thread Varun Gupta
Hi, The Mesos agent sends status updates with exponential backoff until an ack is received. If the framework is slow to process events and send acks to the master, back pressure builds at the framework due to duplicate updates for the same status. Has someone explored the option to dedup s
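
To make the dedup idea in the message above concrete, here is a minimal sketch in Go. It is not taken from the thread: the `StatusUpdate` and `Deduper` types are hypothetical, and the sketch assumes each retried update carries the same UUID as the original, so a repeat can be recognised by comparing against the last UUID seen for that task.

```go
package main

import "fmt"

// StatusUpdate is a hypothetical, simplified stand-in for a task status update.
// Assumption: a retried update carries the same UUID as the original.
type StatusUpdate struct {
	TaskID string
	State  string
	UUID   string
}

// Deduper remembers the UUID of the most recent update seen for each task.
type Deduper struct {
	lastUUID map[string]string
}

// NewDeduper returns an empty Deduper.
func NewDeduper() *Deduper {
	return &Deduper{lastUUID: make(map[string]string)}
}

// IsDuplicate reports whether u repeats the most recent update seen for its
// task. If it does not, u is recorded as the latest update for that task.
func (d *Deduper) IsDuplicate(u StatusUpdate) bool {
	if prev, ok := d.lastUUID[u.TaskID]; ok && prev == u.UUID {
		return true
	}
	d.lastUUID[u.TaskID] = u.UUID
	return false
}

func main() {
	d := NewDeduper()
	updates := []StatusUpdate{
		{TaskID: "task-1", State: "TASK_RUNNING", UUID: "uuid-a"},
		{TaskID: "task-1", State: "TASK_RUNNING", UUID: "uuid-a"}, // agent retry
		{TaskID: "task-1", State: "TASK_FINISHED", UUID: "uuid-b"},
	}
	for _, u := range updates {
		if d.IsDuplicate(u) {
			fmt.Printf("skip duplicate: %s %s\n", u.TaskID, u.State)
			continue
		}
		fmt.Printf("process: %s %s\n", u.TaskID, u.State)
	}
}
```

Skipping the processing of a duplicate is separate from acknowledging it: the agent keeps retrying until an ack arrives, so a real scheduler would still need to send the ack at some point.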

Re: Add hostname or agentid in rescind offers callback

2018-05-06 Thread Varun Gupta
Does the feature request seem reasonable? On Wed, May 2, 2018 at 7:35 PM Varun Gupta wrote: > Implementation is only needed for V1 API. > On Wed, May 2, 2018 at 7:31 PM Varun Gupta wrote: > >> We aggregate all the offers for a host, such that placement engine can >> pac

Re: Add hostname or agentid in rescind offers callback

2018-05-02 Thread Varun Gupta
Implementation is only needed for V1 API. On Wed, May 2, 2018 at 7:31 PM Varun Gupta wrote: > We aggregate all the offers for a host, such that placement engine can > pack multiple tasks that can be launched on this host using aggregated > resources. If offers are unused for that host

Re: Add hostname or agentid in rescind offers callback

2018-05-02 Thread Varun Gupta
> be included / not included (e.g., should we include agent IP too) in such > > calls. > > > > Second, adding this extra information in v1 scheduler API would be > > relatively easy. But adding this to v0 API would be hard. Which API do > you > > need to b

Add hostname or agentid in rescind offers callback

2018-05-02 Thread Varun Gupta
Hi, Currently in our implementation we maintain two maps: Hostname -> []Offers and offerID -> Hostname. The second map is needed because the rescind offers callback only provides the offer ID, and we need the hostname to do a performant lookup in the first map. Is it feasible to add hostname or agentid in rescind offers?
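
The two-map bookkeeping described above can be sketched roughly as follows in Go. This is an illustration only, not the poster's actual implementation: the `Offer` and `OfferIndex` types and their methods are hypothetical names, and a rescind is modelled as a call that receives just the offer ID.

```go
package main

import "fmt"

// Offer is a hypothetical, simplified resource offer.
type Offer struct {
	ID       string
	Hostname string
}

// OfferIndex keeps the two maps from the message:
// hostname -> offers, and offer ID -> hostname.
type OfferIndex struct {
	byHost      map[string][]Offer
	hostByOffer map[string]string
}

// NewOfferIndex returns an empty index.
func NewOfferIndex() *OfferIndex {
	return &OfferIndex{
		byHost:      make(map[string][]Offer),
		hostByOffer: make(map[string]string),
	}
}

// Add records an offer in both maps.
func (x *OfferIndex) Add(o Offer) {
	x.byHost[o.Hostname] = append(x.byHost[o.Hostname], o)
	x.hostByOffer[o.ID] = o.Hostname
}

// Rescind removes an offer given only its ID. The reverse map supplies the
// hostname, so only that host's offers are scanned. It returns false if the
// offer ID is unknown.
func (x *OfferIndex) Rescind(offerID string) bool {
	host, ok := x.hostByOffer[offerID]
	if !ok {
		return false
	}
	delete(x.hostByOffer, offerID)
	offers := x.byHost[host]
	for i, o := range offers {
		if o.ID == offerID {
			x.byHost[host] = append(offers[:i], offers[i+1:]...)
			break
		}
	}
	if len(x.byHost[host]) == 0 {
		delete(x.byHost, host)
	}
	return true
}

func main() {
	idx := NewOfferIndex()
	idx.Add(Offer{ID: "offer-1", Hostname: "host-a"})
	idx.Add(Offer{ID: "offer-2", Hostname: "host-a"})
	idx.Rescind("offer-1")
	fmt.Println(len(idx.byHost["host-a"]))
}
```

If the rescind callback carried the hostname or agent ID, as the message requests, the `hostByOffer` map could be dropped and `Rescind` could go straight to the per-host list.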

Re: Question on status update retry in agent

2018-04-16 Thread Varun Gupta
d the status updates), so it continues to retry TASK_RUNNING > > infinitely. > > > > The agent only retries so long as it does not get an acknowledgement, is > > the scheduler acknowledging the duplicates updates or ignoring them? > > > > On Mon, Apr 9, 2018 at 12:10 PM,

Re: Question on status update retry in agent

2018-04-09 Thread Varun Gupta
Hi, We are running into an issue with the slave status update manager. Below is the behavior I am seeing. Our use case is that we run a stateful container (a Cassandra process), where the executor polls the JMX port at a 60-second interval to get the Cassandra state and sends the state to agent -> master -> framework. *RU

Re: Disallowing comma in attribute values

2017-10-18 Thread Varun Gupta
+1 On Wed, Oct 18, 2017 at 12:39 AM, Vinod Kone wrote: > Hi folks, > > I would like to propose that we enforce the character set used for > attribute values as per the documentation. > > Currently, the documentation > > states