Puneet Kumar created MESOS-10143:
------------------------------------
Summary: Outstanding Offers accumulating
Key: MESOS-10143
URL: https://issues.apache.org/jira/browse/MESOS-10143
Project: Mesos
Issue Type: Bug
Components: master, scheduler driver
Affects Versions: 1.7.0
Environment: Mesos Version 1.7.0
JDK 8.0
Reporter: Puneet Kumar
We manage an Apache Mesos cluster version 1.7.0. We have written a framework in
Java that schedules tasks to Mesos master at a rate of 300 TPS. Everything
works fine for almost 24 hours but then outstanding offers accumulate &
saturate within 15 minutes. Outstanding aren't reclaimed by Mesos master. We
observe "RescindOffer" messages in verbose (GLOG v=3) framework logs but
outstanding offers don't reduce. We have to restart the scheduler to reset
outstanding offers to zero so that framework can resume scheduling new tasks to
the cluster.
Any suggestions to debug this issue are welcome.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)