Re: statusUpdate() duplicate messages?

2015-11-18 Thread Vinod Kone
Look for UPDATE and ACKNOWLEDGE sections in https://github.com/apache/mesos/blob/master/docs/scheduler-http-api.md for semantics. Note that this doc for schedulers using the new HTTP API, but the semantics around updates are the same. On Wed, Nov 18, 2015 at 4:41 AM, James Vanns

Re: resolving hosts with mesos-dns not working with "/" in the appid

2015-11-17 Thread Vinod Kone
I think this is a question better suited for the mesos-dns or mesosphere mailing list. On Tue, Nov 17, 2015 at 11:40 AM, Rodrick Brown wrote: > Is it possible to resolve app-ids with / in them when using mesos-dns? > > > I have apps defined like the following: > >

Re: Welcome Kapil as Mesos committer and PMC member!

2015-11-05 Thread Vinod Kone
welcome kapil! On Thu, Nov 5, 2015 at 6:49 AM, wrote: > Congrats Dr. Arya! > > > On Nov 5, 2015, at 02:02, Till Toenshoff wrote: > > > > I'm happy to announce that Kapil Arya has been voted a Mesos committer > and PMC member! > > > > Welcome Kapil, and

Re: Odd Scenerio with Mesos.

2015-10-21 Thread Vinod Kone
Logs please. On Wed, Oct 21, 2015 at 12:44 PM, John Omernik wrote: > I am running 0.24. > > I am running some tasks in marathon, and when they hit an OOM condition a > task is killed that is expected. Than I get a bunch of errors related to > "Failed to read

Re: Detect Slave Memory Resources

2015-09-29 Thread Vinod Kone
start mesos slave with "--resources" explicitly set to what you want it to offer. On Tue, Sep 29, 2015 at 5:12 PM, Rafael Capucho wrote: > I solve the Warning by disabling swap memory, like that: > > --memory-swap=-1 \ > --memory="2G" \ > > But in MESOS UI I still see

Re: [VOTE] Release Apache Mesos 0.21.2 (rc1)

2015-09-25 Thread Vinod Kone
+1 (binding) Tested on CI for CentOS5/6. On Thu, Sep 24, 2015 at 6:12 PM, Adam Bordelon wrote: > +1 (binding) Tested on CI for CentOS7 and Ubuntu 14.04. > > On Thu, Sep 24, 2015 at 5:44 PM, Adam Bordelon wrote: > >> Hi friends, >> >> Here's a candidate

Re: Web ui offers tab blank

2015-09-25 Thread Vinod Kone
What do the master and scheduler logs say? On Fri, Sep 25, 2015 at 5:02 AM, F21 wrote: > I am running mesos 0.24 on CoreOS. I also have Marathon 0.10.1 running. > > I am trying to run the HDFS framework and notice that it's complaining > that there isn't enough resources

Re: [VOTE] Release Apache Mesos 0.23.1 (rc1)

2015-09-23 Thread Vinod Kone
+1 (binding) Tested on CI for centos5/6 @vinodkone > On Sep 23, 2015, at 3:45 PM, Adam Bordelon wrote: > > +1 (binding) Tested on CI for CentOS7, Fedora22, and Ubuntu 14.04. > >> On Wed, Sep 23, 2015 at 10:25 AM, haosdent wrote: >> >> +1 test on

Re: Official RPMs

2015-09-21 Thread Vinod Kone
+Jake Farrell The mesos project doesn't publish platform dependent artifacts. We currently only publish platform independent artifacts like JAR (to apache maven) and interface EGG (to PyPI). Recently we made the decision for the

Re: mesos 0.24 released?

2015-09-21 Thread Vinod Kone
Published the blogpost for 24.0. On Fri, Sep 18, 2015 at 9:42 PM, CCAAT wrote: > > Oh, > > Here is a link that explains the Variable meanings for the packages > downloaded by gentoo's package manager, portage:: > > https://devmanual.gentoo.org/ebuild-writing/variables/ >

Re: [VOTE] Release Apache Mesos 0.24.1 (rc1)

2015-09-21 Thread Vinod Kone
+1 (binding) Tested on CI for CentOS5 and CentOS6. On Fri, Sep 18, 2015 at 6:21 PM, Adam Bordelon wrote: > Hi friends, > > Please vote on releasing the following candidate as Apache Mesos 0.24.1. > > 0.24.1 includes the following: > >

Re: mesos 0.24 released?

2015-09-18 Thread Vinod Kone
On Fri, Sep 18, 2015 at 11:31 AM, craig w wrote: > Gotcha will there be a blog post / release announcement on the website > soon? > yea i'll get to it. sorry for the delay.

Re: how to overcome orphaned tasks after master failure

2015-09-14 Thread Vinod Kone
The framework is expected to re-register with a failed over master with the same framework id as before. If you are using the scheduler driver, that should happen automatically. On Mon, Sep 14, 2015 at 6:38 AM, Mike Barborak wrote: > Hi, > > > > I’m trying to understand a

Re: how to overcome orphaned tasks after master failure

2015-09-14 Thread Vinod Kone
On Mon, Sep 14, 2015 at 12:40 PM, Mike Barborak wrote: > Sorry for my ignorance, but what is the “scheduler driver?” My framework > is based on the Java example: > > Some details about the driver should be here:

Re: Help interpreting output from running java test-framework example

2015-09-09 Thread Vinod Kone
sounds like it. can you see what the slave/agent and executor logs say? On Tue, Sep 8, 2015 at 11:46 AM, Stephen Boesch wrote: > > I am in the process of learning how to run a mesos cluster with the intent > for it to be the resource manager for Spark. As a small step in

[RESULT][VOTE] Release Apache Mesos 0.24.0 (rc2)

2015-09-04 Thread Vinod Kone
, at 11:47 PM, Niklas Nielsen <nik...@mesosphere.io> wrote: > > +1 - tested on our CI > > On Tuesday, September 1, 2015, Vinod Kone <vinodk...@apache.org> wrote: > >> Hi all, >> >> >> Please vo

Re: mesos-master resource offer details

2015-09-02 Thread Vinod Kone
Sounds like you should bump up the logging level of marathon. Did you ask in the marathon mailing list? On Wed, Sep 2, 2015 at 10:02 AM, Haripriya Ayyalasomayajula < aharipriy...@gmail.com> wrote: > Alex, > > The problem I am facing is that there are no allocations made. Mesos > -master gives 5

API client libraries

2015-09-02 Thread Vinod Kone
Hi folks, Now that the v1 scheduler HTTP API (beta) is on the verge of being released, I wanted to open up the discussion about client libraries for the API. Mainly around support and home for the libs. One idea is that, going forward, the only supported client library would be C++ library which

Re: Apache Mesos Community Sync

2015-09-02 Thread Vinod Kone
We'll have the next community sync tomorrow (Sept 3rd) at 3 PM PST. Please add items to agenda <https://docs.google.com/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA/edit#heading=h.u1x3j7f3uixf> . On Wed, Aug 5, 2015 at 4:12 PM, Vinod Kone <vinodk...@gmail.com> wrote:

Re: API client libraries

2015-09-02 Thread Vinod Kone
On Wed, Sep 2, 2015 at 11:49 AM, Vinod Kone <vinodk...@apache.org> wrote: > --> Might be a step backward because we would be officially dropping > support for Java and Python. This is probably a good thing? > s/officially dropping support/dropping official support/

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-09-01 Thread Vinod Kone
events a lot easier >> for me. >> >> Thanks, >> Dario >> >> On Sep 1, 2015, at 8:42 AM, dario.re...@me.com wrote: >> >> Hi Vinod, >> >> thanks for the explanation, I got it now. >> >> Thanks, >> Dario >> >> On 31.0

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-09-01 Thread Vinod Kone
s- we agreed were non-blocking: I'll follow up with him too) > > *Marco Massenzio* > > *Distributed Systems Engineerhttp://codetrips.com <http://codetrips.com>* > > On Tue, Sep 1, 2015 at 10:49 AM, Vinod Kone <vinodk...@apache.org> wrote: > >> Thanks for the nudg

Re: Prepping for next release

2015-09-01 Thread Vinod Kone
JDK7 when uploading. Not sure how easy it is to adapt our Mesos build tool chain for this. Anyone has expertise in this area? Thoughts? On Tue, Aug 18, 2015 at 3:14 PM, Vinod Kone <vinodk...@apache.org> wrote: > I re-encrypted the maven passwords and that seemed to have done the trick.

[VOTE] Release Apache Mesos 0.24.0 (rc2)

2015-09-01 Thread Vinod Kone
Hi all, Please vote on releasing the following candidate as Apache Mesos 0.24.0. 0.24.0 includes the following: Experimental support for v1 scheduler HTTP API! This release also wraps up support for fetcher.

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-31 Thread Vinod Kone
soon as I have a chance. > > Vinod, > > what exactly is the benefit of using recordio here? Doesn't it make the > content-type somewhat wrong? If I send 'Accept: application/json' and > receive 'Content-Type: application/json', I actually expect to receive only > json in the

Re: Marathon split brain situation

2015-08-28 Thread Vinod Kone
Sounds like a marathon issue. You should ask in marathon mailing list. @vinodkone On Aug 28, 2015, at 7:05 AM, Rogier Dikkes rogier.dik...@surfsara.nl wrote: Hello all, I am running test cluster with Mesos and Marathon in a cluster of 20 compute nodes and 2 head nodes running vm's that

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-28 Thread Vinod Kone
I'm happy to add the \n after the event (note it's different from chunk) if that makes CURL play nicer. I'm not sure about the \r part though? Is that a nice to have or does it have some other benefit? The design doc is not set in the stone since this has not been released yet. So definitely want

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-27 Thread Vinod Kone
with: ../mesos/configure --prefix=/home/vagrant/releases/0.24.0/ --disable-python On 26 August 2015 at 17:00, Khanduja, Vaibhav vaibhav.khand...@emc.com wrote: +1 On Aug 26, 2015, at 4:43 PM, Vinod Kone vinodk...@gmail.com wrote: Pinging the thread for more (binding) votes. Hopefully people have

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-26 Thread Vinod Kone
-ROOT tests pass on: CentOS 7 (VirtualBox VM) Known issue (MESOS-3050) for ROOT tests on CentOS 7, non-blocker. Thanks, *Marco Massenzio* *Distributed Systems Engineerhttp://codetrips.com http://codetrips.com* On Tue, Aug 18, 2015 at 3:26 PM, Vinod Kone vinodk...@apache.org wrote: 0.24.0

Re: Allocation algorithm

2015-08-25 Thread Vinod Kone
The hierarchical allocator looks at one agent's resource at a time. For each agent, it runs DRF to figure out the candidate framework. More details here: https://github.com/apache/mesos/blob/master/src/master/allocator/mesos/hierarchical.hpp#L935 Regarding starvation you observed, yes that is

[VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-18 Thread Vinod Kone
0.24.0 includes the following: Experimental support for v1 scheduler HTTP API! This release also wraps up support for fetcher. The CHANGELOG for the release is available at:

Re: resources not offered to framework

2015-08-18 Thread Vinod Kone
understanding around not getting resource offers? Sorry to be so non-specific – I’m a few days into this and starting to grasp. Thanks, Mike From: Vinod Kone [mailto:vinodk...@gmail.com] Sent: Friday, August 14, 2015 11:37 AM To: user@mesos.apache.org Subject: Re: resources

Re: MesosCon Seattle attendee introduction thread

2015-08-18 Thread Vinod Kone
Maybe we periscope the talks unofficially?  I'm Vinod Kone, Twitter engineer and Apache Mesos committer. If you are interested in learning about the new Mesos API and the road to 1.0, don't miss our talk! Can't wait to meet you all! @vinodkone On Aug 18, 2015, at 8:28 AM, Dave Lester d

Re: resources not offered to framework

2015-08-14 Thread Vinod Kone
Sent: Tuesday, July 14, 2015 5:33 PM To: user@mesos.apache.org Subject: RE: resources not offered to framework I didn’t understood the difference between roles and attributes. That sounds like what I am looking for. Thanks for your help. -Mike From: Vinod Kone [mailto:vinodk

Re: Static Resource Reservation on mesos slave

2015-08-11 Thread Vinod Kone
I wouldn't recommend using resources flag to achieve what you want because what happens if a dev task lands first on a box and then a production task wants to launch? Does the production task get starved? Does the framework pre-empt the dev task? I think what you are looking for here is

Re: Mesos and Docker Slave Problem

2015-08-07 Thread Vinod Kone
On Fri, Aug 7, 2015 at 11:50 AM, Nastooh Avessta (navesta) nave...@cisco.com wrote: I0807 17:56:11.772568 577 slave.cpp:4208] Launching executor hello-gpu-sleep.9aef12e0-3d2d-11e5-935a-fe1614f46ae0 of framework 20150624-232916-16777343-5050-1628- in work directory

Re: libmesos for Windows(client)

2015-08-07 Thread Vinod Kone
Not yet, but there is work in progress https://issues.apache.org/jira/browse/MESOS-3094 On Fri, Aug 7, 2015 at 3:08 PM, Priyank Narvekar priyanknarve...@yahoo.com wrote: Hi, Is there any way to compile a mesos client library for windows - libmesos.dll Trying to connect to mesos to run

Re: Threading in (Java) Scheduler

2015-08-05 Thread Vinod Kone
That should be fine. On Wed, Aug 5, 2015 at 9:18 PM, Ashwin Jayaprakash ashwin.jayaprak...@gmail.com wrote: Hello, Is it safe for a Scheduler implementation to spawn a new thread and periodically call SchedulerDriver.sendFrameworkMessage(..)? I know that MesosSchedulerDriver JavaDoc does

Re: Apache Mesos Community Sync

2015-08-05 Thread Vinod Kone
, or join remotely via hangout: https://plus.google.com/hangouts/_/mesosphere.io/mesos-developer Thanks, -Adam- On Thu, May 28, 2015 at 10:08 AM, Vinod Kone vinodk...@gmail.com wrote: Cool. Here's the agenda doc https://docs.google.com/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA

Re: Get List of Active Slaves

2015-08-04 Thread Vinod Kone
Is that the leading master? On Tue, Aug 4, 2015 at 3:09 PM, Nastooh Avessta (navesta) nave...@cisco.com wrote: Hi Trying to get the list of active slaves, via cli, e.g. curl http://10.4.50.80:5050/master/slaves | python -m json.tool and am not getting the expected results. The returned

Re: Get List of Active Slaves

2015-08-04 Thread Vinod Kone
/subscribe/?sid=000478326 - Unsubscribe http://www.cisco.com/offer/unsubscribe/?sid=000478327 – Privacy http://www.cisco.com/web/siteassets/legal/privacy.html* *From:* Vinod Kone [mailto:vinodk...@gmail.com] *Sent:* Tuesday, August 04, 2015 3:19 PM *To:* user@mesos.apache.org *Subject:* Re: Get List

Design doc for HTTP API versioning

2015-07-31 Thread Vinod Kone
Hi folks, As part of our effort to introduce a new HTTP API https://issues.apache.org/jira/browse/MESOS-2288 for Mesos, we have also started to formulate a plan for how we want to version our API. The design doc for the versioning is here

Re: Mesos works fine, but problems with Marathon console.

2015-07-30 Thread Vinod Kone
sounds like a marathon issue. did you try asking in the marathon mailing list? On Thu, Jul 30, 2015 at 3:28 PM, Haripriya Ayyalasomayajula aharipriy...@gmail.com wrote: Hi all, I'm a marathon newbie. I'm running Marathon with Mesos 0.23.0. I followed the steps documented in

Re: Custom executor

2015-07-29 Thread Vinod Kone
To clarify, if you didn't already know, your custom executor should implement the Mesos executor interface. On Wed, Jul 29, 2015 at 10:05 AM, Ondrej Smola ondrej.sm...@gmail.com wrote: As Connor already mentioned you can use URL for your artifact/app - there is whole REST API - artifact store

Re: Problems connecting with Mesos Master

2015-07-28 Thread Vinod Kone
did you set LIBPROCESS_IP env variable as the warning suggested? On Tue, Jul 28, 2015 at 11:16 AM, Haripriya Ayyalasomayajula aharipriy...@gmail.com wrote: Hi all, I am trying to use Spark 1.4.1 with Mesos 0.23.0. When I try to start my spark-shell, it gives me the following warning :

Re: Problems connecting with Mesos Master

2015-07-28 Thread Vinod Kone
LIBPROCESS_IP is the IP address that you want the scheduler (driver) to bind to. It has nothing to do with the ZooKeeper address. In other words, do export LIBPROCESS_IP=scheduler_ip_address On Tue, Jul 28, 2015 at 2:00 PM, Haripriya Ayyalasomayajula aharipriy...@gmail.com wrote: I am

Re: Problems connecting with Mesos Master

2015-07-28 Thread Vinod Kone
] When I do a ps -ef | grep mesos on both my master and slave nodes, it works fine. And, I am also able to ping both the nodes from each other - they are reachable to one another. On Tue, Jul 28, 2015 at 2:18 PM, Vinod Kone vinodk...@gmail.com wrote: LIBPROCESS_IP is the IP address

Re: Shedding some light on oversubscription

2015-07-20 Thread Vinod Kone
On Mon, Jul 20, 2015 at 4:18 AM, Brian Candler b.cand...@pobox.com wrote: So in summary: is 0.23.0 laying the foundations for all this, but with more work required down the line? And does that work include changes to frameworks as well as the core of mesos? That is more or less correct. The

Re: [VOTE] Release Apache Mesos 0.23.0 (rc4)

2015-07-17 Thread Vinod Kone
+1 (binding) Successfully built RPMs for CentOS5 and CentOS6 with network isolator. On Fri, Jul 17, 2015 at 4:56 PM, Khanduja, Vaibhav vaibhav.khand...@emc.com wrote: +1 Sent from my iPhone. Please excuse the typos and brevity of this message. On Jul 17, 2015, at 4:43 PM, Adam Bordelon

Re: Mesos Slave Failover time

2015-07-17 Thread Vinod Kone
It's not configurable yet, but will be in the upcoming 0.23.0 release. On Fri, Jul 17, 2015 at 3:46 PM, Nastooh Avessta (navesta) nave...@cisco.com wrote: Hi Trying to adjust the current failover time to below 10 seconds and don’t seem to be able to find the right set of parameters.

Re: Marathon can no longer deploy any apps after a failover

2015-07-16 Thread Vinod Kone
Sounds like a marathon issue. Mind asking in marathon's mailing list? On Thu, Jul 16, 2015 at 8:02 AM, Nikolay Borodachev nbo...@adobe.com wrote: Maciej, I had a similar problem but it got solved by setting LIBPROCESS_IP environment variable to the host IP address for the Marathon

Re: [VOTE] Release Apache Mesos 0.23.0 (rc3)

2015-07-16 Thread Vinod Kone
Found a bug in HTTP API related code: MESOS-3055 https://issues.apache.org/jira/browse/MESOS-3055 If we don't fix this in 0.23.0, we cannot expect the 0.24.0 scheduler driver (that will send Calls) to properly subscribe with a 0.23.0 master. I could add a work around in the driver to only send

Re: [VOTE] Release Apache Mesos 0.23.0 (rc3)

2015-07-16 Thread Vinod Kone
think this is not a blocker, as it only affects the way we test the cgroup on CentOS 7.x due to a CentOS bug and doesn't actually impact Mesos normal operations. My vote is +1 as well. Tim On Thu, Jul 16, 2015 at 12:10 PM, Vinod Kone vinodk...@gmail.com wrote: Found a bug in HTTP

Re: resources not offered to framework

2015-07-14 Thread Vinod Kone
On Tue, Jul 14, 2015 at 4:36 AM, Mike B mi...@mindlakes.com wrote: I could see the master processing ACCEPT calls for offers and I could see the resources associated with the new slave being recovered because none of the frameworks they were offered to wanted them. What I never saw was these

Re: Cleaning out old mesos-slave sandbox directories

2015-07-09 Thread Vinod Kone
Developer // DueDil (+44) 7525940046 25 Christopher Street, London, EC2A 2BS On Wed, Jul 8, 2015 at 7:24 PM, Vinod Kone vinodk...@gmail.com wrote: On Wed, Jul 8, 2015 at 11:20 AM, Tom Arnfeld t...@duedil.com wrote: Do you know if the mesos-slave will re-schedule something for GC

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-09 Thread Vinod Kone
As a tangent, you can speed up the build by doing make -j#threads check. On Thu, Jul 9, 2015 at 1:35 PM, Jeff Schroeder jeffschroe...@computer.org wrote: I'm unable to replicate the same failure on another up to date RHEL 7.1 machine for some strange reason. Even blowing away the checkout,

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-09 Thread Vinod Kone
a 'test compilation' step (which can be run in parallel and speeds up) distinct from a 'run tests' step (which must run sequentially). *Marco Massenzio* *Distributed Systems Engineer* On Thu, Jul 9, 2015 at 2:28 PM, Vinod Kone vinodk...@gmail.com wrote: As a tangent, you can speed up

Re: Cleaning out old mesos-slave sandbox directories

2015-07-08 Thread Vinod Kone
Senior Developer // DueDil On Wed, Jul 8, 2015 at 7:00 PM, Vinod Kone vinodk...@gmail.com wrote: On Wed, Jul 8, 2015 at 10:54 AM, Tom Arnfeld t...@duedil.com wrote: When this happens the old slave directories appear not to be tracked by the mesos GC process, and stay around indefinitely

Re: Cleaning out old mesos-slave sandbox directories

2015-07-08 Thread Vinod Kone
On Wed, Jul 8, 2015 at 11:20 AM, Tom Arnfeld t...@duedil.com wrote: Do you know if the mesos-slave will re-schedule something for GC if it fails deletion? No it doesn't.

Re: Cleaning out old mesos-slave sandbox directories

2015-07-08 Thread Vinod Kone
On Wed, Jul 8, 2015 at 10:54 AM, Tom Arnfeld t...@duedil.com wrote: When this happens the old slave directories appear not to be tracked by the mesos GC process, and stay around indefinitely. Over time if enough full slave restarts happen (say, due to reconfiguration) the disks can be

Re: Resource Allocation Question

2015-07-07 Thread Vinod Kone
The DRF algorithm works by looking at one slave's resource at a time. If a slave's resource is filtered by a framework (due to declining), that framework will not be considered a candidate for DRF for that resource. So in your example, if Framework2 rejects, Framework1 should get the offer. On

Re: Can Framework accept partial offers

2015-07-06 Thread Vinod Kone
Mesos doesn't currently support the notion of requesting resources. Resources are offered by Mesos based on a fair sharing algorithm (DRF) and it is up to the allocator to accept (partial) resources or decline. On Mon, Jul 6, 2015 at 5:00 PM, Ying Ji jiyin...@gmail.com wrote: Thanks for quick

Re: Mesos Slave Port Change Fails Recovery

2015-07-03 Thread Vinod Kone
On Thu, Jul 2, 2015 at 8:59 PM, Vinod Kone vinodk...@gmail.com wrote: It is surprising that the slave didn't bail out during the initial phase of recovery when the port changed. I'm assuming you enabled checkpointing in 0.20.0 and that you didn't wipe the meta data directory or anything when

Re: Mesos Slave Port Change Fails Recovery

2015-07-02 Thread Vinod Kone
configuration? Thanks, Philippe On Thu, Jul 2, 2015 at 5:54 PM, Vinod Kone vinodk...@gmail.com wrote: Can you paste some logs? On Thu, Jul 2, 2015 at 2:23 PM, Philippe Laflamme phili...@hopper.com wrote: Ok, that's reasonable, but I'm not sure why it would successfully re-register

Re: Mesos Slave Port Change Fails Recovery

2015-07-02 Thread Vinod Kone
For slave recovery to work, it is expected to not change its config. On Thu, Jul 2, 2015 at 2:10 PM, Philippe Laflamme phili...@hopper.com wrote: Hi, I'm trying to roll out an upgrade from 0.20.0 to 0.21.0 with slaves configured with checkpointing and with reconnect recovery. I was

Re: When do executors shutdown?

2015-06-30 Thread Vinod Kone
To clarify Alex's response. An executor is not shutdown if it has no running tasks. It is only shutdown when the framework asks it to (or the framework itself shuts down).

Re: Understanding Slave Recovery Timeouts

2015-06-19 Thread Vinod Kone
*If* the 75 seconds is exceeded but we're within the recovery_timeout, the slave *should* register with a new slave ID. The slave daemon (with the new slave ID) reconnects to the old executors and updates them to use the new slave ID. This is not true. 'recovery_timeout' was added to make

Re: Executor Resource Requirements

2015-06-17 Thread Vinod Kone
On Wed, Jun 17, 2015 at 12:19 PM, Christopher Ketchum cketc...@ucsc.edu wrote: My question is, is there an easy way to know if there is an executor currently running on a slave? It seems like we want to treat offers differently depending on wether the executor is already taken into account,

Re: Setting Rate of Resource Offers

2015-06-17 Thread Vinod Kone
Can you paste the master logs for when the task is finished and the next offer is sent? On Wed, Jun 17, 2015 at 9:11 AM, Christopher Ketchum cketc...@ucsc.edu wrote: Hi everyone, Thanks for the responses. To clarify, I’m only running one framework with a single slave for testing purposes,

Re: Changing resources available to the slave

2015-06-16 Thread Vinod Kone
Not yet. There is a design doc to add that support. See https://issues.apache.org/jira/browse/MESOS-1739 for details. On Tue, Jun 16, 2015 at 2:02 PM, Badhri Sampathkumar bad...@gmail.com wrote: Hi, I'm planning to running mesos slaves inside VMs. In my case, I also hot-add resources (cpu,

Re: Changing resources available to the slave

2015-06-16 Thread Vinod Kone
On Tue, Jun 16, 2015 at 4:42 PM, Badhri Sampathkumar bad...@gmail.com wrote: Somewhat related question. Is there a separate JIRA item for making external containerizers report resources they're aware of? Or will that also be addressed as part of this change? I don't think that's addressed as

Re: Debugging framework registration from inside docker

2015-06-11 Thread Vinod Kone
On Thu, Jun 11, 2015 at 4:00 AM, James Vanns jvanns@gmail.com wrote: I think I can conclude then that this just won't work; one cannot run a framework as a docker container using bridged networking. This is because a POST to the MM that libprocess does on your framework's behalf, includes

Re: Mesos Security Recommendations

2015-06-04 Thread Vinod Kone
On Thu, Jun 4, 2015 at 5:12 PM, John Sirois john.sir...@gmail.com wrote: Its not a design doc, but the issue breakdown spells out much: https://issues.apache.org/jira/browse/MESOS-910 Joris will be sending out a doc soon (O(week))!

Re: Cluster autoscaling in Spark+Mesos ?

2015-06-04 Thread Vinod Kone
Hey Dmitry. At the current time there is no built-in support for Mesos to autoscale nodes in the cluster. I've heard people (Netflix?) do it out of band on EC2. On Thu, Jun 4, 2015 at 9:08 AM, Dmitry Goldenberg dgoldenberg...@gmail.com wrote: A Mesos noob here. Could someone point me at the doc

Re: Slave node is not done when disconnected from cluster

2015-06-04 Thread Vinod Kone
Can you share the screenshots? Master should remove the slave after 75s.

Re: Apache Mesos Community Sync

2015-05-28 Thread Vinod Kone
it was always the first Thursday of the month. On Thu, May 28, 2015 at 9:33 AM, Vinod Kone vinodk...@gmail.com wrote: Do we have community sync today or next week? I'm a bit confused. @vinodkone On Apr 1, 2015, at 3:18 AM, Adam Bordelon a...@mesosphere.io wrote: Reminder: We're

Re: Apache Mesos Community Sync

2015-05-28 Thread Vinod Kone
/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA/edit?usp=sharing To Join: follow the BlueJeans instructions from the recurring meeting invite at the start of this thread. On Fri, Mar 6, 2015 at 11:11 AM, Vinod Kone vinodk...@apache.org wrote: Hi folks, We are planning to do monthly

Re: make[3]: *** [check-local] Aborted (core dumped) in make test

2015-05-15 Thread Vinod Kone
That's an error from our JSON parsing library, picosjson. I'm surprised that our metrics JSON output is invalid according to picojson. Is this error repeatable? You can test with: make check GTEST_FILTER=MasterAuthorizationTest.SlaveRemoved GTEST_REPEAT=1000 GTEST_BREAK_ON_FAILURE=1 On Fri, May

Re: Apache Mesos Community Sync

2015-05-07 Thread Vinod Kone
, 2015 at 11:11 AM, Vinod Kone vinodk...@apache.org wrote: Hi folks, We are planning to do monthly Mesos community meetings. Tentatively these are scheduled to occur on 1st Thursday of every month at 3 PM PST. See below for details to join the meeting remotely. This is a forum to ask

Re: Design doc for Mesos HTTP API

2015-05-01 Thread Vinod Kone
On Fri, May 1, 2015 at 2:23 AM, Tom Arnfeld t...@duedil.com wrote: Q: Could you explain in a little detail why the decision was made to use a single HTTP endpoint rather than something like */event* (for the stream) and */call* for making calls? It seems a little strange / contrived to me

Re: Design doc for Mesos HTTP API

2015-04-30 Thread Vinod Kone
29, 2015 at 3:27 PM, Vinod Kone vinodk...@gmail.com wrote: I would like to take this opportunity to move the doc from Twitter domain so that I dont have to manually grant access to each new viewer. I'll post the new doc link tomorrow. @vinodkone On Apr 29, 2015, at 8:54 AM, Isabel Jimenez

Re: Design doc for Mesos HTTP API

2015-04-29 Thread Vinod Kone
, Vinod Kone vinodk...@apache.org wrote: It is an artifact of the doc being hosted on twitter.com google apps domain :/ Gave you perms. I'll grant perms to anyone who requests it. On Mon, Mar 9, 2015 at 11:43 AM, James DeFelice james.defel...@gmail.com wrote: Google drive says that I need

Re: Oversubscription design doc

2015-04-22 Thread Vinod Kone
Broke down the epic https://issues.apache.org/jira/browse/MESOS-354 into tasks per earlier discussions. Feel free to edit / break down further. On Tue, Apr 14, 2015 at 11:22 AM, Joris Van Remoortere jo...@mesosphere.io wrote: We have a dedicated IRC channel for this: mesos-oversubscription.

Re: A question about left over rejected resources

2015-04-06 Thread Vinod Kone
To clarify David's answer, you should only get 16, 8 offer until the filter on 0, 8 is active. Once the filter expires (or you call reviveOffers), Mesos will consolidate those resources and send a 16, 16 offer. You don't need to restart the master for the aggregation. On Mon, Apr 6, 2015 at 10:46

Re: StratOS: A Big Data platform for scientific computing

2015-03-10 Thread Vinod Kone
Sounds interesting! Thanks for letting us know. On Tue, Mar 10, 2015 at 2:14 PM, Nathaniel Stickley idi...@gmail.com wrote: Hello Mesos users, I am not sure that this is the best place for this announcement, but I thought it would be worth a try... The Multidisciplinary Image Processing

Re: Apache Mesos Community Sync

2015-03-06 Thread Vinod Kone
is welcome to join. Feel free to add items to the agenda for the next meeting here https://docs.google.com/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA/edit?usp=sharing . Cheers, On Thu, Mar 5, 2015 at 11:23 AM, Vinod Kone via Blue Jeans Network inv...@bluejeans.com wrote: [image

Re: Mesos Community Meetings

2015-03-05 Thread Vinod Kone
: If BlueJeans *doesn't* work out we will use Google Hangout ( https://plus.google.com/hangouts/_/twitter.com/mesos-sync) instead. On Tue, Jan 20, 2015 at 4:10 PM, Vinod Kone vinodk...@gmail.com wrote: Thanks for the interest. The next meeting will be on 5th February, 3-5 pm PST. The hangout link

Re: Who is the user in Mesos Authorization ACL definition?

2015-03-05 Thread Vinod Kone
I'm not very familiar with Marathon, but the 'user' in the 'run_tasks' ACL is the unix user that the executor/run process runs as. This user is set by frameworks by setting 'CommandInfo.user'. On Thu, Mar 5, 2015 at 9:06 AM, Sivaram Kannan sivara...@gmail.com wrote: Hi, The user that is

Re: Mesos cluster auto scaling slaves

2015-02-27 Thread Vinod Kone
Jenkins framework scales *Jenkins* slaves which are just Mesos tasks. It has nothing to do with scaling Mesos slaves. On Fri, Feb 27, 2015 at 1:53 PM, Kenneth Su su.ke...@gmail.com wrote: Thanks, Andrew! I will search for that and good to know Jenkins Mesos framework also does that work.

Re: Task revocation (preemption) in Mesos

2015-02-24 Thread Vinod Kone
There's a killTask() call frameworks can use. @vinodkone On Feb 24, 2015, at 7:42 AM, dev middleware dev.middlew...@gmail.com wrote: Hi, I'm new to Mesos technology. Browsing thru the documentation, I couldn't find how a running task can be killed by Mesos master (/ its pluggable

Re: Task revocation (preemption) in Mesos

2015-02-24 Thread Vinod Kone
:19 PM, Vinod Kone vinodk...@gmail.com wrote: There's a killTask() call frameworks can use. @vinodkone On Feb 24, 2015, at 7:42 AM, dev middleware dev.middlew...@gmail.com wrote: Hi, I'm new to Mesos technology. Browsing thru the documentation, I couldn't find how a running task can

Re: Updating FrameworkInfo settings

2015-02-24 Thread Vinod Kone
Changing FrameworkInfo (while keeping the FrameworkID) is not handled correctly by Mesos at the moment. This is what you currently need to do to propagate FrameworkInfo.checkpoint throughout the cluster. -- Update FrameworkInfo inside your framework and re-register with master. (Old FrameworkInfo

Re: [VOTE] Release Apache Mesos 0.21.1 (rc2)

2015-02-17 Thread Vinod Kone
Tim, mind updating the release guide too? @vinodkone On Feb 17, 2015, at 8:19 AM, Timothy Chen t...@mesosphere.io wrote: Hi Ben, Didn't realize I need to update this. I've added the date now. Tim On Feb 16, 2015, at 10:30 PM, Benjamin Mahler benjamin.mah...@gmail.com wrote:

Re: Mesos Community Meetings

2015-02-05 Thread Vinod Kone
Heads up that this is happening in an hour. On Tue, Jan 20, 2015 at 4:10 PM, Vinod Kone vinodk...@gmail.com wrote: Thanks for the interest. The next meeting will be on 5th February, 3-5 pm PST. The hangout link: https://plus.google.com/hangouts/_/twitter.com/mesos-sync On Tue, Jan 6, 2015

Re: Mesos marks framework as registered successfully despite being unreachable

2015-02-04 Thread Vinod Kone
On Wed, Feb 4, 2015 at 12:22 PM, David Kesler dkes...@yodle.com wrote: I've posted this as an issue in marathon's github ( https://github.com/mesosphere/marathon/issues/1140), but I also wanted to post here as it may be an issue that mesos seems to not be handling the case where it cannot

Re: Is mesos spamming me?

2015-02-02 Thread Vinod Kone
The config-history thing is interesting. Do you happen to know when it gets generated? Are the config.xml files different in each of those directories? On Mon, Feb 2, 2015 at 10:43 AM, Adam Bordelon a...@mesosphere.io wrote: The Jenkins framework may not be properly declining offers, nor

Re: Is mesos spamming me?

2015-02-02 Thread Vinod Kone
On Mon, Feb 2, 2015 at 5:18 PM, Hepple, Robert rhep...@tnsi.com wrote: ... whatever all that means!! And why would it be requesting 704Mb and 0.2 CPUs? Where do those numbers come from? Adding Jenkins Slave Memory in MB and Jenkins Executor Memory in MB comes to 640Mb The extra overhead is

Re: Is mesos spamming me?

2015-02-02 Thread Vinod Kone
On Mon, Feb 2, 2015 at 5:34 PM, Hepple, Robert rhep...@tnsi.com wrote: No - it is not 'restricted' to a label but the only executors this jenkins has are 'mesos' ones. You need to restrict it to 'mesos'. That is how the plugin is implemented. It will look into offers from mesos *iff* it has

Re: Is mesos spamming me?

2015-02-02 Thread Vinod Kone
On Mon, Feb 2, 2015 at 5:04 PM, Hepple, Robert rhep...@tnsi.com wrote: As I say, it can't seem to get a slave started, the jenkins job is just hanging waiting for an executor (this test server presently only has mesos slaves as executors). Did you set the mesos label on the job?

Re: Is mesos spamming me?

2015-02-02 Thread Vinod Kone
Can you ask on jenkins user list? You'll probably get quicker response there. On Mon, Feb 2, 2015 at 6:33 PM, Hepple, Robert rhep...@tnsi.com wrote: On Mon, 2015-02-02 at 11:43 -0800, Vinod Kone wrote: The config-history thing is interesting. Do you happen to know when it gets generated

<    1   2   3   4   5   6   >