Re: Mesos loses track of Docker containers

2016-08-10 Thread Jeff Schroeder
Have you considered upgrading Mesos and Marathon? Those are quite old versions of both with some fairly glaring problems with the docker containerizer if memory serves. Also what version of docker? On Wednesday, August 10, 2016, Paul Bell wrote: > Hello, > > One of our customers has twice encoun

Re: [VOTE] Release Apache Mesos 1.0.0 (rc4)

2016-07-27 Thread Jeff Schroeder
Small nit but can you s/experimnental/experimental/ under the "Storage" header in the release post please? Great work otherwise everyone! On Wednesday, July 27, 2016, Vinod Kone wrote: > Hi all, > > The vote for Mesos 1.0.0 (rc4) has passed with the following votes. > > > +1 (Binding) > > -

Re: Initial Design Document Apache Mesos Federation (JIRA 3548)

2016-07-14 Thread Jeff Schroeder
r ASF? > > I would love to hear more from others too. > > Regards, > Dhilip > > On Wed, Jul 13, 2016 at 8:46 PM, Jeff Schroeder < > jeffschroe...@computer.org > > wrote: > >> Would this mean introducing golang and as a result, consul, into mesos >>

Re: Initial Design Document Apache Mesos Federation (JIRA 3548)

2016-07-13 Thread Jeff Schroeder
lusters-for-global-data-center-designs > Video : > https://www.youtube.com/watch?v=kqyVQzwwD5E&index=17&list=PLGeM09tlguZQVL7ZsfNMffX9h1rGNVqnC > > -- Jeff Schroeder Don't drink and derive, alcohol and analysis don't mix. http://www.digitalprognosis.com

Re: Rack awareness support for Mesos

2016-06-07 Thread Jeff Schroeder
On Tuesday, June 7, 2016, Du, Fan wrote: > > > On 2016/6/6 23:48, Jörg Schad wrote: > >> Hi, >> thanks for your idea and design doc! >> Just a few thoughts: >> a) The scheduling part would be implemented in a framework scheduler and >> not the Mesos Core, or? >> > > I'm not sure which level of sc

Re: 1.0 Release Candidate

2016-05-25 Thread Jeff Schroeder
Does this mean the work to deprecate the docker containerizer will be post-1.0, or have those plans changed? On Wednesday, May 25, 2016, Vinod Kone wrote: > Hi folks, > > As discussed in the previous community sync, we plan to cut a release > candidate for our next release (1.0) early next week.

Re: mesos website workgroup

2016-05-16 Thread Jeff Schroeder
*I'd even On Monday, May 16, 2016, Jeff Schroeder wrote: > Is be in. > > On Monday, May 16, 2016, Vinod Kone > wrote: > >> Hi guys, >> >> Mesos website needs some love. It hasn't seen major changes for a while >> now and there is no real maint

Re: mesos website workgroup

2016-05-16 Thread Jeff Schroeder
Is be in. On Monday, May 16, 2016, Vinod Kone wrote: > Hi guys, > > Mesos website needs some love. It hasn't seen major changes for a while > now and there is no real maintainer for it. > > I'm proposing we start a work group for the folks who are interested in > contributing to the website. Esp

Re: Status of MESOS-2533?

2016-05-04 Thread Jeff Schroeder
hu, May 5, 2016 at 1:16 AM, Benjamin Mahler > wrote: > >> +AlexR >> >> On Mon, May 2, 2016 at 2:31 PM, Jeff Schroeder < >> jeffschroe...@computer.org >> > wrote: >> >>> Some frameworks like Aurora use custom executors to distribute the >

Re: How to use a complete host

2016-05-02 Thread Jeff Schroeder
In addition to actually using roles, you might also consider subscribing to MESOS-4138, which will implement this idea in a generic way (but won't help you today). On Monday, May 2, 2016, Christoph Heer wrote: > Hi everyone, > > sometimes in my Mesos use-case it's required to ensure that my own

Status of MESOS-2533?

2016-05-02 Thread Jeff Schroeder
Some frameworks like Aurora use custom executors to distribute the healthchecks with the tasks. This allows the task to survive a network partition without the scheduler setting it to TASK_LOST. Marathon uses mesos-health-check for command based health checks, but does TCP and HTTP healthchecks fr

Re: mesos docker vs native container

2016-04-26 Thread Jeff Schroeder
I think you might be a bit confused how this all works. Docker by default uses bridged networking, which by default spins up a little crappy docker-proxy process for every port. You can disable docker-proxy and instead use hairpin routing mode if you have a modern kernel. However, I'm almost certai

Mesos on RHEL 5.X

2016-04-19 Thread Jeff Schroeder
you know if I can install devtools-2 for RHEL 5.4 using yum ? > > Thanks, > Mani > > On Tue, Apr 19, 2016 at 10:54 AM, Jie Yu wrote: > >> I know someone is still using Mesos in production on RHEL 5.4. You need >> devtoolset-2 to build Mesos. >> >> On T

Re: Mesos on RHEL 5.X

2016-04-19 Thread Jeff Schroeder
The RHEL5 kernel will not support the necessary bits for mesos. RHEL6 also lacks the overwhelming majority of support for namespaces and control groups. Try upgrading to RHEL7 and then giving Mesos a go. It doesn't support older kernels. On Tuesday, April 19, 2016, Manivannan wrote: > Hi, > > Wh

Re: Mesos interconnection among clusters project calico

2016-04-13 Thread Jeff Schroeder
Stefano, you might also follow the jira issue MESOS-3548, which is for mesos to support federation amongst multiple clusters natively. On Wednesday, April 13, 2016, Stefano Bianchi wrote: > Ah ok. > No problem. > See you and best regards!!! > Il 13/apr/2016 21:09, "June Taylor" > ha scritto: >

Re: Mesos agents across a WAN?

2016-04-01 Thread Jeff Schroeder
On Thursday, March 31, 2016, Evan Krall wrote: > On Wed, Mar 30, 2016 at 6:56 PM, Jeff Schroeder < > jeffschroe...@computer.org > > wrote: > >> Given regional bare metal Mesos clusters on multiple continents, are >> there any known issues running some of the

Re: Mesos agents across a WAN?

2016-03-31 Thread Jeff Schroeder
> On Thu, Mar 31, 2016 at 3:56 AM, Jeff Schroeder < > jeffschroe...@computer.org > > wrote: > >> Given regional bare metal Mesos clusters on multiple continents, are >> there any known issues running some of the agents over the WAN? Is anyone >> else doing it, or is

Mesos agents across a WAN?

2016-03-30 Thread Jeff Schroeder
es-cluster-federation

Re: How to make full version available in /version endpoint

2016-03-23 Thread Jeff Schroeder
on number is an artifact from >> Mesosphere's build system, and my understanding is they are going to get >> rid of it. So perhaps this will not be a problem in the future? >> >> - Erik >> >> On Wed, Mar 23, 2016 at 5:10 PM, Jeff Schroeder < >> je

Re: How to make full version available in /version endpoint

2016-03-23 Thread Jeff Schroeder
Perhaps building your own version, with your own version string would be sufficient? A general purpose feature to override the stated version with an environment variable doesn't seem very applicable in many environments. Perhaps there is a different way you could accomplish the same ultimate goal?

Re: LogrotateContainerLogger documentation

2016-03-22 Thread Jeff Schroeder
Would you be willing to fork the repo and create a documentation patch? On Tuesday, March 22, 2016, haosdent wrote: > As I know, currently don't have a document like that. But in > [configuration document]( > https://github.com/apache/mesos/blob/master/docs/configuration.md), > something like `(

Re: marathon-lb at scale

2016-03-03 Thread Jeff Schroeder
Being able to set HAPROXY_0_VHOST to the mesos-dns name and having everything just magically work is a pretty fantastic user experience however. Especially for users who might need to talk to SysAdmin teams to manually change DNS. Any alternatives? We have marathon-lb running in a container with ke

Re: Using Virtual Hosts

2016-02-11 Thread Jeff Schroeder
With a few of the newly added features, marathon-lb is actually a pretty elegant solution: https://github.com/mesosphere/marathon-lb On Thursday, February 11, 2016, Alfredo Carneiro < alfr...@simbioseventures.com> wrote: > Hi guys, > > I have been searching for the past few weeks about Mesos and

Re: deploy mesos cluster on aws

2016-01-10 Thread Jeff Schroeder
On Sunday, January 10, 2016, Rodrick Brown wrote: > We run 100% on AWS and have been running Mesos in production since version > 0.19 > Our cluster consists of 3 dedicated zookeeper nodes (M3.2lx), 3 dedicated > masters (M3.2lx), 8 dedicated slaves (M4.4xl) and 2 haproxy (M4.Medium) > instances u

Re: Role-related configuration in Mesos

2015-12-28 Thread Jeff Schroeder
Perhaps we could also support HTTP PATCH so you could just update one small thing versus PUT's get-and-set method. On Thursday, December 17, 2015, Adam Bordelon wrote: > First off, if we're going to have a /reservations endpoint, we should > follow the same PUT+DELETE pattern for reserve+unreserve
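
To illustrate the difference being suggested, here is a minimal sketch using plain dicts rather than a real HTTP endpoint. The function names and the resource fields (`role`, `cpus`) are hypothetical and are not taken from any Mesos API; the point is only that PUT replaces the whole resource while PATCH merges in the fields that changed.

```python
def apply_put(stored, body):
    """PUT semantics: the request body replaces the stored resource entirely."""
    return dict(body)


def apply_patch(stored, changes):
    """PATCH semantics: only the supplied fields are updated, the rest are kept."""
    merged = dict(stored)
    merged.update(changes)
    return merged


# Hypothetical resource, for illustration only.
stored = {"role": "ads", "cpus": 4}
print(apply_put(stored, {"cpus": 8}))    # the caller must resend every field
print(apply_patch(stored, {"cpus": 8}))  # the caller sends only the change
```

With PUT the client typically has to GET the current state first so it can send back a complete document, which is the get-and-set round trip PATCH would avoid.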

Re: Mesos at Moz

2015-12-07 Thread Jeff Schroeder
From a quick skim, this looks excellent. Are there any plans to try getting the bamboo changes back upstream, or is it going to be a permanent fork? On Monday, December 7, 2015, Arunabha Ghosh wrote: > Hi Folks, > We, at Moz have been working for a while on RogerOS, our > next gen

Re: [VOTE] Release Apache Mesos 0.24.0 (rc1)

2015-08-28 Thread Jeff Schroeder
Also, if you want to Accept application/json, perhaps you could do something like: [recordLength, recordData], [record2Length, record2Data] That gives you the same semantics, and the record data can be anything, but also falls inline with the server returning Content-Type: application/json For t
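
A rough sketch of the `[recordLength, recordData]` framing described above, assuming each record is itself serialized to a JSON string and the length counts characters of that string (a real wire format would more likely count bytes). The function names are hypothetical and this is not the format Mesos actually adopted.

```python
import json


def encode_records(records):
    """Encode each record as a [length, data] JSON pair, comma-separated."""
    pairs = []
    for record in records:
        data = json.dumps(record)
        pairs.append(json.dumps([len(data), data]))
    return ", ".join(pairs)


def decode_records(stream):
    """Parse the comma-separated pairs back into records, checking lengths."""
    decoder = json.JSONDecoder()
    records = []
    pos = 0
    while pos < len(stream):
        # raw_decode parses one JSON value and reports where it ended.
        pair, pos = decoder.raw_decode(stream, pos)
        length, data = pair
        if length != len(data):
            raise ValueError("record length mismatch")
        records.append(json.loads(data))
        # Skip the ", " separator before the next pair.
        while pos < len(stream) and stream[pos] in ", ":
            pos += 1
    return records
```

Because every pair is valid JSON, a client can parse the stream with an ordinary JSON decoder and still use the length to validate or pre-allocate for each record.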

Re: Mesos/Marathon/HAProxy Logging

2015-08-25 Thread Jeff Schroeder
This is the header that should be passed: https://en.m.wikipedia.org/wiki/X-Forwarded-For Most of the modern internet routes through reverse proxies and this is how we log the actual source clients to solve similar auditing and compliance needs. On Tuesday, August 25, 2015, John Omernik wrote:
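
A small sketch of reading that header on the backend, assuming the request headers arrive as a plain dict keyed by the exact name `X-Forwarded-For` and that the proxy appends hops left to right, so the first entry is the original client. The function name is hypothetical. The header is client-supplied and can be spoofed, so it should only be trusted when the request actually came through your own proxy.

```python
def client_ip(headers, remote_addr):
    """Return the original client address, falling back to the socket peer."""
    forwarded = headers.get("X-Forwarded-For", "")
    # The header is a comma-separated list: client, proxy1, proxy2, ...
    hops = [hop.strip() for hop in forwarded.split(",") if hop.strip()]
    return hops[0] if hops else remote_addr


# Documentation-range addresses, for illustration only.
print(client_ip({"X-Forwarded-For": "203.0.113.7, 10.0.0.2"}, "10.0.0.3"))
```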

Re: No CRAM-MD5 support for libsasl2 when compiling

2015-08-16 Thread Jeff Schroeder
s in > /home/user/Desktop/libsasl2/lib/sasl2? > > On 17/08/2015 2:45 PM, Jeff Schroeder wrote: > > Yes, the same thing happened to me at first. You need both the headers > (the dev package and .h files) and the actual libraries installed. > > On Sunday, August 16, 2015, F21

Re: No CRAM-MD5 support for libsasl2 when compiling

2015-08-16 Thread Jeff Schroeder
Yes, the same thing happened to me at first. You need both the headers (the dev package and .h files) and the actual libraries installed. On Sunday, August 16, 2015, F21 wrote: > I am trying to build mesos 0.23. > > In my case, I am trying to build mesos and all of its dependencies > manually, s

Re: MesosCon Seattle attendee introduction thread

2015-08-16 Thread Jeff Schroeder
to meet with folks interested in learning more about how we >> deploy and manage services at Twitter using Mesos and Apache Aurora >> <http://aurora.apache.org>. Twitter has a booth where I'll be hanging >> out for a portion of the conference, feel free to stop by and

Re: SSL in Mesos 0.23

2015-08-07 Thread Jeff Schroeder
able, rather than /etc/mesos-master/ssl_enabled. Of course, along > with > > the rest of the SSL_ environment variables. > > > > e.g. SSL_ENABLED=true SSL_KEY_FILE= > > SSL_CERT_FILE= ./mesos-master > > > > MPark. > > > > On Thu, Aug 6, 2015 at 9

Re: SSL in Mesos 0.23

2015-08-06 Thread Jeff Schroeder
nks for your help! > > [1] http://open.mesosphere.com/getting-started/datacenter/install/ > [2] http://mesos.apache.org/documentation/latest/mesos-ssl/ > > Regards, > Dharmit. >

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
ir/build/BUILD/mesos-0.23.0/include/mesos/scheduler.hpp:486:15: > error: 'function' in namespace 'std' does not name a type > const std::function& connected, >^ > / > ``` > > On Wed, Aug 5, 2015 at 10:21 AM, Jeff Schroeder < &g

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
irc. >> >> Cheers, >> Tim >> >> -- >> >> *From: *"Jeff Schroeder" > > >> *To: *"Mesos Users" > > >> *Sent: *Tuesday, August 4, 2015 6:21:39 PM >> *Subject: *Problem building 0.23.0 under mock for RHEL7 >> >

Re: Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
Will do, I'll be in around 9am CST. Thanks Tim On Tuesday, August 4, 2015, Tim St Clair wrote: > I'll take a look in the a.m. feel free to ping me on irc. > > Cheers, > Tim > > -- > > *From: *"Jeff Schroeder" > >

Problem building 0.23.0 under mock for RHEL7

2015-08-04 Thread Jeff Schroeder
curious if it has been seen before. [1] http://pkgs.fedoraproject.org/cgit/mesos.git/tree/?h=f23 [2] https://gist.githubusercontent.com/SEJeff/7c01db2704a1352c8d7e/raw/b606fa77b73984cfa02d62cc1c90f4ae736fae0a/build.log

Re: Questions about framework development - (HA and reconciling state)

2015-07-25 Thread Jeff Schroeder
Not sure how much more difficult it would be, but Apache Aurora uses the native mesos replicated log construct for data persistence (where you store data in memory). It requires one manual setup to deploy the framework, but seems like it is worth it for what you get out of it. Here is how I just re

Re: "Cluster of Workstations" type design for a Mesos cluster

2015-07-21 Thread Jeff Schroeder
As far as mesos is concerned, compute is a commodity. This should work just fine. Put Aurora or Marathon on top of mesos if you need a general purpose scheduler and you're good to go. The nice thing is that you can add additional slaves as you need. I believe heterogeneous clusters are best if possi

Re: [VOTE] Release Apache Mesos 0.23.0 (rc3)

2015-07-16 Thread Jeff Schroeder
What about MESOS-3055 in 0.23? Is that going to get passed up on even if we are going to cut another rc? On Thursday, July 16, 2015, Vinod Kone wrote: > -1 so that we can cherry pick MESOS-3055. > > The master crash bug is MESOS-3070 > but the f

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-09 Thread Jeff Schroeder
1 FAILED TEST YOU HAVE 10 DISABLED TESTS = On Thu, Jul 9, 2015 at 2:03 PM, Jie Yu wrote: > Jeff, > > Can you use --verbose during testing and paste the log? > > - Jie > > On Thu, Jul 9, 2015 at 11:02 AM, Jeff Schroeder < > jeffschroe...@computer.org>

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-09 Thread Jeff Schroeder
Does anyone else see this python failure, or am I just testing something incorrectly? On Wed, Jul 8, 2015 at 7:33 PM, Jeff Schroeder wrote: > -1 (non-binding) The Python test ExamplesTest.PythonFramework still fails > on RHEL7, unless building the mesos native Python package isn

Re: [VOTE] Release Apache Mesos 0.23.0 (rc2)

2015-07-08 Thread Jeff Schroeder
-1 (non-binding) The Python test ExamplesTest.PythonFramework still fails on RHEL7, unless building the mesos native Python package isn't part of make check. It pops on an ImportError for mesos.native. See here for more details: https://gist.github.com/SEJeff/05a31c1d2d10fecc97a1 On Wednesday,

Re: git / chronos-marathon integration

2015-06-24 Thread Jeff Schroeder
And I guess git clone with -b for a branch name or specifying a commit manually won't work for you? On Wednesday, June 24, 2015, Sebastien Brennion < sebastien.brenn...@leanbi.ch> wrote: > Is there a way to specify which version of a script marathon/chronos > should use ? : > > > > I would like

Re: How to upgrade mesos version from a running mesos cluster

2015-06-18 Thread Jeff Schroeder
Hello Tommy, have you read the documentation? If not, please take a look and then follow up with any specific questions here: http://mesos.apache.org/documentation/latest/upgrades/ On Thursday, June 18, 2015, tommy xiao wrote: > Hi, > > I have a question on upgrade strategy: > How about upgrade

Re: slave work_dir filling up

2015-06-09 Thread Jeff Schroeder
http://mesos.apache.org/documentation/latest/configuration/ Look under Slave Options at --gc_delay and --gc_disk_headroom On Tuesday, June 9, 2
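
For context on how those two flags interact: as I read the configuration page, the maximum age of a sandbox directory is `gc_delay * max(0.0, 1.0 - gc_disk_headroom - disk usage)`, so directories are collected sooner as the disk fills. A minimal sketch of that calculation, with a hypothetical function name; please check the formula against the docs for your Mesos version.

```python
def max_sandbox_age_seconds(gc_delay_seconds, gc_disk_headroom, disk_usage):
    """Maximum age before a sandbox becomes eligible for garbage collection.

    gc_disk_headroom and disk_usage are fractions between 0.0 and 1.0.
    """
    return gc_delay_seconds * max(0.0, 1.0 - gc_disk_headroom - disk_usage)


one_week = 7 * 24 * 3600
# With 10% headroom and a disk that is 40% full, roughly half the delay remains.
print(max_sandbox_age_seconds(one_week, 0.1, 0.4))
# Once usage plus headroom reaches 100%, sandboxes are eligible immediately.
print(max_sandbox_age_seconds(one_week, 0.1, 0.95))
```

If the work_dir keeps filling up, lowering `--gc_delay` or raising `--gc_disk_headroom` both shrink this age.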

Re: Mesos Security Recommendations

2015-06-04 Thread Jeff Schroeder
For securing insecure network communication you can use something like stunnel, then point the app at the local stunnel. It would be a fair bit of hoops to configure it all with your config management system, but is totally doable. On Thursday, June 4, 2015, John Webb wrote: > All, > > I'm l

Re: [DISCUSS] Renaming Mesos Slave

2015-06-02 Thread Jeff Schroeder
For the sake of hearing out all sides, who is offended by this name? For those who are offended, what would be less offensive names? If slave is taken out of context, minion seems just as easy to be taken out of context. I'd really love to hear from someone actually offended what we can do to make

Re: [DISCUSS] Renaming Mesos Slave

2015-06-01 Thread Jeff Schroeder
My (very personal) thought here is that we should ensure a vocal minority is not changing things for the sake of changing it. What is the industry standard here? Are potential users actually refusing to use mesos due to the terminology which is unfortunately very prevalent in the client/server worl

Re: cluster confusion after zookeeper blip

2015-05-18 Thread Jeff Schroeder
Not that this is super helpful for your issue, but I ran into an identical problem this morning with Aurora on top of mesos where the scheduler was inoperable due to my ZK ensemble losing quorum and generally acting bad. However as soon as I fixed the quorum things immediately recovered. I believe i

Re: Batch Scheduler with dependency support

2015-05-13 Thread Jeff Schroeder
Look up HubSpot's Singularity On Wednesday, May 13, 2015, Aaron Carey wrote: > Thanks Jeff, > > Any other options around as well? > > -- > *From:* jeffschr...@gmail.com > [ > jeffschr...@gmail.com > ] on behalf of > Jeff Sc

Batch Scheduler with dependency support

2015-05-13 Thread Jeff Schroeder
> thanks, > Aaron > > -- > *From:* jeffschr...@gmail.com [jeffschr...@gmail.com] on behalf of Jeff > Schroeder [jeffschroe...@computer.org] > *Sent:* 13 May 2015 13:12 > *To:* user@mesos.apache.org > *Subject:* Re: Batch Scheduler with dependency support > > Apa

Re: Batch Scheduler with dependency support

2015-05-13 Thread Jeff Schroeder
Apache Aurora does this and you can be explicit about the ordering On Wednesday, May 13, 2015, Aaron Carey wrote: > Hi All, > > I was just wondering if anyone out there knew of a good mesos batch > scheduler which supports dependencies between tasks? (ie Task B cannot run > until Task A is comp

Failed to perform slave recovery after adding or changing attributes

2015-05-08 Thread Jeff Schroeder
ood bit more difficult if we need to programmatically roll out changes to all of our mesos infrastructure via config management (I use salt). This is more of me asking if there is a way for things to work a bit better, and if not, I'd like to discuss a feature enhancement to make it better. -

Re: Docker 1.6 and Registry 2.0 support

2015-04-21 Thread Jeff Schroeder
On Tuesday, April 21, 2015, craig w wrote: > That's good to know. I suppose Mesos/Marathon working with Docker Registry > 2.0 is not ready, would like confirmation. > It has to do with the docker client you have and nothing to do with the mesos / marathon version that I'm aware of. Mesos literal

Re: Current State of Service Discovery

2015-04-12 Thread Jeff Schroeder
This discussion reminds me of a few excellent blog posts on solving similar problems. Smart clients vs dumb load balancers: http://blog.lusis.org/blog/2013/05/13/smart-clients/ Then there is the "local haproxy" idea, which I think is a lot less applicable to a mesos environment, but still worth t

Re: Using mesos-dns in an enterprise

2015-04-02 Thread Jeff Schroeder
You could also just use keepalived for a vip on each mesos-dns instance assuming they are in the same lan. On Thursday, April 2, 2015, Tom Arnfeld wrote: > We're using a BGP based solution currently to solve the problem of highly > available DNS resolvers. > > That might be a route worth taking,

Re: Mesos Hadoop Framework 0.1.0

2015-03-28 Thread Jeff Schroeder
Arnfeld wrote: > To follow up, this is also a decent solution to a nasty problem in the > current framework detailed here, https://github.com/mesos/hadoop/issues/32 > . > > > -- > > Tom Arnfeld > Developer // DueDil > > > On Sat, Mar 28, 2015 at 2:40 PM, Jeff Schr

Re: Mesos Hadoop Framework 0.1.0

2015-03-28 Thread Jeff Schroeder
Does this have any pros / cons over Myriad, which runs Yarn on Mesos? Other than not requiring Yarn :) On Saturday, March 28, 2015, Tom Arnfeld wrote: > Hey everyone, > > I thought it best to send an email to the list before merging and tagging > a 0.1.0 release for the Hadoop on Mesos framewor

Re: Slave recovery not recovering tasks when using systemd

2015-03-26 Thread Jeff Schroeder
On Thursday, March 26, 2015, Joerg Schad wrote: > Dear Mesos Users, > I just wanted to point out a solved issue ( > https://issues.apache.org/jira/browse/MESOS-2419) where the *systemd* > default behaviour prevents tasks from recovering. > > The problem is that the default KillMode for systemd pr

Re: [VOTE] Release Apache Mesos 0.22.0 (rc4)

2015-03-23 Thread Jeff Schroeder
How does everyone verify each release works? Obviously make check is important, but are there any sets of automated integration tests that anyone runs for a new release of mesos? If not, should there be? On Monday, March 23, 2015, Dario Rexin wrote: > +1 (non-binding) > > tested with Marathon >

Re: mesos-collectd-plugin

2015-03-10 Thread Jeff Schroeder
rily to generate figures, text > format numbers are quite enough.) > > So e.g for a hadoop job, when it's finished, we can collect the general > metrics of it? Ideally although there are many jobs from different > frameworks running at the same time on mesos, > > we still c

Re: mesos-collectd-plugin

2015-03-10 Thread Jeff Schroeder
I installed it and played with it for a bit but was somewhat underwhelmed with it. It doesn't support slaves and all of the hardcoding with duplication isn't my favorite. I ended up writing a single plugin to support both masters and slaves and putting it on every node in my Mesos cluster. Would i

Re: Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder
pi, you can do things like: * "if the cpu usage is over 80% for 24 hours, send a warning event" * "if the cpu usage is over 95% for 6 hours, send a critical event" This allows mostly no-impact monitoring since the monitoring tools are hitting graphite. Anyways, back to t
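
The two rules quoted above can be sketched as a check over a time series, assuming the samples have already been fetched from Graphite as `(unix_timestamp, cpu_percent)` pairs. The function names are hypothetical and no Graphite API is called here; this only shows the "over a threshold for a sustained window" logic.

```python
HOUR = 3600


def sustained_over(samples, threshold, window_seconds, now):
    """True if every sample in the last window_seconds exceeds threshold."""
    recent = [value for ts, value in samples
              if now - window_seconds <= ts <= now]
    # An empty window is treated as "not over", so missing data never alerts.
    return bool(recent) and all(value > threshold for value in recent)


def classify(samples, now):
    """Map the two rules onto an event level, checking the stricter one first."""
    if sustained_over(samples, 95.0, 6 * HOUR, now):
        return "critical"
    if sustained_over(samples, 80.0, 24 * HOUR, now):
        return "warning"
    return "ok"
```

A real check would also want to confirm the window holds enough samples, since a sparse series with a single high point would otherwise count as sustained.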

Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder
uster utilization for capacity planning? We can have the NOC watch this and simply add more hardware when the number starts getting low. Thanks