What are the invalid-user.log files?

2016-03-19 Thread Peter Steele
I'm testing out mesos for the first time and after installing the software I'm seeing numerous log files of the form mesos-slave..invalid-user.log.INFO.20160317-062640.918 I don't see any errors in the logs themselves, but the fact that "invalid-user" is part of the name makes me think I'm

Re: Unstability on Mesos 0.27

2016-03-19 Thread Guillermo Rodriguez
Hi, I have reported all my problems already. That's why I installed 0.27.2 and I'm waiting for 0.28 because I see many solutions there. The lastest crash I was unable to identify. There were no logs at all of any error. I just know marathon and mesos decided to shutdown at the same time. I

Re: verbose logging with the docker executor

2016-03-19 Thread James Peach
> On Mar 17, 2016, at 10:09 AM, Clarke, Trevor wrote: > > Looking in the docker executor, the docker command line is logged with > VLOG(1) but I'm not sure how to generate that level of log output. Some > googling suggests it's used in the google logging library and verbose

Re: What are the invalid-user.log files?

2016-03-19 Thread haosdent
Not sure why glog could not get USER environment variable correctly after looking its code. But should not affect you running mesos. On Fri, Mar 18, 2016 at 1:11 AM, haosdent wrote: > root should be fine. > > On Fri, Mar 18, 2016 at 12:53 AM, Peter Steele

Re: HTTP API

2016-03-19 Thread Vinod Kone
Thanks for the interest! We are actively working to make the Framework v1 API stable. We've made quite a few improvements/fixes to the Scheduler v1 API since 0.24.0. We've also introduced Executor v1 API in 0.28.0. Both are in *experimental* state. There are still things left to do to make the

Re: Deploying mesos software

2016-03-19 Thread Chris Baker
Peter, I've had success packaging my own RPMs for Centos 6 and 7 using the following: https://github.com/mesosphere/mesos-deb-packaging In my particular case, I use it to package custom-built versions of mesos. Chris On Tue, Mar 15, 2016 at 5:58 PM Peter Steele wrote: > Ah,

Re: Resource Isolation per Framework

2016-03-19 Thread Pradeep Chhetri
Hello haosdent, Perfect, then i guess that should solve my issue. I will configure things and see if it is working as expected or not. Cheers. On Fri, Mar 18, 2016 at 3:20 PM, haosdent wrote: > Seems common is not necessary in your case. The resources without role > would

Re: [VOTE] Release Apache Mesos 0.26.1 (rc2)

2016-03-19 Thread Vinod Kone
+1 (binding) Tested on ASF CI. On Sun, Mar 13, 2016 at 4:33 PM, Michael Park wrote: > +1 (binding) > > Internal CI results with the corresponding JIRA tickets for the failed > tests: > > CentOS 6 (non-SSL): > -

Re: Vote on #MesosCon proposals, deadline Friday March 25

2016-03-19 Thread David Greenberg
Hi Jay, Thanks for your feedback! The reason we're asking for you to rank the topics is that this will allow us to better understand everyone's relative preferences--next, we'll use standard voting algorithms to determine the schedule, to ensure most people get as many talks they want as

Re: Unstability on Mesos 0.27

2016-03-19 Thread Jie Yu
Thanks for reporting! Can you be more specific about which component crashes a lot? Is it the framework, the master, the agent, or the executor. As Artem and Vinod mentioned, it'll be really helpful if you can provide the relevant log (master/agent/executor's log) so that we can pinpoint the

Re: How to kill tasks when memory exceeds the cgroup limit?

2016-03-19 Thread Shiyao Ma
Thanks. The limit_swap works.

Re: What are the invalid-user.log files?

2016-03-19 Thread haosdent
I fill this issue in https://github.com/mesosphere/mesos-deb-packaging/issues/71 On Sat, Mar 19, 2016 at 1:48 AM, haosdent wrote: > According I test by systemctl. It didn't export $USER when start service. > > ``` > [Unit] > Description=test > After=network.target > >

Re: What are the invalid-user.log files?

2016-03-19 Thread Peter Steele
My USER var is root. Do I need to create a non-root user for mesos to run under? On 03/17/2016 09:22 AM, haosdent wrote: glog get the user name by environment variable "USER" in Linux. https://github.com/google/glog/blob/master/src/utilities.cc#L290-L302 I think you could check the

Compute event at Twitter HQ - 03/31

2016-03-19 Thread Ian Downes
Hello everyone, I'd like to call attention to an event the Compute group at Twitter is holding at the end of the month where there will be a few Aurora/Mesos-related talks: 1. David Robinson, one of our SREs, will talk about how our small team of SREs manages what is possibly the largest Mesos

Unstability on Mesos 0.27

2016-03-19 Thread Alfredo Carneiro
Hello guys, I am using Mesos 0.27 with different kinds of applications, such as, crawlers, databases and websites. However, I have faced many crashes and I couldn't find what it is the matter. We have 14 machines with 8Gb of ram and 4 cpu each. Usually, we run about 40 instance of our crawler,

Re: What are the invalid-user.log files?

2016-03-19 Thread Peter Steele
In my case I am using the CentOS 7 set of rpms from mesosphere... On 03/18/2016 09:14 AM, Pradeep Chhetri wrote: I installed mesos using mesosphere debian repository. On Fri, Mar 18, 2016 at 4:10 PM, haosdent > wrote: cool, you install

Re: Unstability on Mesos 0.27

2016-03-19 Thread Vinod Kone
Hey Gabriel, Could you share more details on what the crashes are and what your setup is (docker containerizer?). Any logs (master, agent, application) that can shed light would be useful to diagnose. On Wed, Mar 16, 2016 at 5:12 PM, Alfredo Carneiro < alfr...@simbioseventures.com> wrote: >

Re: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Michael Park
+1 (binding) Internal CI results with the corresponding JIRA tickets for the failed tests: CentOS 6 (non-SSL): CentOS 6 (SSL): - MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery (MESOS-4047 ) CentOS 7 (non-SSL): -

Re: HTTP API

2016-03-19 Thread Joseph Wu
Zameer, In case you haven't seen this already, there is already a Java-based scheduler driver for the HTTP API here: https://github.com/mesosphere/mesos-rxjava On Thu, Mar 17, 2016 at 5:26 PM, Zameer Manji wrote: > > On Thu, Mar 17, 2016 at 10:03 AM, Vinod Kone

Re: How to kill tasks when memory exceeds the cgroup limit?

2016-03-19 Thread Dick Davies
On 18 March 2016 at 20:58, Benjamin Mahler wrote: > Interesting, why does it take down the slaves? This was a good while back, but when swap gets low our slaves kernel OOM killer tended to mess things up. > Because a lot of organizations run with swap disabled (e.g. for more

Re: [RESULT][VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread craig w
Great news. Do the rpm's get automatically built and released or will they come later this week? On Thu, Mar 17, 2016 at 1:28 PM, Vinod Kone wrote: > Hi all, > > > The vote for Mesos 0.28.0 (rc2) has passed with the > > following votes. > > > +1 (Binding) > >

Re: [VOTE] Release Apache Mesos 0.26.1 (rc2)

2016-03-19 Thread Benjamin Mahler
These are be captured under: https://issues.apache.org/jira/browse/MESOS-4979 On Thu, Mar 17, 2016 at 5:04 PM, Benjamin Mahler wrote: > Thanks for the hard work! Do we need to backport the rmdir fixes on the > outstanding release candidates? > > commit

Re: What are the invalid-user.log files?

2016-03-19 Thread Peter Steele
What would you suggest to troubleshoot this? Clear something isn't quite right if my log files are called "invalid-user". That said, I have managed to get a containerized application up and running so whatever is wrong isn't fatal. Just ugly... On 03/17/2016 10:42 AM, haosdent wrote: Not sure

Re: [VOTE] Release Apache Mesos 0.25.1 (rc2)

2016-03-19 Thread Michael Park
As there are insufficient votes on this rc along with a request from Evan Krall to include additional fixes: https://www.mail-archive.com/user@mesos.apache.org/msg06204.html , I'm declaring this rc failed, and will cut be cutting an

Re: [VOTE] Release Apache Mesos 0.24.2 (rc2)

2016-03-19 Thread Michael Park
As there are insufficient votes on this rc along with a request from Evan Krall to include additional fixes: https://www.mail-archive.com/user@mesos.apache.org/msg06205.html, I'm declaring this rc failed, and will cut be cutting an rc3 early next week. Thanks, MPark On 13 March 2016 at 20:57,

Re: [VOTE] Release Apache Mesos 0.24.2 (rc2)

2016-03-19 Thread Michael Park
Hi Evan, As I mentioned in the 0.25.1 rc2 thread, I will be cutting an rc3 for 0.24.2 and 0.25.1 to include those patches. Thanks again! MPark On 18 March 2016 at 17:09, Benjamin Mahler wrote: > +michael who is managing the release, he'll get back to you shortly, >

Re: [VOTE] Release Apache Mesos 0.25.1 (rc2)

2016-03-19 Thread Michael Park
Hi Evan, Thank you for letting us know of the issues and your explanation! I will cut an RC3 for 0.24.2 and 0.25.1 to include those patches. MPark On 14 March 2016 at 20:25, Evan Krall wrote: > On Sun, Mar 13, 2016 at 3:46 PM, Michael Park wrote: > >> Hi

Re: Unstability on Mesos 0.27

2016-03-19 Thread Artem Harutyunyan
Hi Guillermo, We would really like to help you, and to understand what the issues are. Could you please send us all the logs you have so we can inspect them and figure out what happened? Artem. On Thursday, March 17, 2016, Guillermo Rodriguez wrote: > Update to 0.27.2 or

RE: verbose logging with the docker executor

2016-03-19 Thread Clarke, Trevor
Thanks, worked great -- Trevor R.H. Clarke Software Engineer, Ball Aerospace (937)320-7087 > -Original Message- > From: James Peach [mailto:jor...@gmail.com] > Sent: Thursday, March 17, 2016 1:15 PM > To: user@mesos.apache.org > Subject: Re: verbose logging with the

Can mesos support supports multi-datacenter and multi-region configurations for failure isolation and scalability.

2016-03-19 Thread tommy xiao
recently, i read Nomad blog's article, , the blog said: ``` Mesos does not support federation or multiple failure isolation regions. Nomad supports multi-datacenter and multi-region configurations for failure isolation and scalability. ``` how the mesos support multi-datacenter and

Re: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Zhitao Li
I don't think it's a blocking issue after some initial investigation. Changing my vote to +1 (nonbinding) On Wed, Mar 16, 2016 at 6:07 PM, Vinod Kone wrote: > > On Wed, Mar 16, 2016 at 5:59 PM, Daniel Osborne < > daniel.osbo...@metaswitch.com> wrote: > >> Is this issue a

Re: [RESULT][VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Bill Farner
Jake - i think that would be wonderful! On Thu, Mar 17, 2016 at 11:17 AM, Jake Farrell wrote: > I've been maintaining a deb/rpm set for Mesos and for Aurora and Thrift we > have been using the infra supported Bintray to make it available to the > community via

Re: What are the invalid-user.log files?

2016-03-19 Thread Peter Steele
Yes, I see that as well. Also the case for the mesos master. That explains the invalid-user, but why isn't these processes picking up $USER? On 03/18/2016 07:31 AM, Pradeep Chhetri wrote: I can see that USER environment variable is not set for mesos-slave process from /proc//environ. On Fri,

RE: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Daniel Osborne
Is this issue a blocker? Are we moving to rc3 or proceeding with 0.28.0? Sorry if this is a silly question, a bit new to the release / voting process. Best, -Dan From: Zhitao Li [mailto:zhitaoli...@gmail.com] Sent: Tuesday, March 15, 2016 8:15 AM To: Jörg Schad Cc:

Re: [RESULT][VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Kapil Arya
Here is a link to the rpm/deb packages: http://open.mesosphere.com/downloads/mesos/#apache-mesos-0.28.0 Best, Kapil On Thu, Mar 17, 2016 at 2:33 PM, Vinod Kone wrote: > +1 > > @vinodkone > > On Mar 17, 2016, at 11:27 AM, Bill Farner wrote: > > Jake -

Re: Deploying mesos software

2016-03-19 Thread Peter Steele
Thanks for the info. The mesosphere.com/downloads link does include the CentOS 7 RPMs I was looking for so I'm sticking with these for now. Peter On 03/17/2016 07:55 AM, Chris Baker wrote: Peter, I've had success packaging my own RPMs for Centos 6 and 7 using the following:

Re: How to kill tasks when memory exceeds the cgroup limit?

2016-03-19 Thread Dick Davies
Great! I'm not really sure why mesos even allows RSS limiting without VMEM, it takes down slaves like the Black Death if you accidentally deploy a 'leaker'. I'm sure there's a use case I'm not seeing :) On 18 March 2016 at 16:27, Shiyao Ma wrote: > Thanks. The limit_swap works.

Re: How to kill tasks when memory exceeds the cgroup limit?

2016-03-19 Thread Dick Davies
Last time I tried (not on the latest release) I also had to have cgroups set to limit swap, otherwise as soon as the process hit the RAM limit it would just start to consume swap. try adding --cgroups_limit_swap to the slaves startup flags. On 17 March 2016 at 16:21, Shiyao Ma

Re: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Kapil Arya
+1 (binding). You can find the links to rpm/deb files for this RC here: http://open.mesosphere.com/downloads/mesos-rc/ On Thu, Mar 17, 2016 at 12:58 PM, Michael Park wrote: > +1 (binding) > > Internal CI results with the corresponding JIRA tickets for the failed > tests: > >

Re: How to kill tasks when memory exceeds the cgroup limit?

2016-03-19 Thread haosdent
Does your oom killer enable? Could check by cat memory.oom_control file. On Fri, Mar 18, 2016 at 12:21 AM, Shiyao Ma wrote: > Hi, > > > For the slave side: > export MESOS_RESOURCES='cpus:4;mem:180' > export MESOS_ISOLATION='cgroups/cpu,cgroups/mem' > > For the framework, > It

Resource Isolation per Framework

2016-03-19 Thread Pradeep Chhetri
Hello All, I am looking for ways to isolate resources per framework basis. Basically what i want to achieve is segregating machines on which tasks from a framework can run and having a common set of machines which can be shared across all frameworks. So I can think of assigning one role per

Re: HTTP API

2016-03-19 Thread Zameer Manji
+1 I am also interested in knowing the state of the HTTP API. I have heard that it stabilizing the API might be tied with Mesos 1.0 but I don't have a source for that. Can a PMC member comment on what the plan is? On Mon, Mar 14, 2016 at 2:30 PM, Dario Rexin wrote: > Hi

Re: What are the invalid-user.log files?

2016-03-19 Thread Pradeep Chhetri
I think it is an upstart issue. I am noticing this issue in Ubuntu 14.04 which uses upstart. This is what i can debug so far: According to this documentation: http://upstart.ubuntu.com/cookbook/#job-environment When Upstart runs a job, it provides it with a very restrictive environment which

Re: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Vinod Kone
On Wed, Mar 16, 2016 at 5:59 PM, Daniel Osborne < daniel.osbo...@metaswitch.com> wrote: > Is this issue a blocker? Are we moving to rc3 or proceeding with 0.28.0? > It was not marked as such, so I'm guessing not. @Jie and @Zhitao, can you confirm? Also, we still need some binding votes for this

Re: [VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Vinod Kone
+1 (binding) Tested on ASF CI (ubuntu 14.04 w/ gcc and clang). On Wed, Mar 16, 2016 at 6:07 PM, Vinod Kone wrote: > > On Wed, Mar 16, 2016 at 5:59 PM, Daniel Osborne < > daniel.osbo...@metaswitch.com> wrote: > >> Is this issue a blocker? Are we moving to rc3 or proceeding

Re: [RESULT][VOTE] Release Apache Mesos 0.28.0 (rc2)

2016-03-19 Thread Vinod Kone
The project itself doesn't officially release rpms/debs, but the community members do. For example, Mesosphere is planning to release rpms/debs shortly. On Thu, Mar 17, 2016 at 10:38 AM, craig w wrote: > Great news. Do the rpm's get automatically built and released or will

Re: Compute event at Twitter HQ - 03/31

2016-03-19 Thread haosdent
Would it have youtube live link? On Thu, Mar 17, 2016 at 12:38 AM, Ian Downes wrote: > Hello everyone, > > I'd like to call attention to an event the Compute group at Twitter is > holding at the end of the month where there will be a few > Aurora/Mesos-related talks: > >