Re: marathon issue in running a docker container.

2016-04-12 Thread nirmalendu swain
Changing the network type to HOST does not work. Its frequently changing the deployment status to Staged and then to no task. On Tuesday, 12 April 2016 11:49 AM, haosdent wrote: How about change the network type from BRIDGE to HOST? On Tue, Apr 12, 2016 at 2:13 PM, nirmalendu swain wr

Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
Hi, I’m unable to get the slave registered with the master node. I’ve configured both the masters and slave machines but am unable to get it registered. Thank you. This message is for the designated recipient only and may contain privileged, proprietary, or ot

Re: marathon issue in running a docker container.

2016-04-12 Thread haosdent
>Its frequently changing the deployment status to Staged Do you find any related log in mesos when marathon lauch the task? On Tue, Apr 12, 2016 at 3:55 PM, nirmalendu swain < nirmalendu_sw...@yahoo.com> wrote: > Changing the network type to HOST does not work. Its frequently changing > the depl

Re: Slaves not getting registered

2016-04-12 Thread haosdent
>but am unable to get it registered. Hi, @aishwarya Could you post master and slave log to provide more details? Usually it is because of network problem. On Tue, Apr 12, 2016 at 4:02 PM, wrote: > Hi, > > > > I’m unable to get the slave registered with the master node. I’ve > configured both the

Re: Slaves not getting registered

2016-04-12 Thread Abhishek Amralkar
What error you are getting when you are starting the mesos slaves? On 12-Apr-2016, at 1:32 PM, aishwarya.adyanth...@accenture.com wrote: Hi,   I’m unable to get the slave registered with the master node. I’ve configured both the masters and slave machines but am unable t

Re: marathon issue in running a docker container.

2016-04-12 Thread Abhishek Amralkar
anything is sandbox logs, why the tasks are getting killed? `stderr` and `stdout`? On 12-Apr-2016, at 1:35 PM, haosdent wrote: >Its frequently changing the deployment status to Staged Do you find any related log in mesos when marathon lauch the task? On Tue, Apr 12,

Re: marathon issue in running a docker container.

2016-04-12 Thread nirmalendu swain
When marathon launches the task while network : "BRIDGE", I can see from the mesos log that its listening to some host:port Even I can do an ssh to that instance, also saw MONGO_URL as env value over there. Pasting the log below --  --container="mesos-87849fd2-fda9-4d6a-870f-de101a5bdc59-S1.864e0

Re: marathon issue in running a docker container.

2016-04-12 Thread nirmalendu swain
>From the stderr log, nothing can be figure out. From stout log, it says server >running at host:port. But If I do telnet or curl, it does not show me any >reponse.output of stderr log : I0412 08:16:12.842341  9909 exec.cpp:134] Version: 0.27.1I0412 08:16:12.844701   9934 exec.cpp:208] Executor r

Re: marathon issue in running a docker container.

2016-04-12 Thread Rad Gruchalski
Do you have anything like haproxy for port mappings installed on your Mesos cluster? When using BRIDGE network, your process inside of the container, say SSH, is running on a standard port 22. Marathon allocates a random port in the default range of 31000 to 32000. However, it is your task to

RE: Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
Hi, There is no error while starting the slave machine. I have two masters and one slave configured and the slave isn’t getting registered in the GUI. Previously when I worked on single master and single slave, the GUI showed the activated status of slave. Thank you. From: Abhishek Amralkar

Re: SharedFilesystemIsolator (filesystem/shared)

2016-04-12 Thread Erb, Stephan
Hi Jie, I tried the new isolator and ran into an issue. Details are available in this ticket: https://issues.apache.org/jira/browse/MESOS-5187? Best Regards, Stephan From: Jie Yu Sent: Monday, April 11, 2016 17:37 To: user@mesos.apache.org Subject: Re: Shared

RE: Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
The GUI log shows like this: I0412 08:45:51.379609 3616 master.cpp:3673] Processing DECLINE call for offers: [ 74f33592-fc48-4066-a59c-977818b4c13c-O282 ] for framework 74f33592-fc48-4066-a59c-977818b4c13c-0001 (chronos-2.4.0) at scheduler-15022696-44ec-43d2-b193-a3cc4021d20e@30.30.30.48:42208

Re: Slaves not getting registered

2016-04-12 Thread Dick Davies
There's no mention of a slave there, have a look at the logs on the slaves filesystem and see if it is giving any errors. On 12 April 2016 at 10:17, wrote: > The GUI log shows like this: > > > > I0412 08:45:51.379609 3616 master.cpp:3673] Processing DECLINE call for > offers: [ 74f33592-fc48-40

Re: marathon issue in running a docker container.

2016-04-12 Thread nirmalendu swain
I might be wrong here. But I am using marathon-lb package of DCOS which already has haproxy. On Tuesday, 12 April 2016 2:14 PM, Rad Gruchalski wrote: Do you have anything like haproxy for port mappings installed on your Mesos cluster? When using BRIDGE network, your process inside o

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Paul Bell
FWIW, I quite agree with Zameer's point. That said, I want to make abundantly clear that in my experience the folks at Mesosphere are wonderfully helpful. But what happens if down the road Mesosphere is acquired or there occurs some other event that could represent, if not a conflict of interest,

Re: Slaves not getting registered

2016-04-12 Thread June Taylor
Try looking in /var/log/mesos/ at these files: mesos-slave.WARNING, mesos-slave.INFO, mesos-slave.ERROR Thanks, June Taylor System Administrator, Minnesota Population Center University of Minnesota On Tue, Apr 12, 2016 at 4:36 AM, Dick Davies wrote: > There's no mention of a slave there, have

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread June Taylor
I heartily agree on both points. While I've found Mesosphere's documentation very helpful, it is often mixed up with the DCOS commercial offering. That may be something we're interested in down the road, but right now we are trying to stand up a relatively small cluster using straight Mesos/Maratho

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Adam Bordelon
We've discussed Apache-built/distributed packages before, and nobody has any objections, but we need somebody to take on the work to get the package builds setup. I believe Vinod had some thoughts on how to get started, but any Apache committer (Zameer?) should have access to builds.apache.org I th

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Steven Borrelli
I’d be willing to assist in the effort to have standard packages (and additional packages for modules like net-modules). Steven Borrelli st...@borrelli.org > On Apr 12, 2016, at 11:10 AM, Adam Bordelon wrote: > > We've discussed Apache-built/distributed packages before, and nobody > has any

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Kapil Arya
At Mesosphere, we are planning to enable SSL into the nightlies starting sometime later this week. The goal is to have both SSL and non-SSL Mesos packages for Mesos 0.29.0 onwards in the Mesosphere deb/rpm repos. I will send out another email as soon as the stuff is ready for the community. Best,

Re: marathon issue in running a docker container.

2016-04-12 Thread haosdent
>If I do telnet or curl, it does not show me any reponse. Looks wired here, could you find the status of task is running or other status in mesos webui or marathon webui? And is it possible for you to use `docker ps` to find out the container and use `docker exec` to enter container and check whet

Re: marathon issue in running a docker container.

2016-04-12 Thread haosdent
>Server running at: http://0.0.0.0:7683 And according to your log, your service is running on 7683 while your configuration use 8080 in portMapping. On Wed, Apr 13, 2016 at 1:25 AM, haosdent wrote: > >If I do telnet or curl, it does not show me any reponse. > Looks wired here, could you find the

Re: Slaves not getting registered

2016-04-12 Thread haosdent
>Is there a way to find out the number of masters that are present in the environment together through CLI/GUI? There is a command line tool `mesos-resolve` which can show master info in terminal. But I think it may not related to your problem here. If your mesos doesn't depend on zookeeper, it sho

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Zameer Manji
For the record, I am not a committer on the Apache Mesos project and I do not have the time to contribute packaging tools for the project. I think existing committers who are Mesosphere employees can kick start this effort by asking their employer to contribute the existing tools to the project. Th

Re: Mesos 0.28 SSL in official packages

2016-04-12 Thread Adam Bordelon
Mesosphere open-sourced the package-building scripts long ago: https://github.com/mesosphere/mesos-deb-packaging The TeamCity configuration, however, is internal-only, but that wouldn't work for Apache anyway, since Apache doesn't use TeamCity AFAIK. On Tue, Apr 12, 2016 at 11:07 AM, Zameer Manji

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-12 Thread Justin Ryan
Thanks to those who’ve taken the time to be helpful here, I dug into this a bit more today and verified some things: * When I run the scheduler on the active mesos master and tcpdump traffic to port 5050, I see none. * It does identify its’ IP as the running master at startup. * I do see z

Package release schedule for Mesosphere Mesos 0.29

2016-04-12 Thread June Taylor
What's the release schedule for Mesosphere's packaging of Mesos, currently at 0.28, and I'm specifically interested in when 0.29 would be expected? Is there a routine schedule for this? Thanks, June Taylor System Administrator, Minnesota Population Center University of Minnesota

Mesos Community Sync 4/7/16 - Notes

2016-04-12 Thread Greg Mann
Hey Mesos community, Find below the notes from our last Mesos Community Sync meeting, which occurred on 4/7/2016 at 9am PST. The next community sync will be on 4/21/2016 from 3-4pm PST. See the Mesos community calendar for upcoming meetings (note that you can a

Re: Package release schedule for Mesosphere Mesos 0.29

2016-04-12 Thread Vinod Kone
As discussed in the previous community sync (notes of which have just been emailed to the list), 0.29.0 is being targeted for mid-may. Mesosphere packages are automatically released (within hours) following the official release. On Tue, Apr 12, 2016 at 1:45 PM, June Taylor wrote: > What's the r

Re: mesos/kafka issues (org.apache.mesos.Scheduler)

2016-04-12 Thread Justin Ryan
Learn something new every day, I needed ‘-nvi any’ for tcpdump because it routes traffic to its’ own external ip over the local interface. So now at least I see the attempt to register, but I’m still not sure what I’m looking for. This at least confirms there’s no wierd netfilter rules, the pr

[RESULT][VOTE] Release Apache Mesos 0.24.2 (rc5)

2016-04-12 Thread Michael Park
Hi all, The vote for Mesos 0.24.2 (rc5) has passed with the following votes. +1 (Binding) -- * Benjamin Mahler * Vinod Kone * Kapil Arya +1 (Non-binding) -- N/A There were no 0 or -1 votes. Please find the release at: https://dist.apache.

[RESULT][VOTE] Release Apache Mesos 0.25.1 (rc4)

2016-04-12 Thread Michael Park
Hi all, The vote for Mesos 0.25.1 (rc4) has passed with the following votes. +1 (Binding) -- Benjamin Mahler Vinod Kone Kapil Arya +1 (Non-binding) -- N/A There were no 0 or -1 votes. Please find the release at: https://dist.apache.org/re

[RESULT][VOTE] Release Apache Mesos 0.26.1 (rc4)

2016-04-12 Thread Michael Park
Hi all, The vote for Mesos 0.26.1 (rc4) has passed with the following votes. +1 (Binding) -- Benjamin Mahler Vinod Kone Kapil Arya +1 (Non-binding) -- N/A There were no 0 or -1 votes. Please find the release at: https://dist.apache.org/re

Mesos interconnection among clusters project calico

2016-04-12 Thread Stefano Bianchi
Hi all i have to set up two mesos clusters. On each cluster i should integrate Project calico in order to distribute tasks among the agents. But these tasks should be sent also from a slave of one cluster to the slave of the other cluster. I know that when i start calico on each slaves, it registe

[Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Greg Mann
Hey folks! A number of situations have arisen in which the default value of the Mesos agent `--work_dir` flag (/tmp/mesos) has caused problems on systems in which the automatic cleanup of '/tmp' deletes agent metadata. To resolve this, we would like to eliminate the default value of the agent `--wo

Re: SharedFilesystemIsolator (filesystem/shared)

2016-04-12 Thread Jie Yu
Stephan, Thanks for testing! I'll try to address that ticket and will make sure not removing filesystem/shared before we fix that issue. - Jie On Tue, Apr 12, 2016 at 2:16 AM, Erb, Stephan wrote: > Hi Jie, > > > I tried the new isolator and ran into an issue. Details are available in > this ti

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Zameer Manji
+1 I have seen this confuse many users of Apache Aurora many times. Eliminating the default will cause operators to select a location with the appropriate persistence properties. On Tue, Apr 12, 2016 at 3:58 PM, Greg Mann wrote: > Hey folks! > A number of situations have arisen in which the def

RE: Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
cation requests will be refused root@master1:/var/log/mesos# tail -f mesos-master.master1.invalid-user.log.INFO.20160412-11014 tail: cannot open ‘mesos-master.master1.invalid-user.log.INFO.20160412-11014’ for reading: No such file or directory root@master1:/var/log/mesos# tail -f mesos-master.m

Re: Slaves not getting registered

2016-04-12 Thread Abhishek Amralkar
Log line format: [IWEF]mmdd hh:mm:ss.uu threadid file:line] msg W0412 11:01:49.024226  3712 authenticator.cpp:511] No credentials provided, authentication requests will be refused   root@master1:/var/log/mesos# tail -f mesos-master.master1.invalid-user.log.INFO.20160412-11014 tail

RE: Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
ser.log.INFO.20160412-11014 tail: cannot open ‘mesos-master.master1.invalid-user.log.INFO.20160412-11014’ for reading: No such file or directory root@master1:/var/log/mesos# tail -f mesos-master.master1.invalid-user.log.INFO.20160412-11014 mesos-master.master1.invalid-user.log.INFO.20160412-110143.

Re: Slaves not getting registered

2016-04-12 Thread haosdent
>I0412 11:01:50.586612 3732 recover.cpp:578] Successfully joined the Paxos group According to this, master 1 should connect to zk successfully. >root@slave1:/var/log/mesos# tail -f mesos-slave.slave1.invalid-user.log.INFO.20160412-110554.1696 >I0413 03:12:54.532676 1711 grou

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread James Peach
> On Apr 12, 2016, at 3:58 PM, Greg Mann wrote: > > Hey folks! > A number of situations have arisen in which the default value of the Mesos > agent `--work_dir` flag (/tmp/mesos) has caused problems on systems in which > the automatic cleanup of '/tmp' deletes agent metadata. To resolve this,

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Jie Yu
+1 On Tue, Apr 12, 2016 at 9:29 PM, James Peach wrote: > > > On Apr 12, 2016, at 3:58 PM, Greg Mann wrote: > > > > Hey folks! > > A number of situations have arisen in which the default value of the > Mesos agent `--work_dir` flag (/tmp/mesos) has caused problems on systems > in which the autom

Re: Mesos Community Sync 4/7/16 - Notes

2016-04-12 Thread Sam Chen
Hi Grep, I am located in China. I would love to participate in next meeting. Could you please share me how to join in? appreciated. Regards, Sam On Wed, Apr 13, 2016 at 5:07 AM, Greg Mann wrote: > Hey Mesos community, > Find below the notes from our last Mesos Community Sync meeting, which > oc

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Timothy Chen
+1 Tim On Wed, Apr 13, 2016 at 5:31 AM, Jie Yu wrote: > +1 > > On Tue, Apr 12, 2016 at 9:29 PM, James Peach wrote: > >> >> > On Apr 12, 2016, at 3:58 PM, Greg Mann wrote: >> > >> > Hey folks! >> > A number of situations have arisen in which the default value of the >> Mesos agent `--work_dir`

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread Avinash Sridharan
+1 On Tue, Apr 12, 2016 at 9:31 PM, Jie Yu wrote: > +1 > > On Tue, Apr 12, 2016 at 9:29 PM, James Peach wrote: > > > > > > On Apr 12, 2016, at 3:58 PM, Greg Mann wrote: > > > > > > Hey folks! > > > A number of situations have arisen in which the default value of the > > Mesos agent `--work_dir

RE: Slaves not getting registered

2016-04-12 Thread aishwarya.adyanthaya
log/mesos# tail -f >mesos-slave.slave1.invalid-user.log.INFO.20160412-110554.1696 >I0413 03:12:54.532676 1711 group.cpp:519] ZooKeeper session expired >I0413 03:12:58.757953 1715 slave.cpp:4304] Current disk usage 6.44%. Max >allowed age: 5.848917453828577days >W0413 03:13:04.53

Re: Slaves not getting registered

2016-04-12 Thread Abhishek Amralkar
sos# tail -f mesos-slave.slave1.invalid-user.log.INFO.20160412-110554.1696 >I0413 03:12:54.532676  1711 group.cpp:519] ZooKeeper session expired >I0413 03:12:58.757953  1715 slave.cpp:4304] Current disk usage 6.44%. Max allowed age: 5.848917453828577days >W0413 03:13:04.539577  17

Hybrid application deployments (container/VM/bare metal) in Mesos

2016-04-12 Thread Xiaoning Ding
Hello, I'm wondering if someone here can help point me some document links about hybrid application deployment in Mesos. The basic idea is that we have some applications in mixed flavors (container, VM, bare metal) and we want to run them on a single cluster. Let me explain by an example. Say

Re: Mesos Community Sync 4/7/16 - Notes

2016-04-12 Thread haosdent
Hi, @Sam There is a community sync document. https://docs.google.com/document/d/153CUCj5LOJCFAVpdDZC7COJDwKh9RDjxaTA0S7lzwDA/edit You could get the hangout link and live video link in doucment when the community sync starting. The general schedule rule seems host every two weeks. One for 9am PST(2

Re: Slaves not getting registered

2016-04-12 Thread haosdent
t; *Cc:* Kumari, Suruchi > *Subject:* Re: Slaves not getting registered > > >I0412 11:01:50.586612 3732 recover.cpp:578] Successfully joined the > Paxos group > > According to this, master 1 should connect to zk successfully. > > >root@slave1:/var/log/mesos# tail -f &g

Re: [Proposal] Remove the default value for agent work_dir

2016-04-12 Thread tommy xiao
how about /opt/mesos/ +1 2016-04-13 12:44 GMT+08:00 Avinash Sridharan : > +1 > > On Tue, Apr 12, 2016 at 9:31 PM, Jie Yu wrote: > >> +1 >> >> On Tue, Apr 12, 2016 at 9:29 PM, James Peach wrote: >> >> > >> > > On Apr 12, 2016, at 3:58 PM, Greg Mann wrote: >> > > >> > > Hey folks! >> > > A numbe

Re: Hybrid application deployments (container/VM/bare metal) in Mesos

2016-04-12 Thread tommy xiao
mesos + marathon natively support your heterogeneous app. are you some concerns? 2016-04-13 13:57 GMT+08:00 Xiaoning Ding : > Hello, > > I'm wondering if someone here can help point me some document links about > hybrid application deployment in Mesos. The basic idea is that we have some > applic

Mesos-master url in HA

2016-04-12 Thread Guillermo Rodriguez
Hi, I have 3 mesos master setup for HA. One has the lead. http://172.31.35.90:5050/master/slaves <<< leader, returns slave list http://172.31.35.91:5050/master/slaves <<< follower, returns empty list http://172.31.35.92:5050/master/slaves <<< follower, returns empty list I have

Re: Hybrid application deployments (container/VM/bare metal) in Mesos

2016-04-12 Thread Guangya Liu
If you do not want to provision VM or PM on demand, then mesos plus marathon can help. There is also a JIRA talking about support Qemu/KVM in mesos https://issues.apache.org/jira/browse/MESOS-2717 On Wed, Apr 13, 2016 at 2:17 PM, tommy xiao wrote: > mesos + marathon natively support your hetero

Re: Mesos Masters Leader Keeps Fluctuating

2016-04-12 Thread haosdent
Do all the network between 3 Mesos masters and zookeeper are stable? Is it lost packets when you ping zookeeper servers in every Mesos master? And if possible, could you post the related Mesos masters logs as well? On Wed, Apr 13, 2016 at 2:26 PM, wrote: > Hi, > > > > I have set the quorum valu