Re: Tasks always lost

2014-07-02 Thread Vinod Kone
On Tue, Jul 1, 2014 at 9:12 PM, qingyang li liqingyang1...@gmail.com wrote: '20140702-113428-1694607552-5050-17766-' failed to start: Failed to fetch URIs for container 'af557235-2d5f-4062-aaf3-a747cb3cd0d1': exit status 32512 looks like the mesos slave is unable to fetch the executor

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Ian Downes
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23214/#review47154 --- Did you consider: hashmapContainerID, OptionInfo* infos This is

Jenkins build is back to normal : mesos-reviewbot #1061

2014-07-02 Thread Apache Jenkins Server
See https://builds.apache.org/job/mesos-reviewbot/1061/

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Brian Wickman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/#review47213 --- This seems to be missing the setup.py(.in)s from mesos.api,

Re: Review Request 23220: Fixed and renamed AllocatorZooKeeper tests.

2014-07-02 Thread Jiang Yan Xu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23220/#review47214 --- Ship it! Looks good. I assume that they have been run with many

Re: Review Request 23221: PortMappingMesosTest: added a test to ensure that all configuations are cleaned up for an orphan container

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23221/#review47223 --- Bad patch! Reviews applied: [23214, 23221] Failed command: make

Re: Mesos never finds a master

2014-07-02 Thread Kevin Lyda
On Wed, Jul 2, 2014 at 6:45 PM, Kevin Lyda ke...@ie.suberic.net wrote: What should I be hunting down here? I tried clearing any previous state by removing /tmp/mesos, /var/lib/mesos/replicated_log and by removing the /mesos tree in zookeeper (delete /mesos/log_replicas and delete /mesos) The

Re: #MesosCon $99 early-bird registration ends Friday!

2014-07-02 Thread Dave Lester
Friendly reminder and email bump: Early-bird (US$99) registration for #MesosCon http://events.linuxfoundation.org/events/mesoscon ends this Friday, at which point it will raise to US$299. On Mon, Jun 30, 2014 at 11:12 AM, Dave Lester daveles...@gmail.com wrote: Hi All, Early-bird (US$99)

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/#review47228 --- Bad patch! Reviews applied: [23224] Failed command: ./bootstrap

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
On July 2, 2014, 6:17 p.m., Brian Wickman wrote: This seems to be missing the setup.py(.in)s from mesos.api, mesos.native, mesos.protocol. Forget to git add? Otherwise this is looking great! Argh, how'd I miss those. Thanks! - Thomas

Re: Mesos never finds a master

2014-07-02 Thread Jie Yu
Kevin, How many masters did you start? - Jie On Wed, Jul 2, 2014 at 12:35 PM, Kevin Lyda ke...@ie.suberic.net wrote: On Wed, Jul 2, 2014 at 6:45 PM, Kevin Lyda ke...@ie.suberic.net wrote: What should I be hunting down here? I tried clearing any previous state by removing /tmp/mesos,

Re: Mesos never finds a master

2014-07-02 Thread Jie Yu
Looks like there are only 2 masters: I0702 20:21:20.772124 14260 network.hpp:461] ZooKeeper group PIDs: { log-replica(1)@10.196.106.219:5050, log-replica(1)@10.196.106.221:5050 } The replicated log cannot be initialized unless all the masters are available. - Jie On Wed, Jul 2, 2014 at 1:42

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/ --- (Updated July 2, 2014, 8:46 p.m.) Review request for mesos. Bugs: MESOS-857

Re: Review Request 23220: Fixed and renamed AllocatorZooKeeper tests.

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23220/#review47233 --- Patch looks great! Reviews applied: [23220] All tests passed. -

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/#review47235 --- Bad patch! Reviews applied: [23224] Failed command: make -j3

Re: Review Request 22123: Failover boolean to prevent using large timeout values

2014-07-02 Thread Adam B
On June 26, 2014, 1:39 a.m., Adam B wrote: include/mesos/mesos.proto, line 128 https://reviews.apache.org/r/22123/diff/1/?file=601126#file601126line128 Please add some documentation to the FrameworkInfo comment that explains what a value of failover=true means and when it should

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Chi Zhang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23214/ --- (Updated July 2, 2014, 10:55 p.m.) Review request for mesos, Ian Downes, Jie

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Chi Zhang
On July 2, 2014, 5:43 p.m., Ian Downes wrote: Did you consider: hashmapContainerID, OptionInfo* infos This is explicit that we support containers that optionally have network isolation. Then we can disambiguate so e.g., calling watch() or usage() for a container that we know about

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Chi Zhang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23214/#review47211 --- src/tests/port_mapping_tests.cpp

Jenkins build is back to normal : Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Set-JAVA_HOME #2226

2014-07-02 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mesos-Trunk-Ubuntu-Build-Out-Of-Src-Set-JAVA_HOME/2226/changes

Re: Review Request 23221: PortMappingMesosTest: added a test to ensure that all configuations are cleaned up for an orphan container

2014-07-02 Thread Chi Zhang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23221/#review47242 --- this patch is now in https://reviews.apache.org/r/23214 - Chi

Re: Review Request 23250: Created an example LoadGeneratorScheduler to test Master's framework rate limiting feature.

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23250/#review47252 --- Patch looks great! Reviews applied: [23250] All tests passed. -

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Ian Downes
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23246/#review47253 --- Ship it!

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Ian Downes
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23246/#review47254 --- src/slave/containerizer/isolators/network/port_mapping.cpp

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Jie Yu
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23214/#review47246 --- src/slave/containerizer/isolators/network/port_mapping.cpp

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Jie Yu
On July 2, 2014, 11:22 p.m., Ian Downes wrote: src/slave/containerizer/isolators/network/port_mapping.cpp, line 1084 https://reviews.apache.org/r/23246/diff/2/?file=623104#file623104line1084 I don't follow why you need to bind mount the directory to itself? Can't you do this:

Re: Review Request 22796: Add timeout to rescind unused offers

2014-07-02 Thread Timothy Chen
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22796/ --- (Updated July 2, 2014, 11:37 p.m.) Review request for mesos, Adam B and Niklas

Re: Review Request 23216: Added the low level scheduler example using pthread.

2014-07-02 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23216/#review47220 --- src/examples/low_level_scheduler_pthread.cpp

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/ --- (Updated July 2, 2014, 11:53 p.m.) Review request for mesos. Bugs: MESOS-857

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Brian Wickman
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/#review47259 --- thanks for doing this work. this will be a huge improvement going

Re: Review Request 21277: Passed CommandInfo to mesos-fetcher as JSON.

2014-07-02 Thread Dominic Hamon
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/21277/#review47261 --- Ship it! Ship It! - Dominic Hamon On May 9, 2014, 12:05 p.m.,

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23246/#review47262 --- Patch looks great! Reviews applied: [23246] All tests passed. -

Re: Review Request 23214: PortMapping: allow containers to recover even when they were not managed by Network Isolator previously.

2014-07-02 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23214/#review47258 --- src/slave/containerizer/isolators/network/port_mapping.cpp

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Chi Zhang
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23246/#review47256 --- Throwing this out here: an even better goal is that within the

Re: Review Request 23246: Used shared bind mount to fix MESOS-1558.

2014-07-02 Thread Chi Zhang
On July 3, 2014, 12:18 a.m., Chi Zhang wrote: src/slave/containerizer/isolators/network/port_mapping.cpp, line 1071 https://reviews.apache.org/r/23246/diff/2/?file=623104#file623104line1071 is this sufficient to check BIND_MOUNT_ROOT is _self_ mounted? a little update: - only

Re: Review Request 22796: Add timeout to rescind unused offers

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22796/#review47265 --- Bad patch! Reviews applied: [22796] Failed command:

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
On July 3, 2014, 12:10 a.m., Brian Wickman wrote: src/python/native/setup.py.in, line 29 https://reviews.apache.org/r/23224/diff/4/?file=623204#file623204line29 what's your philosophy on versioning here? should we always require deps==version or just deps=major,major+1

Re: Review Request 23250: Created an example LoadGeneratorScheduler to test Master's framework rate limiting feature.

2014-07-02 Thread Vinod Kone
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23250/#review47264 --- Ship it! src/examples/load_generator_framework.cpp

Re: Review Request 22796: Add timeout to rescind unused offers

2014-07-02 Thread Timothy Chen
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22796/ --- (Updated July 3, 2014, 12:42 a.m.) Review request for mesos, Adam B and Niklas

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
On July 3, 2014, 12:10 a.m., Brian Wickman wrote: src/Makefile.am, line 137 https://reviews.apache.org/r/23224/diff/1/?file=622309#file622309line137 i would love to see all protos contained here, including messages protos. this will allow for the development of a pure python

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
On July 3, 2014, 12:10 a.m., Brian Wickman wrote: src/python/protocol/setup.py.in, line 27 https://reviews.apache.org/r/23224/diff/4/?file=623207#file623207line27 same here, to be conservative, might want protobuf=2.5.0,3 Good call. On July 3, 2014, 12:10 a.m., Brian Wickman

Re: Review Request 23224: Refactored the python bindings into multiple modules.

2014-07-02 Thread Thomas Rampelberg
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23224/ --- (Updated July 3, 2014, 12:46 a.m.) Review request for mesos. Bugs: MESOS-857

Re: Review Request 22796: Add timeout to rescind unused offers

2014-07-02 Thread Mesos ReviewBot
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22796/#review47273 --- Patch looks great! Reviews applied: [22796] All tests passed. -

Re: Review Request 22313: MESOS-886: Prevented slave from launching tasks before containerize's update completes.

2014-07-02 Thread Vinod Kone
On June 23, 2014, 6:50 p.m., Vinod Kone wrote: src/slave/slave.cpp, line 1185 https://reviews.apache.org/r/22313/diff/11/?file=612826#file612826line1185 Also, what about launching the tasks after updating resources in registerExecutor()? Yifan Gu wrote: Sounds good,

Re: Review Request 23216: Added the low level scheduler example using pthread.

2014-07-02 Thread Zuyu Zhang
On July 2, 2014, 11:38 p.m., Vinod Kone wrote: src/examples/low_level_scheduler_pthread.cpp, line 119 https://reviews.apache.org/r/23216/diff/1/?file=622210#file622210line119 do we need to protect this via mutex? afaict, all the callbacks (connected, detected and received) are

Re: Review Request 22313: MESOS-886: Prevented slave from launching tasks before containerize's update completes.

2014-07-02 Thread Yifan Gu
On June 23, 2014, 6:50 p.m., Vinod Kone wrote: src/slave/slave.cpp, line 1185 https://reviews.apache.org/r/22313/diff/11/?file=612826#file612826line1185 Also, what about launching the tasks after updating resources in registerExecutor()? Yifan Gu wrote: Sounds good,