[jira] [Assigned] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3270:
---

Assignee: haosdent

 Fix error GMOCK_CONFIG_CMD in CMake
 ---

 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent

 The GMOCK_CONFIG_CMD don't contains ${CMAKE_CXX_FLAGS}, this make gmock build 
 with GTEST_LANG_CXX11 failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2588) Create pre-create hook before a Docker container launches

2015-08-15 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698301#comment-14698301
 ] 

haosdent commented on MESOS-2588:
-

ping [~tnachen] Could you review the patch again? Thank you in advance.

 Create pre-create hook before a Docker container launches
 -

 Key: MESOS-2588
 URL: https://issues.apache.org/jira/browse/MESOS-2588
 Project: Mesos
  Issue Type: Bug
  Components: docker
Reporter: Timothy Chen
Assignee: haosdent

 To be able to support custom actions to be called before launching a docker 
 contianer, we should create a hook that can be extensible and allow 
 module/hooks to be performed before a docker container is launched.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3270:

Description: 
The GMOCK_CONFIG_CMD don't contains $\{CMAKE_CXX_FLAGS\}, this make gmock build 
with GTEST_LANG_CXX11 failed.

This bug is report by [~hausdorff] in https://reviews.apache.org/r/37370/ 

  was:The GMOCK_CONFIG_CMD don't contains \$\{CMAKE_CXX_FLAGS\}, this make 
gmock build with GTEST_LANG_CXX11 failed.


 Fix error GMOCK_CONFIG_CMD in CMake
 ---

 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent

 The GMOCK_CONFIG_CMD don't contains $\{CMAKE_CXX_FLAGS\}, this make gmock 
 build with GTEST_LANG_CXX11 failed.
 This bug is report by [~hausdorff] in https://reviews.apache.org/r/37370/ 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3270:

Description: The GMOCK_CONFIG_CMD don't contains \$\{CMAKE_CXX_FLAGS\}, 
this make gmock build with GTEST_LANG_CXX11 failed.  (was: The GMOCK_CONFIG_CMD 
don't contains {code}${CMAKE_CXX_FLAGS}{code}, this make gmock build with 
GTEST_LANG_CXX11 failed.)

 Fix error GMOCK_CONFIG_CMD in CMake
 ---

 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent

 The GMOCK_CONFIG_CMD don't contains \$\{CMAKE_CXX_FLAGS\}, this make gmock 
 build with GTEST_LANG_CXX11 failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)
haosdent created MESOS-3270:
---

 Summary: Fix error GMOCK_CONFIG_CMD in CMake
 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent


The GMOCK_CONFIG_CMD don't contains ${CMAKE_CXX_FLAGS}, this make gmock build 
with GTEST_LANG_CXX11 failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3270:

Description: The GMOCK_CONFIG_CMD don't contains 
{code}${CMAKE_CXX_FLAGS}{code}, this make gmock build with GTEST_LANG_CXX11 
failed.  (was: The GMOCK_CONFIG_CMD don't contains ${CMAKE_CXX_FLAGS}, this 
make gmock build with GTEST_LANG_CXX11 failed.)

 Fix error GMOCK_CONFIG_CMD in CMake
 ---

 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent

 The GMOCK_CONFIG_CMD don't contains {code}${CMAKE_CXX_FLAGS}{code}, this make 
 gmock build with GTEST_LANG_CXX11 failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3270) Fix error GMOCK_CONFIG_CMD in CMake

2015-08-15 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3270:

Description: The GMOCK_CONFIG_CMD don't contains ${CMAKE_CXX_FLAGS}, this 
make gmock build with GTEST_LANG_CXX11 failed.  (was: The GMOCK_CONFIG_CMD 
don't contains ${CMAKE_CXX_FLAGS}, this make gmock build with GTEST_LANG_CXX11 
failed.)

 Fix error GMOCK_CONFIG_CMD in CMake
 ---

 Key: MESOS-3270
 URL: https://issues.apache.org/jira/browse/MESOS-3270
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent

 The GMOCK_CONFIG_CMD don't contains ${CMAKE_CXX_FLAGS}, this make gmock 
 build with GTEST_LANG_CXX11 failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3247) Add missing unimplemented.hpp to windows specific OS code

2015-08-10 Thread haosdent (JIRA)
haosdent created MESOS-3247:
---

 Summary: Add missing unimplemented.hpp to windows specific OS code
 Key: MESOS-3247
 URL: https://issues.apache.org/jira/browse/MESOS-3247
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent
Assignee: haosdent


Currently windows specific OS code missing #include 
stout/unimplemented.hpp. It let compile in windows failed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3218) Enhance the bootstrap script

2015-08-06 Thread gyliu (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14660100#comment-14660100
 ] 

gyliu commented on MESOS-3218:
--

The bootstrap is only for developers and it is not suggested to run ln -s xx 
xx manually by developer.

 Enhance the bootstrap script
 

 Key: MESOS-3218
 URL: https://issues.apache.org/jira/browse/MESOS-3218
 Project: Mesos
  Issue Type: Improvement
Reporter: Chen Zhiwei
Assignee: Chen Zhiwei
Priority: Trivial

 Running bootstrap command in Mesos root directory will create some link files.
 But it can't handle below case:
 1. Users accidently create an invalid link file
 $ ln -s .gitignore-templat .gitignore
 2. Then running bootstrap command will show an error message
 $ ./bootstrap 
 ln: failed to create symbolic link ‘.gitignore’: File exists
 The `test -e` command can't recognize invalid link file.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2337) __init__.py not getting installed in $PREFIX/lib/pythonX.Y/site-packages/mesos

2015-08-04 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14653859#comment-14653859
 ] 

haosdent commented on MESOS-2337:
-

Refrence openstack, them have 
/usr/lib/python2.7/dist-packages/openstack/__init__.py under [common 
module|http://packages.ubuntu.com/zh-cn/precise/amd64/python-openstack-common/filelist],
 and other [sub 
module|http://packages.ubuntu.com/zh-cn/precise/amd64/python-openstack-compute/filelist]
 don't contains this.

 __init__.py not getting installed in $PREFIX/lib/pythonX.Y/site-packages/mesos
 --

 Key: MESOS-2337
 URL: https://issues.apache.org/jira/browse/MESOS-2337
 Project: Mesos
  Issue Type: Bug
  Components: build, python api
Reporter: Kapil Arya
Assignee: Marco Massenzio
Priority: Critical
  Labels: mesosphere

 When doing a {{make install}}, the src/python/native/src/mesos/__init__.py 
 file is not getting installed in 
 {{$PREFIX/lib/pythonX.Y/site-packages/mesos/}}.  
 This makes it impossible to do the following import when {{PYTHONPATH}} is 
 set to the {{site-packages}} directory.
 {code}
 import mesos.interface.mesos_pb2
 {code}
 The directories {{$PREFIX/lib/pythonX.Y/site-packages/mesos/interface, 
 native}} do have their corresponding {{__init__.py}} files.
 Reproducing the bug:
 {code}
 ../configure --prefix=$HOME/test-install  make install
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3141) Compiler warning when mocking function type has an enum return type.

2015-08-06 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14645670#comment-14645670
 ] 

haosdent edited comment on MESOS-3141 at 8/6/15 7:56 AM:
-

I also submit a simple patch to rbt, let it support binary file. 
https://github.com/reviewboard/rbtools/pull/50/files


was (Author: haosd...@gmail.com):
The pull request in Github. https://github.com/apache/mesos/pull/50/files 
Contains binary changes. [~mcypark]

 Compiler warning when mocking function type has an enum return type.
 

 Key: MESOS-3141
 URL: https://issues.apache.org/jira/browse/MESOS-3141
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Michael Park
Assignee: haosdent
  Labels: mesosphere

 The purpose of this ticket is to document a very cryptic error message 
 (actually a warning that gets propagated by {{-Werror}}) that gets generated 
 by {{clang-3.5}} from {{gmock}} source code when trying to mock a perfectly 
 innocent-looking function.
 h3. Problem
 The following code is attempting to mock a {{MesosExecutorDriver}}:
 {code}
 class MockMesosExecutorDriver : public MesosExecutorDriver {
 public:
   MockMesosExecutorDriver(mesos::Executor* executor)
 : MesosExecutorDriver(executor) {}
   MOCK_METHOD1(sendStatusUpdate, Status(const TaskStatus));
 };
 {code}
 The above code generates the following error message:
 {noformat}
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock.h:58:
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:46:
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  error: indirection of non-volatile null pointer will be deleted, not trap 
 [-Werror,-Wnull-dereference]
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:78:22:
  note: in instantiation of function template specialization 
 'testing::internal::Invalidmesos::Status' requested here
 return internal::InvalidT();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:190:43:
  note: in instantiation of member function 
 'testing::internal::BuiltInDefaultValuemesos::Status::Get' requested here
 internal::BuiltInDefaultValueT::Get() : *value_;
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1435:34:
  note: in instantiation of member function 
 'testing::DefaultValuemesos::Status::Get' requested here
 return DefaultValueResult::Get();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1334:22:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::PerformDefaultAction' requested here
 func_mocker-PerformDefaultAction(args, call_description));
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1448:26:
  note: in instantiation of function template specialization 
 'testing::internal::ActionResultHoldermesos::Status::PerformDefaultActionmesos::Status
  (const mesos::TaskStatus )' requested here
 return ResultHolder::PerformDefaultAction(this, args, call_description);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-generated-function-mockers.h:81:7:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::UntypedPerformDefaultAction' requested here
 class FunctionMockerR(A1) : public
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  note: consider using __builtin_trap() or qualifying pointer with 'volatile'
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 {noformat}
 The source of the issue here is that {{Status}} is an {{enum}}. In 
 {{gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h}} you can find 
 the following function:
 {code}
 template typename T
 T Invalid() {
   return *static_casttypename remove_referenceT::type*(NULL);
 }
 {code}
 This function gets called with the return type of a mocked function. In our 
 case,  the return type of the mocked function is {{Status}}.
 Attempting to compile the following minimal example with {{clang-3.5}} 
 reproduces the error message:
 {code}
 #include type_traits
 template typename T
 T invalid() {
   return *static_casttypename std::remove_referenceT::type *(nullptr

[jira] [Commented] (MESOS-3126) Building mesos.jar, uses old javadoc plugin in maven.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642756#comment-14642756
 ] 

haosdent commented on MESOS-3126:
-

You could try this docker image. 
https://registry.hub.docker.com/u/mesosphere/mesos/

 Building mesos.jar, uses old javadoc plugin in maven.
 -

 Key: MESOS-3126
 URL: https://issues.apache.org/jira/browse/MESOS-3126
 Project: Mesos
  Issue Type: Bug
  Components: build
Affects Versions: 0.23.0
 Environment: Centos 6.5. 
 Manually installed maven, version 3.3.3
 Manually  installing mesos.
Reporter: Hans van den Bogert
Priority: Minor

 When building Mesos, including the mesos.jar. It errors when generating the 
 javadoc:
 http://pastebin.com/TNRERS4p
 It seems the javadoc plugin is also trying to parse .class files, though the 
 source directories are correct in the pom file.
 Further investigation leads to think that the javadoc plugin version is to 
 blame. Setting a fixed version of 2.10.3 for the javadoc plugin, in the pom 
 file, successfully generates the javadoc.
 In this environment, the default javadoc plugin being downloaded, is 2.8.1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3144) Update Homebrew formula for Mesos (Mac OSX)

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14643754#comment-14643754
 ] 

haosdent commented on MESOS-3144:
-

Yes, only need update sha256 in your pull request.

 Update Homebrew formula for Mesos (Mac OSX)
 ---

 Key: MESOS-3144
 URL: https://issues.apache.org/jira/browse/MESOS-3144
 Project: Mesos
  Issue Type: Task
Reporter: Marco Massenzio
Assignee: Marco Massenzio
Priority: Trivial
  Labels: mesosphere, newbie

 We have pushed a [pull 
 request|https://github.com/Homebrew/homebrew/pull/42099] to Homebrew for the 
 new 0.23 formula.
 Once accepted, we must verify that this works on a Mac OSX device.
 This would also be a great time to ensure our documentation is up-to-date.
 Currently, the Homebrew check fails, as they have deprecated SHA-1 checksums:
 {noformat}
 Error Message
 failed: brew audit mesos
 Stacktrace
 Error: 7 problems in 1 formula
 mesos:
  * Stable resource protobuf: SHA1 checksums are deprecated, please use 
 SHA256
  * Stable resource python-gflags: SHA1 checksums are deprecated, please use 
 SHA256
  * Stable resource six: SHA1 checksums are deprecated, please use SHA256
  * Stable resource google-apputils: SHA1 checksums are deprecated, please 
 use SHA256
  * Stable resource python-dateutil: SHA1 checksums are deprecated, please 
 use SHA256
  * Stable resource boto: SHA1 checksums are deprecated, please use SHA256
  * Stable resource pytz: SHA1 checksums are deprecated, please use SHA256
 {noformat}
 Don't know enough about Homebrew to really figure out what is going on here; 
 nor how to fix this.
 The Mesos SHA-256 has been correctly entered and computed via the [Online 
 SHA/MD5 calculator|https://md5file.com/calculator].
 I guess, we should go download the packages and compute their SHA-256 and/or 
 research from the respective download sites whether they publish the SHA.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3141) Compiler warning when mocking function type has an enum return type.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14643768#comment-14643768
 ] 

haosdent commented on MESOS-3141:
-

[~mcypark] Seems rbt use git diff to generate patch, and git diff don't 
contains binary data change.

 Compiler warning when mocking function type has an enum return type.
 

 Key: MESOS-3141
 URL: https://issues.apache.org/jira/browse/MESOS-3141
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Michael Park
Assignee: haosdent
  Labels: mesosphere

 The purpose of this ticket is to document a very cryptic error message 
 (actually a warning that gets propagated by {{-Werror}}) that gets generated 
 by {{clang-3.5}} from {{gmock}} source code when trying to mock a perfectly 
 innocent-looking function.
 h3. Problem
 The following code is attempting to mock a {{MesosExecutorDriver}}:
 {code}
 class MockMesosExecutorDriver : public MesosExecutorDriver {
 public:
   MockMesosExecutorDriver(mesos::Executor* executor)
 : MesosExecutorDriver(executor) {}
   MOCK_METHOD1(sendStatusUpdate, Status(const TaskStatus));
 };
 {code}
 The above code generates the following error message:
 {noformat}
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock.h:58:
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:46:
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  error: indirection of non-volatile null pointer will be deleted, not trap 
 [-Werror,-Wnull-dereference]
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:78:22:
  note: in instantiation of function template specialization 
 'testing::internal::Invalidmesos::Status' requested here
 return internal::InvalidT();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:190:43:
  note: in instantiation of member function 
 'testing::internal::BuiltInDefaultValuemesos::Status::Get' requested here
 internal::BuiltInDefaultValueT::Get() : *value_;
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1435:34:
  note: in instantiation of member function 
 'testing::DefaultValuemesos::Status::Get' requested here
 return DefaultValueResult::Get();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1334:22:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::PerformDefaultAction' requested here
 func_mocker-PerformDefaultAction(args, call_description));
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1448:26:
  note: in instantiation of function template specialization 
 'testing::internal::ActionResultHoldermesos::Status::PerformDefaultActionmesos::Status
  (const mesos::TaskStatus )' requested here
 return ResultHolder::PerformDefaultAction(this, args, call_description);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-generated-function-mockers.h:81:7:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::UntypedPerformDefaultAction' requested here
 class FunctionMockerR(A1) : public
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  note: consider using __builtin_trap() or qualifying pointer with 'volatile'
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 {noformat}
 The source of the issue here is that {{Status}} is an {{enum}}. In 
 {{gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h}} you can find 
 the following function:
 {code}
 template typename T
 T Invalid() {
   return *static_casttypename remove_referenceT::type*(NULL);
 }
 {code}
 This function gets called with the return type of a mocked function. In our 
 case,  the return type of the mocked function is {{Status}}.
 Attempting to compile the following minimal example with {{clang-3.5}} 
 reproduces the error message:
 {code}
 #include type_traits
 template typename T
 T invalid() {
   return *static_casttypename std::remove_referenceT::type *(nullptr);
 }
 enum E { A, B };
 int main() {
   invalidE();
 }
 {code}
 * See it online on [GCC Explorer|https://goo.gl/t1FepZ]
 Note that if the type is not an {{enum}}, the warning is not generated. This 
 is why

[jira] [Assigned] (MESOS-2960) Configure DiscoveryInfo and Visibility per port

2015-07-27 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-2960:
---

Assignee: haosdent

 Configure DiscoveryInfo and Visibility per port
 ---

 Key: MESOS-2960
 URL: https://issues.apache.org/jira/browse/MESOS-2960
 Project: Mesos
  Issue Type: Improvement
  Components: general
Affects Versions: 0.22.1
Reporter: Frank Scholten
Assignee: haosdent
  Labels: mesosphere

 For Mesos Elasticsearch I like to use DiscoveryInfo to advertise the client 
 port (9200) with Visibility=EXTERNAL so it can be discovered by load 
 balancers while advertising the transport port (9300) as Visibility=FRAMEWORK 
 because it is used by nodes to talk too each other and should not be load 
 balanced.
 However, I can only set one DiscoveryInfo and one visibility, instead of one 
 per port. I suggest to allow multiple DiscoveryInfo's to be configured with 
 their own visibility.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3081) Default configuration of google logging

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642528#comment-14642528
 ] 

haosdent commented on MESOS-3081:
-

This ticket seems duplicated with 
https://issues.apache.org/jira/browse/MESOS-2153 and 
https://issues.apache.org/jira/browse/MESOS-2153

 Default configuration of google logging
 ---

 Key: MESOS-3081
 URL: https://issues.apache.org/jira/browse/MESOS-3081
 Project: Mesos
  Issue Type: Wish
Affects Versions: 0.22.1
Reporter: Rinaldo DiGiorgio
Priority: Minor

 The Google Logging api as configured requires that users look at many files 
 on many systems. When first using mesos log files are important.  The idea 
 that having the log output categorized for you is a good idea for production 
 systems but there are many ways and better tools for doing that like logstash 
 and elasticsearch.  It would be better if the default configuration was for 
 one logfile with standard conventions rather than many log files.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3057) Mesos web ui sorting by Id results in non-intuitive order.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642550#comment-14642550
 ] 

haosdent commented on MESOS-3057:
-

I think this ticket is not a problem. The task id maybe defined by your use 
framework, it could be test_a, test_b, test_c in other frameworks. The 
better way to fix it is let your use framwork use a fixed bit suffix. For 
example, like test_1, test_2.

 Mesos web ui sorting by Id results in non-intuitive order.
 --

 Key: MESOS-3057
 URL: https://issues.apache.org/jira/browse/MESOS-3057
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Cody Roseborough
Priority: Trivial
  Labels: newbie
 Attachments: web ui sorted by ids.png


 In the mesos webui sorting task by ID results in non-intuitive order. For 
 example with Id's task_0-task_200 sorted asc you get task_0, task_1, task_10, 
 task_100... task_109, task_11, task_110 etc. It happens if you use just 
 numbers as Id's also. 
 It seems like it should be sorted using natural sort order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3057) Mesos web ui sorting by Id results in non-intuitive order.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642556#comment-14642556
 ] 

haosdent commented on MESOS-3057:
-

[~Croseborough] Could you give your framework name to me? Is it open source? 

 Mesos web ui sorting by Id results in non-intuitive order.
 --

 Key: MESOS-3057
 URL: https://issues.apache.org/jira/browse/MESOS-3057
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Cody Roseborough
Priority: Trivial
  Labels: newbie
 Attachments: web ui sorted by ids.png


 In the mesos webui sorting task by ID results in non-intuitive order. For 
 example with Id's task_0-task_200 sorted asc you get task_0, task_1, task_10, 
 task_100... task_109, task_11, task_110 etc. It happens if you use just 
 numbers as Id's also. 
 It seems like it should be sorted using natural sort order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-527) Command Executors do not have Executor IDs in the master.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14643813#comment-14643813
 ] 

haosdent commented on MESOS-527:


Sorry for misunderstand this ticket #_#. Let me update again.

 Command Executors do not have Executor IDs in the master.
 -

 Key: MESOS-527
 URL: https://issues.apache.org/jira/browse/MESOS-527
 Project: Mesos
  Issue Type: Bug
Reporter: Benjamin Mahler
Assignee: haosdent
  Labels: twitter

 The webui is broken for command executors because the master does not know 
 the executor ID for the tasks using a command executor. This is because the 
 Task protobuf only has the executor_id field, no other field to indicate the 
 presence of the command executor.
 It seems the slave also doesn't set the Task.executor_id for command 
 executors, thus relying on it being optionally set in executorTerminated() to 
 determine whether the task used a command executor.
 This all seems pretty messy, a few things to consider:
 1) Should we simply always set the Task.executor_id for these tasks? The 
 master could do so currently, but there would be an implicit contract that 
 the slave and master both use the task id as the executor id.
 2) We can add a boolean is_command_executor to Task, so that both the master 
 and slave can set the field, and the slave can use the boolean in 
 executorTerminated() to determine whether the task used a command executor.
 3) Alternatively, we can add a /frameworks/FID/tasks/TID url format for the 
 broken links on the master webui, so that we can search for the task in the 
 slave state to locate its executor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3149) Use setuptools to install python cli package

2015-07-26 Thread haosdent (JIRA)
haosdent created MESOS-3149:
---

 Summary: Use setuptools to install python cli package
 Key: MESOS-3149
 URL: https://issues.apache.org/jira/browse/MESOS-3149
 Project: Mesos
  Issue Type: Task
Reporter: haosdent
Assignee: haosdent


mesos-ps/mesos-cat which depends on src/cli/python/mesos could not work in OSX 
because src/cli/python is not installed to sys.path. It's time to finish this 
TODO.
 
{code}
# Add 'src/cli/python' to PYTHONPATH.
# TODO(benh): Remove this if/when we install the 'mesos' module via
# PIP and setuptools.
PYTHONPATH=@abs_top_srcdir@/src/cli/python:${PYTHONPATH}
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3139) Incorporate CMake into standard documentation

2015-07-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642028#comment-14642028
 ] 

haosdent commented on MESOS-3139:
-

Got it. Looking forward CMake and running mesos in windows!

 Incorporate CMake into standard documentation
 -

 Key: MESOS-3139
 URL: https://issues.apache.org/jira/browse/MESOS-3139
 Project: Mesos
  Issue Type: Task
  Components: build
Reporter: Alex Clemmer
Assignee: Alex Clemmer
  Labels: build, cmake, mesosphere

 Right now it's anyone's guess how to build with CMake. If we want people to 
 use it, we should put up documentation. The central challenge is that the 
 CMake instructions will be slightly different for different platforms.
 For example, on Linux, the gist of the build is basically the same as 
 autotools; you pull down the system dependencies (like APR, _etc_.), and then:
 ```
 ./bootstrap
 mkdir build-cmake  cd build-cmake
 cmake ..
 make
 ```
 But, on Windows, it will be somewhat more complicated. There is no bootstrap 
 step, for example, because Windows doesn't have bash natively. And even when 
 we put that in, you'll still have to build the glog stuff out-of-band because 
 CMake has no way of booting up Visual Studio and calling build.
 So practically, we need to figure out:
 * What our build story is for different platforms
 * Write specific instructions for our core target platforms.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3111) Executor Process gets killed before executor_shutdown_grace_period

2015-07-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642202#comment-14642202
 ] 

haosdent commented on MESOS-3111:
-

{code}
class ShutdownProcess : public ProcessShutdownProcess
{
protected:
  virtual void initialize()
  {
VLOG(1)  Scheduling shutdown of the executor;
// TODO(benh): Pass the shutdown timeout with ExecutorRegistered
// since it might have gotten configured on the command line.
delay(slave::EXECUTOR_SHUTDOWN_GRACE_PERIOD, self(), Self::kill);
  }
{code}
{code}
  void shutdown()
  {
if (aborted) {
  VLOG(1)  Ignoring shutdown message because the driver is aborted!;
  return;
}

LOG(INFO)  Executor asked to shutdown;

if (!local) {
  // Start the Shutdown Process.
  spawn(new ShutdownProcess(), true);
}
{code}
And slave::EXECUTOR_SHUTDOWN_GRACE_PERIOD is default 5 seconds.

 Executor Process gets killed before executor_shutdown_grace_period
 --

 Key: MESOS-3111
 URL: https://issues.apache.org/jira/browse/MESOS-3111
 Project: Mesos
  Issue Type: Bug
  Components: framework
Affects Versions: 0.22.1
 Environment: OS X 10.10.3
Reporter: Rajesh Kumar

 I am starting mesos slave with following command:
 ./bin/mesos-slave.sh --master=zk://127.0.0.1:2181/global 
 --work_dir=/mnt1/logs/mesos/ --executor_registration_timeout=3mins 
 --executor_shutdown_grace_period=60secs
 As you see I have specified executor_shutdown_grace_period as 60 sec. When 
 framework is shutdown it is expected that executors process will be shutdown 
 gracefully. If executor takes longer than executor_shutdown_grace_period, it 
 will be killed by SIGKILL. But it seems its not happening as expected. 
 Looking at following slave log, request to shutdown comes around 14:55:25 and 
 executor is killed with SIGKILL at 14:55:30, just 5 secs later. I was 
 expecting it to wait till 60 secs.
 I0721 14:55:14.430140 178049024 slave.cpp:2164] Got registration for executor 
 'global' of framework global_2015-07-21_09-23-57_542 from 
 executor(1)@172.16.20.211:62623
 I0721 14:55:14.432839 178049024 slave.cpp:1555] Sending queued task 
 'Long_Running_Job 0' to executor 'global' of framework 
 global_2015-07-21_09-23-57_542
 I0721 14:55:23.275876 176975872 slave.cpp:3648] Current disk usage 36.83%. 
 Max allowed age: 3.721608945021898days
 I0721 14:55:25.493098 180195328 slave.cpp:1768] Asked to shut down framework 
 global_2015-07-21_09-23-57_542 by master@127.0.0.1:5050
 I0721 14:55:25.493131 180195328 slave.cpp:1793] Shutting down framework 
 global_2015-07-21_09-23-57_542
 I0721 14:55:25.493176 180195328 slave.cpp:3473] Shutting down executor 
 'global' of framework global_2015-07-21_09-23-57_542
 I0721 14:55:30.514751 179658752 containerizer.cpp:1123] Executor for 
 container 'de9f1d01-bbb6-4fbe-b41b-2a253498d6a1' has exited
 I0721 14:55:30.514798 179658752 containerizer.cpp:918] Destroying container 
 'de9f1d01-bbb6-4fbe-b41b-2a253498d6a1'
 I0721 14:55:30.538887 177512448 slave.cpp:3223] Executor 'global' of 
 framework global_2015-07-21_09-23-57_542 terminated with signal Killed: 9



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3141) Compiler warning when mocking function type has an enum return type.

2015-07-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642221#comment-14642221
 ] 

haosdent commented on MESOS-3141:
-

I update gmock to 1.7.0

 Compiler warning when mocking function type has an enum return type.
 

 Key: MESOS-3141
 URL: https://issues.apache.org/jira/browse/MESOS-3141
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Michael Park
Assignee: haosdent
  Labels: mesosphere

 The purpose of this ticket is to document a very cryptic error message 
 (actually a warning that gets propagated by {{-Werror}}) that gets generated 
 by {{clang-3.5}} from {{gmock}} source code when trying to mock a perfectly 
 innocent-looking function.
 h3. Problem
 The following code is attempting to mock a {{MesosExecutorDriver}}:
 {code}
 class MockMesosExecutorDriver : public MesosExecutorDriver {
 public:
   MockMesosExecutorDriver(mesos::Executor* executor)
 : MesosExecutorDriver(executor) {}
   MOCK_METHOD1(sendStatusUpdate, Status(const TaskStatus));
 };
 {code}
 The above code generates the following error message:
 {noformat}
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock.h:58:
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:46:
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  error: indirection of non-volatile null pointer will be deleted, not trap 
 [-Werror,-Wnull-dereference]
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:78:22:
  note: in instantiation of function template specialization 
 'testing::internal::Invalidmesos::Status' requested here
 return internal::InvalidT();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:190:43:
  note: in instantiation of member function 
 'testing::internal::BuiltInDefaultValuemesos::Status::Get' requested here
 internal::BuiltInDefaultValueT::Get() : *value_;
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1435:34:
  note: in instantiation of member function 
 'testing::DefaultValuemesos::Status::Get' requested here
 return DefaultValueResult::Get();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1334:22:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::PerformDefaultAction' requested here
 func_mocker-PerformDefaultAction(args, call_description));
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1448:26:
  note: in instantiation of function template specialization 
 'testing::internal::ActionResultHoldermesos::Status::PerformDefaultActionmesos::Status
  (const mesos::TaskStatus )' requested here
 return ResultHolder::PerformDefaultAction(this, args, call_description);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-generated-function-mockers.h:81:7:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::UntypedPerformDefaultAction' requested here
 class FunctionMockerR(A1) : public
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  note: consider using __builtin_trap() or qualifying pointer with 'volatile'
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 {noformat}
 The source of the issue here is that {{Status}} is an {{enum}}. In 
 {{gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h}} you can find 
 the following function:
 {code}
 template typename T
 T Invalid() {
   return *static_casttypename remove_referenceT::type*(NULL);
 }
 {code}
 This function gets called with the return type of a mocked function. In our 
 case,  the return type of the mocked function is {{Status}}.
 Attempting to compile the following minimal example with {{clang-3.5}} 
 reproduces the error message:
 {code}
 #include type_traits
 template typename T
 T invalid() {
   return *static_casttypename std::remove_referenceT::type *(nullptr);
 }
 enum E { A, B };
 int main() {
   invalidE();
 }
 {code}
 * See it online on [GCC Explorer|https://goo.gl/t1FepZ]
 Note that if the type is not an {{enum}}, the warning is not generated. This 
 is why existing mocked functions that return non-{{enum}} types such as 
 {{Futurevoid

[jira] [Assigned] (MESOS-3111) Executor Process gets killed before executor_shutdown_grace_period

2015-07-26 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3111:
---

Assignee: haosdent

 Executor Process gets killed before executor_shutdown_grace_period
 --

 Key: MESOS-3111
 URL: https://issues.apache.org/jira/browse/MESOS-3111
 Project: Mesos
  Issue Type: Bug
  Components: framework
Affects Versions: 0.22.1
 Environment: OS X 10.10.3
Reporter: Rajesh Kumar
Assignee: haosdent

 I am starting mesos slave with following command:
 ./bin/mesos-slave.sh --master=zk://127.0.0.1:2181/global 
 --work_dir=/mnt1/logs/mesos/ --executor_registration_timeout=3mins 
 --executor_shutdown_grace_period=60secs
 As you see I have specified executor_shutdown_grace_period as 60 sec. When 
 framework is shutdown it is expected that executors process will be shutdown 
 gracefully. If executor takes longer than executor_shutdown_grace_period, it 
 will be killed by SIGKILL. But it seems its not happening as expected. 
 Looking at following slave log, request to shutdown comes around 14:55:25 and 
 executor is killed with SIGKILL at 14:55:30, just 5 secs later. I was 
 expecting it to wait till 60 secs.
 I0721 14:55:14.430140 178049024 slave.cpp:2164] Got registration for executor 
 'global' of framework global_2015-07-21_09-23-57_542 from 
 executor(1)@172.16.20.211:62623
 I0721 14:55:14.432839 178049024 slave.cpp:1555] Sending queued task 
 'Long_Running_Job 0' to executor 'global' of framework 
 global_2015-07-21_09-23-57_542
 I0721 14:55:23.275876 176975872 slave.cpp:3648] Current disk usage 36.83%. 
 Max allowed age: 3.721608945021898days
 I0721 14:55:25.493098 180195328 slave.cpp:1768] Asked to shut down framework 
 global_2015-07-21_09-23-57_542 by master@127.0.0.1:5050
 I0721 14:55:25.493131 180195328 slave.cpp:1793] Shutting down framework 
 global_2015-07-21_09-23-57_542
 I0721 14:55:25.493176 180195328 slave.cpp:3473] Shutting down executor 
 'global' of framework global_2015-07-21_09-23-57_542
 I0721 14:55:30.514751 179658752 containerizer.cpp:1123] Executor for 
 container 'de9f1d01-bbb6-4fbe-b41b-2a253498d6a1' has exited
 I0721 14:55:30.514798 179658752 containerizer.cpp:918] Destroying container 
 'de9f1d01-bbb6-4fbe-b41b-2a253498d6a1'
 I0721 14:55:30.538887 177512448 slave.cpp:3223] Executor 'global' of 
 framework global_2015-07-21_09-23-57_542 terminated with signal Killed: 9



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3141) Compiler warning when mocking function type has an enum return type.

2015-07-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642241#comment-14642241
 ] 

haosdent commented on MESOS-3141:
-

LoL. [~mcypark] Seems review board not support binary data. Should I submit 
through github pull request?

 Compiler warning when mocking function type has an enum return type.
 

 Key: MESOS-3141
 URL: https://issues.apache.org/jira/browse/MESOS-3141
 Project: Mesos
  Issue Type: Bug
Affects Versions: 0.23.0
Reporter: Michael Park
Assignee: haosdent
  Labels: mesosphere

 The purpose of this ticket is to document a very cryptic error message 
 (actually a warning that gets propagated by {{-Werror}}) that gets generated 
 by {{clang-3.5}} from {{gmock}} source code when trying to mock a perfectly 
 innocent-looking function.
 h3. Problem
 The following code is attempting to mock a {{MesosExecutorDriver}}:
 {code}
 class MockMesosExecutorDriver : public MesosExecutorDriver {
 public:
   MockMesosExecutorDriver(mesos::Executor* executor)
 : MesosExecutorDriver(executor) {}
   MOCK_METHOD1(sendStatusUpdate, Status(const TaskStatus));
 };
 {code}
 The above code generates the following error message:
 {noformat}
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock.h:58:
 In file included from 
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:46:
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  error: indirection of non-volatile null pointer will be deleted, not trap 
 [-Werror,-Wnull-dereference]
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:78:22:
  note: in instantiation of function template specialization 
 'testing::internal::Invalidmesos::Status' requested here
 return internal::InvalidT();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-actions.h:190:43:
  note: in instantiation of member function 
 'testing::internal::BuiltInDefaultValuemesos::Status::Get' requested here
 internal::BuiltInDefaultValueT::Get() : *value_;
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1435:34:
  note: in instantiation of member function 
 'testing::DefaultValuemesos::Status::Get' requested here
 return DefaultValueResult::Get();
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1334:22:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::PerformDefaultAction' requested here
 func_mocker-PerformDefaultAction(args, call_description));
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-spec-builders.h:1448:26:
  note: in instantiation of function template specialization 
 'testing::internal::ActionResultHoldermesos::Status::PerformDefaultActionmesos::Status
  (const mesos::TaskStatus )' requested here
 return ResultHolder::PerformDefaultAction(this, args, call_description);
  ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/gmock-generated-function-mockers.h:81:7:
  note: in instantiation of member function 
 'testing::internal::FunctionMockerBasemesos::Status (const mesos::TaskStatus 
 )::UntypedPerformDefaultAction' requested here
 class FunctionMockerR(A1) : public
   ^
 ../3rdparty/libprocess/3rdparty/gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h:355:10:
  note: consider using __builtin_trap() or qualifying pointer with 'volatile'
   return *static_casttypename remove_referenceT::type*(__null);
  ^
 {noformat}
 The source of the issue here is that {{Status}} is an {{enum}}. In 
 {{gmock-1.6.0/include/gmock/internal/gmock-internal-utils.h}} you can find 
 the following function:
 {code}
 template typename T
 T Invalid() {
   return *static_casttypename remove_referenceT::type*(NULL);
 }
 {code}
 This function gets called with the return type of a mocked function. In our 
 case,  the return type of the mocked function is {{Status}}.
 Attempting to compile the following minimal example with {{clang-3.5}} 
 reproduces the error message:
 {code}
 #include type_traits
 template typename T
 T invalid() {
   return *static_casttypename std::remove_referenceT::type *(nullptr);
 }
 enum E { A, B };
 int main() {
   invalidE();
 }
 {code}
 * See it online on [GCC Explorer|https://goo.gl/t1FepZ]
 Note that if the type is not an {{enum}}, the warning is not generated. This 
 is why

[jira] [Commented] (MESOS-3091) Failed to build scheduler using C++ API: google/protobuf/stubs/common.h: No such file or directory

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642343#comment-14642343
 ] 

haosdent commented on MESOS-3091:
-

I think you need install protobuf 2.5.0 manually. Mesos should not install 3rd 
party headers because it maybe override exists headers.

 Failed to build scheduler using C++ API:  google/protobuf/stubs/common.h: No 
 such file or directory
 ---

 Key: MESOS-3091
 URL: https://issues.apache.org/jira/browse/MESOS-3091
 Project: Mesos
  Issue Type: Bug
  Components: c++ api
Affects Versions: 0.22.1
Reporter: Mohamed ElTahan

 When trying to build a scheduler against mesos, the following error is 
 encountered:
 In file included from /bb/bin/mesos/include/mesos/mesos.hpp:22:0,
  from /bb/bin/mesos/include/mesos/executor.hpp:26,
  from mesosbe_BasExecutor.cpp:23:
 /bb/bin/mesos/include/mesos/mesos.pb.h:9:42: fatal error: 
 google/protobuf/stubs/common.h: No such file or directory
  #include google/protobuf/stubs/common.h
 It seems that the problem is mainly mesos's install target only installs 
 mesos's headers and libs but not the headers and libs for the 3rd party 
 packages shipped with mesos



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3087) Typos in oversubscription doc

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642319#comment-14642319
 ] 

haosdent commented on MESOS-3087:
-

Patch: https://reviews.apache.org/r/36839/

 Typos in oversubscription doc
 -

 Key: MESOS-3087
 URL: https://issues.apache.org/jira/browse/MESOS-3087
 Project: Mesos
  Issue Type: Documentation
  Components: documentation
Affects Versions: 0.23.0
Reporter: Brian Candler
Priority: Trivial
  Labels: newbie

 * In docs/oversubscription.md: there are three cases where revocable is 
 written as recovable, including the name of a JSON field.
 {noformat}
 $ grep -niR recovable .
 ./docs/oversubscription.md:51:with revocable resources.  Further more, 
 recovable resources cannot be
 ./docs/oversubscription.md:95:Launching tasks using recovable resources is 
 done through the existing
 ./docs/oversubscription.md:96:`launchTasks` API. Revocable resources will 
 have the `recovable` field set. See
 {noformat}
 * Also in `docs/oversubscription.md`: the last sentence doesn't make sense
 {noformat}
 To select custom a resource estimator and QoS controller, please refer to the
 [modules documentation](modules.md).
 {noformat}
 Maybe should say To select a custom... or To install a custom...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MESOS-3087) Typos in oversubscription doc

2015-07-27 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3087:
---

Assignee: haosdent

 Typos in oversubscription doc
 -

 Key: MESOS-3087
 URL: https://issues.apache.org/jira/browse/MESOS-3087
 Project: Mesos
  Issue Type: Documentation
  Components: documentation
Affects Versions: 0.23.0
Reporter: Brian Candler
Assignee: haosdent
Priority: Trivial
  Labels: newbie

 * In docs/oversubscription.md: there are three cases where revocable is 
 written as recovable, including the name of a JSON field.
 {noformat}
 $ grep -niR recovable .
 ./docs/oversubscription.md:51:with revocable resources.  Further more, 
 recovable resources cannot be
 ./docs/oversubscription.md:95:Launching tasks using recovable resources is 
 done through the existing
 ./docs/oversubscription.md:96:`launchTasks` API. Revocable resources will 
 have the `recovable` field set. See
 {noformat}
 * Also in `docs/oversubscription.md`: the last sentence doesn't make sense
 {noformat}
 To select custom a resource estimator and QoS controller, please refer to the
 [modules documentation](modules.md).
 {noformat}
 Maybe should say To select a custom... or To install a custom...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3091) Failed to build scheduler using C++ API: google/protobuf/stubs/common.h: No such file or directory

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642343#comment-14642343
 ] 

haosdent edited comment on MESOS-3091 at 7/27/15 6:39 AM:
--

I think you need install protobuf 2.5.0 manually. Mesos should not install 3rd 
party headers because it may override exists headers.


was (Author: haosd...@gmail.com):
I think you need install protobuf 2.5.0 manually. Mesos should not install 3rd 
party headers because it maybe override exists headers.

 Failed to build scheduler using C++ API:  google/protobuf/stubs/common.h: No 
 such file or directory
 ---

 Key: MESOS-3091
 URL: https://issues.apache.org/jira/browse/MESOS-3091
 Project: Mesos
  Issue Type: Bug
  Components: c++ api
Affects Versions: 0.22.1
Reporter: Mohamed ElTahan

 When trying to build a scheduler against mesos, the following error is 
 encountered:
 In file included from /bb/bin/mesos/include/mesos/mesos.hpp:22:0,
  from /bb/bin/mesos/include/mesos/executor.hpp:26,
  from mesosbe_BasExecutor.cpp:23:
 /bb/bin/mesos/include/mesos/mesos.pb.h:9:42: fatal error: 
 google/protobuf/stubs/common.h: No such file or directory
  #include google/protobuf/stubs/common.h
 It seems that the problem is mainly mesos's install target only installs 
 mesos's headers and libs but not the headers and libs for the 3rd party 
 packages shipped with mesos



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3126) Building mesos.jar, uses old javadoc plugin in maven.

2015-07-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642056#comment-14642056
 ] 

haosdent commented on MESOS-3126:
-

{code}
[ERROR] Java heap space - [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/OutOfMemoryError
{code}

The problem for this is because no enough memory.

 Building mesos.jar, uses old javadoc plugin in maven.
 -

 Key: MESOS-3126
 URL: https://issues.apache.org/jira/browse/MESOS-3126
 Project: Mesos
  Issue Type: Bug
  Components: build
Affects Versions: 0.23.0
 Environment: Centos 6.5. 
 Manually installed maven, version 3.3.3
 Manually  installing mesos.
Reporter: Hans van den Bogert
Priority: Minor

 When building Mesos, including the mesos.jar. It errors when generating the 
 javadoc:
 http://pastebin.com/TNRERS4p
 It seems the javadoc plugin is also trying to parse .class files, though the 
 source directories are correct in the pom file.
 Further investigation leads to think that the javadoc plugin version is to 
 blame. Setting a fixed version of 2.10.3 for the javadoc plugin, in the pom 
 file, successfully generates the javadoc.
 In this environment, the default javadoc plugin being downloaded, is 2.8.1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3126) Building mesos.jar, uses old javadoc plugin in maven.

2015-07-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14642377#comment-14642377
 ] 

haosdent commented on MESOS-3126:
-

[~hbogert] I could build it successfully. Is it because our environment 
different?
I use CenOS 6.5/maven 3.0.5/JDK 1.7.0_75

 Building mesos.jar, uses old javadoc plugin in maven.
 -

 Key: MESOS-3126
 URL: https://issues.apache.org/jira/browse/MESOS-3126
 Project: Mesos
  Issue Type: Bug
  Components: build
Affects Versions: 0.23.0
 Environment: Centos 6.5. 
 Manually installed maven, version 3.3.3
 Manually  installing mesos.
Reporter: Hans van den Bogert
Priority: Minor

 When building Mesos, including the mesos.jar. It errors when generating the 
 javadoc:
 http://pastebin.com/TNRERS4p
 It seems the javadoc plugin is also trying to parse .class files, though the 
 source directories are correct in the pom file.
 Further investigation leads to think that the javadoc plugin version is to 
 blame. Setting a fixed version of 2.10.3 for the javadoc plugin, in the pom 
 file, successfully generates the javadoc.
 In this environment, the default javadoc plugin being downloaded, is 2.8.1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MESOS-3104) Add an endpoint that exposes component flags.

2015-07-27 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3104:
---

Assignee: haosdent

 Add an endpoint that exposes component flags.
 -

 Key: MESOS-3104
 URL: https://issues.apache.org/jira/browse/MESOS-3104
 Project: Mesos
  Issue Type: Task
Reporter: David Robinson
Assignee: haosdent
Priority: Minor
  Labels: twitter

 Apparently there's an ongoing effort to break /state.json apart into separate 
 endpoints. As part of this effort it would be great if an endpoint was 
 created that only exposed the flags. Configuration management tools could use 
 the endpoint to determine whether the master/slave is correctly configured.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3139) Incorporate CMake into standard documentation

2015-07-24 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14640884#comment-14640884
 ] 

haosdent commented on MESOS-3139:
-

@hausdorff Got it thank you.

 Incorporate CMake into standard documentation
 -

 Key: MESOS-3139
 URL: https://issues.apache.org/jira/browse/MESOS-3139
 Project: Mesos
  Issue Type: Task
  Components: build
Reporter: Alex Clemmer
Assignee: Alex Clemmer
  Labels: build, cmake, mesosphere

 Right now it's anyone's guess how to build with CMake. If we want people to 
 use it, we should put up documentation. The central challenge is that the 
 CMake instructions will be slightly different for different platforms.
 For example, on Linux, the gist of the build is basically the same as 
 autotools; you pull down the system dependencies (like APR, _etc_.), and then:
 ```
 ./bootstrap
 mkdir build-cmake  cd build-cmake
 cmake ..
 make
 ```
 But, on Windows, it will be somewhat more complicated. There is no bootstrap 
 step, for example, because Windows doesn't have bash natively. And even when 
 we put that in, you'll still have to build the glog stuff out-of-band because 
 CMake has no way of booting up Visual Studio and calling build.
 So practically, we need to figure out:
 * What our build story is for different platforms
 * Write specific instructions for our core target platforms.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3098) Implement WindowsContainerizer and WindowsDockerContainerizer

2015-07-25 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14641595#comment-14641595
 ] 

haosdent edited comment on MESOS-3098 at 7/25/15 1:55 PM:
--

Is WindowsContainer a Linux actually? Or windows container is a windows and 
only could run windows program?


was (Author: haosd...@gmail.com):
Does WindowsContainer is a Linux actually? Or windows container is a windows 
and only could run windows program?

 Implement WindowsContainerizer and WindowsDockerContainerizer
 -

 Key: MESOS-3098
 URL: https://issues.apache.org/jira/browse/MESOS-3098
 Project: Mesos
  Issue Type: Task
  Components: containerization
Reporter: Joseph Wu
Assignee: Alex Clemmer
  Labels: mesosphere

 The MVP for Windows support is a containerizer that (1) runs on Windows, and 
 (2) runs and passes all the tests that are relevant to the Windows platform 
 (_e.g._, not the tests that involve cgroups). To do this we require at least 
 a `WindowsContainerizer` (to be implemented alongside the 
 `MesosContainerizer`), which provides no meaningful (_e.g._) process 
 namespacing (much like the default unix containerizer). In the long term 
 (hopefully before MesosCon) we want to support also the Windows container 
 API. This will require implementing a separate containerizer, maybe called 
 `WindowsDockerContainerizer`.
 Since the Windows container API is actually officially supported through the 
 Docker interface (_i.e._, MSFT actually ported the Docker engine to Windows, 
 and that is the official API), the interfaces (like the fetcher) shouldn't 
 change much. The tests probably will have to change, as we don't have access 
 to any isolation primitives like cgroups for those tests.
 Outstanding TODO([~hausdorff]): Flesh out this description when more details 
 are available, regarding:
 * The container API for Windows (when we know them)
 * The nuances of Windows vs Linux (when we know them)
 * etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-527) Command Executors do not have Executor IDs in the master.

2015-07-25 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14641651#comment-14641651
 ] 

haosdent commented on MESOS-527:


Patch: https://reviews.apache.org/r/36814/

I use executor in mesos-execute directly. [~bmahler] Does this way acceptable?

 Command Executors do not have Executor IDs in the master.
 -

 Key: MESOS-527
 URL: https://issues.apache.org/jira/browse/MESOS-527
 Project: Mesos
  Issue Type: Bug
Reporter: Benjamin Mahler
  Labels: twitter

 The webui is broken for command executors because the master does not know 
 the executor ID for the tasks using a command executor. This is because the 
 Task protobuf only has the executor_id field, no other field to indicate the 
 presence of the command executor.
 It seems the slave also doesn't set the Task.executor_id for command 
 executors, thus relying on it being optionally set in executorTerminated() to 
 determine whether the task used a command executor.
 This all seems pretty messy, a few things to consider:
 1) Should we simply always set the Task.executor_id for these tasks? The 
 master could do so currently, but there would be an implicit contract that 
 the slave and master both use the task id as the executor id.
 2) We can add a boolean is_command_executor to Task, so that both the master 
 and slave can set the field, and the slave can use the boolean in 
 executorTerminated() to determine whether the task used a command executor.
 3) Alternatively, we can add a /frameworks/FID/tasks/TID url format for the 
 broken links on the master webui, so that we can search for the task in the 
 slave state to locate its executor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2411) trailing slash in work_dir causes sandbox link issues

2015-07-25 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14641494#comment-14641494
 ] 

haosdent commented on MESOS-2411:
-

Could not reproduce this problem in 0.23. Please check again [~elmalto]

 trailing slash in work_dir causes sandbox link issues
 -

 Key: MESOS-2411
 URL: https://issues.apache.org/jira/browse/MESOS-2411
 Project: Mesos
  Issue Type: Bug
  Components: webui
Affects Versions: 0.20.1
Reporter: Malte Buecken
Assignee: haosdent
Priority: Trivial
   Original Estimate: 1h
  Remaining Estimate: 1h

 OS: Debian (wheezy-backports 7)
 When you define a work_dir and the work_dir has a trailing /, you cannot open 
 the sandbox in the webui any more because of two // in the url which produce 
 a angular.js parse error



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3040) Update submitting-a-patch.md

2015-07-13 Thread gyliu (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14625750#comment-14625750
 ] 

gyliu commented on MESOS-3040:
--

The step 7 in submitting a patch is:
Make your changes to the code (using whatever IDE/editor you choose) to 
actually fix the bug or implement the feature.

Before beginning, please read the Mesos C++ Style Guide. It is recommended 
to use the git pre-commit hook (support/hooks/pre-commit) to automatically 
check for style errors. See the hook script for instructions to enable it.
Most of your changes will probably be to files inside of BASE_MESOS_DIR
To build, we recommend that you don’t build inside of the src directory. We 
recommend you do the following:
From inside of the root Mesos directory: mkdir build  cd build
../configure
make
Now all of the files generated by the build process will be contained 
in the build directory you created, instead of being spread throughout the src 
directory, which is a bit messier. This is both cleaner, and makes it easy to 
clean up if you want to get rid of the files generated by configure and make. 
I.e. You can reset your build process without risking changes you made in the 
src directory, by simply deleting the build directory, and creating a new one.

Make sure all of your test cases now pass.

The build instructions in getting start is:

# Change working directory.
$ cd mesos

# Bootstrap (Only required if building from git repository).
$ ./bootstrap

# Configure and build.
$ mkdir build
$ cd build
$ ../configure
$ make

IMHO, the getting started can give some command to enable one can get a quick 
start, but the submitting a patch should add more detail for what the command 
does and why we want to do that.


 Update submitting-a-patch.md
 

 Key: MESOS-3040
 URL: https://issues.apache.org/jira/browse/MESOS-3040
 Project: Mesos
  Issue Type: Documentation
  Components: documentation
Affects Versions: 1.0.0
Reporter: gyliu
Assignee: gyliu
 Fix For: 1.0.0


 When developer is using git to clone the mesos source code and want to build, 
 s/he need to run ./bootstrap first to generate some files such as configure, 
 .gitignore etc. But the document do not including such instructions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3040) Update submitting-a-patch.md

2015-07-13 Thread gyliu (JIRA)
gyliu created MESOS-3040:


 Summary: Update submitting-a-patch.md
 Key: MESOS-3040
 URL: https://issues.apache.org/jira/browse/MESOS-3040
 Project: Mesos
  Issue Type: Documentation
  Components: documentation
Affects Versions: 1.0.0
Reporter: gyliu
 Fix For: 1.0.0


When developer is using git to clone the mesos source code and want to build, 
s/he need to run ./bootstrap first to generate some files such as configure, 
.gitignore etc. But the document do not including such instructions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3059) Allow http endpoint to dynamically change the slave attributes

2015-07-15 Thread Nitin (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nitin updated MESOS-3059:
-
Description: 
This is well understood that - changing the attributes dynamically is not safe 
without a restart because slave itself may not know which old framework tasks 
are running on it that were dependent on previous attributes. 

However, total restart makes lot of other history to delete. We need to ensure 
a dynamic attribute changes with a soft restart. 

It will be good to expose a rest endpoint either at slave or mesos-master which 
directly changes the state in zookeeper.

USE-CASE
We use slave attributes/roles to direct the framework scheduling to use 
specific slave as per it's requirements. Mesos scheduler only creates the offer 
on the basis of some resources.
In our use case, we have some categorization of our spark frameworks or jobs 
with framework(like marathon) based on multiple factors. We want job or 
frameworks belonging to one category be running into their specific cluster of 
resources. We want to dynamically manage the slaves into these logical 
sub-clusters.

Since number of jobs that will be submitted or when it will be submitted is 
very dynamic, it make sense to be able to dynamically assign roles or 
attributes to slaves. It is not possible to gauge the requirements at time of 
cluster provisioning. Static role or attribute assignment leads to sub-optimal 
use of the cluster.


  was:
This is well understood that - changing the attributes dynamically is not safe 
without a restart because slave itself may not know which old framework tasks 
are running on it that were dependent on previous attributes. 

However, total restart makes lot of other history to delete. We need to ensure 
a dynamic attribute changes with a soft restart. 

It will be good to expose a rest endpoint either at slave or mesos-master which 
directly changes the state in zookeeper.

USE-CASE
We use slave attributes/roles to direct the framework scheduling to use 
specific slave as per it's requirements. Mesos scheduler only creates the offer 
on the basis of some resources.
In our use case, we have some categorization of our spark frameworks or jobs 
with framework(like marathon) based on multiple factors. We want job or 
frameworks belonging to one category be running into their specific cluster of 
resources. We want to dynamically manage the slaves into these logical 
sub-clusters.



 Allow http endpoint to dynamically change the slave attributes
 --

 Key: MESOS-3059
 URL: https://issues.apache.org/jira/browse/MESOS-3059
 Project: Mesos
  Issue Type: Wish
Reporter: Nitin

 This is well understood that - changing the attributes dynamically is not 
 safe without a restart because slave itself may not know which old framework 
 tasks are running on it that were dependent on previous attributes. 
 However, total restart makes lot of other history to delete. We need to 
 ensure a dynamic attribute changes with a soft restart. 
 It will be good to expose a rest endpoint either at slave or mesos-master 
 which directly changes the state in zookeeper.
 USE-CASE
 We use slave attributes/roles to direct the framework scheduling to use 
 specific slave as per it's requirements. Mesos scheduler only creates the 
 offer on the basis of some resources.
 In our use case, we have some categorization of our spark frameworks or jobs 
 with framework(like marathon) based on multiple factors. We want job or 
 frameworks belonging to one category be running into their specific cluster 
 of resources. We want to dynamically manage the slaves into these logical 
 sub-clusters.
 Since number of jobs that will be submitted or when it will be submitted is 
 very dynamic, it make sense to be able to dynamically assign roles or 
 attributes to slaves. It is not possible to gauge the requirements at time of 
 cluster provisioning. Static role or attribute assignment leads to 
 sub-optimal use of the cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3023) Factoring out the pattern for URL generation

2015-07-17 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630891#comment-14630891
 ] 

haosdent commented on MESOS-3023:
-

Hi, [~klaus1982] Thank you for your patch, I also write some new review to it. 
Maybe you need find a committer to shepherd this. Because a shepherd committer 
could give you more advice and help you to commit it. :-)
And IMHO, if this pattern only use in this one place, do we still need 
factoring out the pattern? Or waiting some other places also need use this 
pattern, and generic it at that time?

 Factoring out the pattern for URL generation 
 -

 Key: MESOS-3023
 URL: https://issues.apache.org/jira/browse/MESOS-3023
 Project: Mesos
  Issue Type: Task
Reporter: Artem Harutyunyan
Assignee: Klaus Ma
Priority: Minor
  Labels: beginner, mesosphere, newbie

 fetcher_test.cpp uses the following code for generating URLs:
 string url = http://; + net::getHostname(process.self().address.ip).get() + 
 : + stringify(process.self().address.port) + / + process.self().id
 it would be good to isolate that code in a function, and replace the code 
 above with something like:
 string url = http://; + endpoint_url(process, uri_test);



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3091) Failed to build scheduler using C++ API: google/protobuf/stubs/common.h: No such file or directory

2015-07-20 Thread Mohamed (JIRA)
Mohamed created MESOS-3091:
--

 Summary: Failed to build scheduler using C++ API:  
google/protobuf/stubs/common.h: No such file or directory
 Key: MESOS-3091
 URL: https://issues.apache.org/jira/browse/MESOS-3091
 Project: Mesos
  Issue Type: Bug
  Components: c++ api
Affects Versions: 0.22.1
Reporter: Mohamed


When trying to build a scheduler against mesos, the following error is 
encountered:

In file included from /bb/bin/mesos/include/mesos/mesos.hpp:22:0,
 from /bb/bin/mesos/include/mesos/executor.hpp:26,
 from mesosbe_BasExecutor.cpp:23:
/bb/bin/mesos/include/mesos/mesos.pb.h:9:42: fatal error: 
google/protobuf/stubs/common.h: No such file or directory
 #include google/protobuf/stubs/common.h

It seems that the problem is mainly mesos's install target only installs 
mesos's headers and libs but not the headers and libs for the 3rd party 
packages shipped with mesos



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3706) Tasks stuck in staging.

2015-10-21 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14966818#comment-14966818
 ] 

haosdent commented on MESOS-3706:
-

As you stay, stdout and stderr are empty. Could you try use strace or gdb to 
attach the mesos-docker-executor hangout which point? For example, in the 
example you describe in description, try use
{noformat}
strace -p 35360
{noformat}

to find mesos-docker-executor blocks on which syscall and then post result here.

> Tasks stuck in staging.
> ---
>
> Key: MESOS-3706
> URL: https://issues.apache.org/jira/browse/MESOS-3706
> Project: Mesos
>  Issue Type: Bug
>  Components: docker, slave
>Affects Versions: 0.23.0, 0.24.1
>Reporter: Jord Sonneveld
> Attachments: Screen Shot 2015-10-12 at 9.08.30 AM.png, Screen Shot 
> 2015-10-12 at 9.24.32 AM.png, docker.txt, mesos-slave.INFO, 
> mesos-slave.INFO.2, mesos-slave.INFO.3, stderr, stdout
>
>
> I have a docker image which starts fine on all my slaves except for one.  On 
> that one, it is stuck in STAGING for a long time and never starts.  The INFO 
> log is full of messages like this:
> I1012 16:02:09.210306 34905 slave.cpp:1768] Asked to kill task 
> kwe-vinland-work.6c939697-70f8-11e5-845c-0242e054dd72 of framework 
> 20150109-172016-504433162-5050-19367-0002
> E1012 16:02:09.211272 34907 socket.hpp:174] Shutdown failed on fd=12: 
> Transport endpoint is not connected [107]
> kwe-vinland-work is the task that is stuck in staging.  It is launched by 
> marathon.  I have launched 161 instances successfully on my cluster.  But it 
> refuses to launch on this specific slave.
> These machines are all managed via ansible so their configurations are / 
> should be identical.  I have re-run my ansible scripts and rebooted the 
> machines to no avail.
> It's been in this state for almost 30 minutes.  You can see the mesos docker 
> executor is still running:
> jord@dalstgmesos03:~$ date
> Mon Oct 12 16:13:55 UTC 2015
> jord@dalstgmesos03:~$ ps auwx | grep kwe-vinland
> root 35360  0.0  0.0 1070576 21476 ?   Ssl  15:46   0:00 
> mesos-docker-executor 
> --container=mesos-20151012-082619-4145023498-5050-22623-S0.0695c9e0-0adf-4dfb-bc2a-6060245dcabe
>  --docker=docker --help=false --mapped_directory=/mnt/mesos/sandbox 
> --sandbox_directory=/data/mesos/mesos/work/slaves/20151012-082619-4145023498-5050-22623-S0/frameworks/20150109-172016-504433162-5050-19367-0002/executors/kwe-vinland-work.6c939697-70f8-11e5-845c-0242e054dd72/runs/0695c9e0-0adf-4dfb-bc2a-6060245dcabe
>  --stop_timeout=0ns
> According to docker ps -a, nothing was ever even launched:
> jord@dalstgmesos03:/data/mesos$ sudo docker ps -a
> CONTAINER IDIMAGE  
> COMMAND  CREATED STATUS  PORTS
> NAMES
> 5c858b90b0a0registry.roger.dal.moz.com:5000/moz-statsd-v0.22   
> "/bin/sh -c ./start.s"   39 minutes ago  Up 39 minutes   
> 0.0.0.0:9125->8125/udp, 0.0.0.0:9126->8126/tcp   statsd-fe-influxdb
> d765ba3829fdregistry.roger.dal.moz.com:5000/moz-statsd-v0.22   
> "/bin/sh -c ./start.s"   41 minutes ago  Up 41 minutes   
> 0.0.0.0:8125->8125/udp, 0.0.0.0:8126->8126/tcp   statsd-repeater
> Those are the only two entries. Nothing about the kwe-vinland job.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-1582) Improve build time.

2015-10-24 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14972825#comment-14972825
 ] 

haosdent commented on MESOS-1582:
-

Add the missing libprocess build time for every object: 
http://fpaste.org/283318/

> Improve build time.
> ---
>
> Key: MESOS-1582
> URL: https://issues.apache.org/jira/browse/MESOS-1582
> Project: Mesos
>  Issue Type: Epic
>  Components: build
>Reporter: Benjamin Hindman
>
> The build takes a ridiculously long time unless you have a large, parallel 
> machine. This is a combination of many factors, all of which we'd like to 
> discuss and track here.
> I'd also love to actually track build times so we can get an appreciation of 
> the improvements. Please leave a comment below with your build times!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3793) Cannot start mesos local on a Debian GNU/Linux 8 docker machine

2015-10-23 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14971183#comment-14971183
 ] 

haosdent commented on MESOS-3793:
-

Please add --launcher=posix or mount cgroup as rw when launch docker container. 
http://search-hadoop.com/m/0Vlr6zfCev1S7gRF1

> Cannot start mesos local on a Debian GNU/Linux 8 docker machine
> ---
>
> Key: MESOS-3793
> URL: https://issues.apache.org/jira/browse/MESOS-3793
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.25.0
> Environment: Debian GNU/Linux 8 docker machine
>Reporter: Matthias Veit
>Assignee: Jojy Varghese
>  Labels: mesosphere
>
> We updated the mesos version to 0.25.0 in our Marathon docker image, that 
> runs our integration tests.
> We use mesos local for those tests. This fails with this message:
> {noformat}
> root@a06e4b4eb776:/marathon# mesos local
> I1022 18:42:26.852485   136 leveldb.cpp:176] Opened db in 6.103258ms
> I1022 18:42:26.853302   136 leveldb.cpp:183] Compacted db in 765740ns
> I1022 18:42:26.853343   136 leveldb.cpp:198] Created db iterator in 9001ns
> I1022 18:42:26.853355   136 leveldb.cpp:204] Seeked to beginning of db in 
> 1287ns
> I1022 18:42:26.853366   136 leveldb.cpp:273] Iterated through 0 keys in the 
> db in ns
> I1022 18:42:26.853406   136 replica.cpp:744] Replica recovered with log 
> positions 0 -> 0 with 1 holes and 0 unlearned
> I1022 18:42:26.853775   141 recover.cpp:449] Starting replica recovery
> I1022 18:42:26.853862   141 recover.cpp:475] Replica is in EMPTY status
> I1022 18:42:26.854751   138 replica.cpp:641] Replica in EMPTY status received 
> a broadcasted recover request
> I1022 18:42:26.854856   140 recover.cpp:195] Received a recover response from 
> a replica in EMPTY status
> I1022 18:42:26.855002   140 recover.cpp:566] Updating replica status to 
> STARTING
> I1022 18:42:26.855655   138 master.cpp:376] Master 
> a3f39818-1bda-4710-b96b-2a60ed4d12b8 (a06e4b4eb776) started on 
> 172.17.0.14:5050
> I1022 18:42:26.855680   138 master.cpp:378] Flags at startup: 
> --allocation_interval="1secs" --allocator="HierarchicalDRF" 
> --authenticate="false" --authenticate_slaves="false" 
> --authenticators="crammd5" --authorizers="local" --framework_sorter="drf" 
> --help="false" --hostname_lookup="true" --initialize_driver_logging="true" 
> --log_auto_initialize="true" --logbufsecs="0" --logging_level="INFO" 
> --max_slave_ping_timeouts="5" --quiet="false" 
> --recovery_slave_removal_limit="100%" --registry="replicated_log" 
> --registry_fetch_timeout="1mins" --registry_store_timeout="5secs" 
> --registry_strict="false" --root_submissions="true" 
> --slave_ping_timeout="15secs" --slave_reregister_timeout="10mins" 
> --user_sorter="drf" --version="false" --webui_dir="/usr/share/mesos/webui" 
> --work_dir="/tmp/mesos/local/AK0XpG" --zk_session_timeout="10secs"
> I1022 18:42:26.855790   138 master.cpp:425] Master allowing unauthenticated 
> frameworks to register
> I1022 18:42:26.855803   138 master.cpp:430] Master allowing unauthenticated 
> slaves to register
> I1022 18:42:26.855815   138 master.cpp:467] Using default 'crammd5' 
> authenticator
> W1022 18:42:26.855829   138 authenticator.cpp:505] No credentials provided, 
> authentication requests will be refused
> I1022 18:42:26.855840   138 authenticator.cpp:512] Initializing server SASL
> I1022 18:42:26.856442   136 containerizer.cpp:143] Using isolation: 
> posix/cpu,posix/mem,filesystem/posix
> I1022 18:42:26.856943   140 leveldb.cpp:306] Persisting metadata (8 bytes) to 
> leveldb took 1.888185ms
> I1022 18:42:26.856987   140 replica.cpp:323] Persisted replica status to 
> STARTING
> I1022 18:42:26.857115   140 recover.cpp:475] Replica is in STARTING status
> I1022 18:42:26.857270   140 replica.cpp:641] Replica in STARTING status 
> received a broadcasted recover request
> I1022 18:42:26.857312   140 recover.cpp:195] Received a recover response from 
> a replica in STARTING status
> I1022 18:42:26.857368   140 recover.cpp:566] Updating replica status to VOTING
> I1022 18:42:26.857781   140 leveldb.cpp:306] Persisting metadata (8 bytes) to 
> leveldb took 371121ns
> I1022 18:42:26.857841   140 replica.cpp:323] Persisted replica status to 
> VOTING
> I1022 18:42:26.857895   140 recover.cpp:580] S

[jira] [Commented] (MESOS-1582) Improve build time.

2015-10-25 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973107#comment-14973107
 ] 

haosdent commented on MESOS-1582:
-

I found most time are spent in #include files. For example, #include 
"tests/mesos.hpp" need 3 seconds,  #include "master/master.hpp" also nearly 3 
seconds. I would write a script to generate a list contains the include time 
for every header file.

> Improve build time.
> ---
>
> Key: MESOS-1582
> URL: https://issues.apache.org/jira/browse/MESOS-1582
> Project: Mesos
>  Issue Type: Epic
>  Components: build
>Reporter: Benjamin Hindman
>
> The build takes a ridiculously long time unless you have a large, parallel 
> machine. This is a combination of many factors, all of which we'd like to 
> discuss and track here.
> I'd also love to actually track build times so we can get an appreciation of 
> the improvements. Please leave a comment below with your build times!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3506) Build instructions for CentOS 6.6 should include `sudo yum update`

2015-10-23 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14971413#comment-14971413
 ] 

haosdent commented on MESOS-3506:
-

I test this in CentOS 6 docker image, seem don't contains default
{noformat}
[root@af15e2315ea4 /]# wget
bash: wget: command not found
[root@af15e2315ea4 /]# tar
bash: tar: command not found
[root@af15e2315ea4 /]# which
bash: which: command not found
[root@af15e2315ea4 /]# cat /etc/issue
CentOS release 6.7 (Final)
Kernel \r on an \m
{noformat}

> Build instructions for CentOS 6.6 should include `sudo yum update`
> --
>
> Key: MESOS-3506
> URL: https://issues.apache.org/jira/browse/MESOS-3506
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.25.0
>Reporter: Greg Mann
>Assignee: Greg Mann
>  Labels: documentation, mesosphere
>
> Neglecting to run {{sudo yum update}} on CentOS 6.6 currently causes the 
> build to break when building {{mesos-0.25.0.jar}}. The build instructions for 
> this platform on the Getting Started page should be changed accordingly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3545) Investigate restoring tasks/executors after machine reboot.

2015-10-22 Thread Megha (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14969497#comment-14969497
 ] 

Megha commented on MESOS-3545:
--

Here's the first draft of the design for Persistent Tasks. Looking forward to 
feedback and comments.

https://docs.google.com/document/d/1l7goeISpYmCjM03l20lmjZ6_BMfdxBs31znEBRtzsuU/edit?usp=sharing



> Investigate restoring tasks/executors after machine reboot.
> ---
>
> Key: MESOS-3545
> URL: https://issues.apache.org/jira/browse/MESOS-3545
> Project: Mesos
>  Issue Type: Improvement
>  Components: slave
>Reporter: Benjamin Hindman
>  Labels: mesosphere
>
> If a task/executor is restartable (see MESOS-3544) it might make sense to 
> force an agent to restart these tasks/executors _before_ after a machine 
> reboot in the event that the machine is network partitioned away from the 
> master (or the master has failed) but we'd like to get these services running 
> again. Assuming the agent(s) running on the machine has not been disconnected 
> from the master for longer than the master's agent re-registration timeout 
> the agent should be able to re-register (i.e., after a network partition is 
> resolved) without a problem. However, in the same way that a framework would 
> be interested in knowing that it's tasks/executors were restarted we'd want 
> to send something like a TASK_RESTARTED status update.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-10-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975836#comment-14975836
 ] 

haosdent commented on MESOS-3787:
-

Add a test case to verify above proposal could works. 
https://reviews.apache.org/r/39678/

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
> URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
> Attachments: mesos.patch, test-example.json
>
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-10-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975486#comment-14975486
 ] 

haosdent commented on MESOS-3787:
-

I think we just need run it use the shell way, not need extra variables before 
start.
{node}
diff --git a/src/docker/docker.cpp b/src/docker/docker.cpp
index 56d63dc..71327bd 100755
--- a/src/docker/docker.cpp
+++ b/src/docker/docker.cpp
@@ -583,8 +583,7 @@ Future Docker::run(
   }

   Try s = subprocess(
-  path,
-  argv,
+  cmd,
   Subprocess::PATH("/dev/null"),
   stdoutIo,
   stderrIo,
{node}

According my test example in https://reviews.apache.org/r/39387/ . Pass "$xxx" 
would replace to environment variables when running.

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
>     URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
> Attachments: mesos.patch, test-example.json
>
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-10-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975486#comment-14975486
 ] 

haosdent edited comment on MESOS-3787 at 10/27/15 1:28 AM:
---

I think we just need run it use the shell way, not need extra variables before 
start.
{code}
diff --git a/src/docker/docker.cpp b/src/docker/docker.cpp
index 56d63dc..71327bd 100755
--- a/src/docker/docker.cpp
+++ b/src/docker/docker.cpp
@@ -583,8 +583,7 @@ Future Docker::run(
   }

   Try s = subprocess(
-  path,
-  argv,
+  cmd,
   Subprocess::PATH("/dev/null"),
   stdoutIo,
   stderrIo,
{code}

According my test example in https://reviews.apache.org/r/39387/ . Pass "$xxx" 
would replace to environment variables when running.


was (Author: haosd...@gmail.com):
I think we just need run it use the shell way, not need extra variables before 
start.
{node}
diff --git a/src/docker/docker.cpp b/src/docker/docker.cpp
index 56d63dc..71327bd 100755
--- a/src/docker/docker.cpp
+++ b/src/docker/docker.cpp
@@ -583,8 +583,7 @@ Future Docker::run(
   }

   Try s = subprocess(
-  path,
-  argv,
+  cmd,
   Subprocess::PATH("/dev/null"),
   stdoutIo,
   stderrIo,
{node}

According my test example in https://reviews.apache.org/r/39387/ . Pass "$xxx" 
would replace to environment variables when running.

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
> URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
> Attachments: mesos.patch, test-example.json
>
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-2635) Web UI Display Bug when starting lots of tasks with small cpu value

2015-10-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975492#comment-14975492
 ] 

haosdent edited comment on MESOS-2635 at 10/27/15 1:34 AM:
---

Seems I could not found values depends on metrics in frameworks.html and 
framework.html. Could you provide more details about this? Last time I found 
only slave and master pages depends on values have to got from metrics 
endpoint, so I only update them.


was (Author: haosd...@gmail.com):
Seems I could not found values depends on metrics in frameworks.html Could you 
provide more details about this? Last time I found only slave and master pages 
depends on values have to got from metrics endpoint, so I only update them.

> Web UI Display Bug when starting lots of tasks with small cpu value
> ---
>
> Key: MESOS-2635
> URL: https://issues.apache.org/jira/browse/MESOS-2635
> Project: Mesos
>  Issue Type: Bug
>  Components: webui
>Affects Versions: 0.22.0
>Reporter: Lukas Loesche
>Assignee: haosdent
>Priority: Minor
> Fix For: 0.25.0
>
> Attachments: Screen Shot 2015-04-17 at 18.53.54.png, fixed_after.png, 
> fixed_before.png
>
>
> During Marathon scale testing we encountered a display bug where the Mesos 
> Web UI would show -4.064304448547773e-11 Idle CPUs in the Resources overview.
> To reproduce, we started:
>  "cmd": "python -m SimpleHTTPServer $PORT0",
>  "cpus": 0.0001,
>  "instances": 2000,
>  "mem": 32,



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2635) Web UI Display Bug when starting lots of tasks with small cpu value

2015-10-26 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14975492#comment-14975492
 ] 

haosdent commented on MESOS-2635:
-

Seems I could not found values depends on metrics in frameworks.html Could you 
provide more details about this? Last time I found only slave and master pages 
depends on values have to got from metrics endpoint, so I only update them.

> Web UI Display Bug when starting lots of tasks with small cpu value
> ---
>
> Key: MESOS-2635
> URL: https://issues.apache.org/jira/browse/MESOS-2635
> Project: Mesos
>  Issue Type: Bug
>  Components: webui
>Affects Versions: 0.22.0
>Reporter: Lukas Loesche
>Assignee: haosdent
>Priority: Minor
> Fix For: 0.25.0
>
> Attachments: Screen Shot 2015-04-17 at 18.53.54.png, fixed_after.png, 
> fixed_before.png
>
>
> During Marathon scale testing we encountered a display bug where the Mesos 
> Web UI would show -4.064304448547773e-11 Idle CPUs in the Resources overview.
> To reproduce, we started:
>  "cmd": "python -m SimpleHTTPServer $PORT0",
>  "cpus": 0.0001,
>  "instances": 2000,
>  "mem": 32,



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3599) COMMAND health checks with marathon running in slave context broken

2015-10-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14976671#comment-14976671
 ] 

haosdent commented on MESOS-3599:
-

As you see, I expose it as $MESOS_CONTAINER_NAME in 
https://reviews.apache.org/r/39387/diff/1#2

> COMMAND health checks with marathon running in slave context broken
> ---
>
> Key: MESOS-3599
> URL: https://issues.apache.org/jira/browse/MESOS-3599
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.23.0
>Reporter: Erhan Kesken
>Assignee: haosdent
>Priority: Critical
>
> When deploying Mesos 0.23rc4 with latest Marathon 0.10.0 RC3 command health 
> check stop working. Rolling back to Mesos 0.22.1 fixes the problem.
> Containerizer is Docker.
> All packages are from official Mesosphere Ubuntu 14.04 sources.
> The issue must be analyzed further.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3599) COMMAND health checks with marathon running in slave context broken

2015-10-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14976675#comment-14976675
 ] 

haosdent commented on MESOS-3599:
-

I think we could ping [~tnachen] again to review the patch. ;-)

> COMMAND health checks with marathon running in slave context broken
> ---
>
> Key: MESOS-3599
> URL: https://issues.apache.org/jira/browse/MESOS-3599
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.23.0
>Reporter: Erhan Kesken
>Assignee: haosdent
>Priority: Critical
>
> When deploying Mesos 0.23rc4 with latest Marathon 0.10.0 RC3 command health 
> check stop working. Rolling back to Mesos 0.22.1 fixes the problem.
> Containerizer is Docker.
> All packages are from official Mesosphere Ubuntu 14.04 sources.
> The issue must be analyzed further.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3216) virtual memory exhausted:: Cannot allocate memory

2015-10-27 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14976728#comment-14976728
 ] 

haosdent commented on MESOS-3216:
-

hmm, last time I try to build Mesos in a virtual machine which only have 1GB 
also have similar problem. I have to change the memory to 2GB and make build 
pass.

> virtual memory exhausted:: Cannot allocate memory
> -
>
> Key: MESOS-3216
> URL: https://issues.apache.org/jira/browse/MESOS-3216
> Project: Mesos
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.23.0
> Environment: Linux Kudu 3.19.0-25-generic #26-Ubuntu SMP Fri Jul 24 
> 21:17:31 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
> (Ubuntu 15.04)
>Reporter: Samuel Marks
>
> After receiving this error when building on a virtual instance, I decided to 
> build a package using https://github.com/deric/mesos-deb-packaging.
> Here is the last little bit of the output after running {{./build_mesos --ref 
> 0.23.0 --build-version p1}}:
> {code}
> mv -f common/.deps/libmesos_no_3rdparty_la-http.Tpo 
> common/.deps/libmesos_no_3rdparty_la-http.Plo
> /bin/bash ../libtool  --tag=CXX   --mode=compile g++ -DPACKAGE_NAME=\"mesos\" 
> -DPACKAGE_TARNAME=\"mesos\" -DPACKAGE_VERSION=\"0.23.0\" 
> -DPACKAGE_STRING=\"mesos\ 0.23.0\" -DPACKAGE_BUGREPORT=\"\" 
> -DPACKAGE_URL=\"\" -DPACKAGE=\"mesos\" -DVERSION=\"0.23.0\" -DSTDC_HEADERS=1 
> -DHAVE_SYS_TYPES_H=1 -DHAVE_SYS_STAT_H=1 -DHAVE_STDLIB_H=1 -DHAVE_STRING_H=1 
> -DHAVE_MEMORY_H=1 -DHAVE_STRINGS_H=1 -DHAVE_INTTYPES_H=1 -DHAVE_STDINT_H=1 
> -DHAVE_UNISTD_H=1 -DHAVE_DLFCN_H=1 -DLT_OBJDIR=\".libs/\" 
> -DHAVE_PTHREAD_PRIO_INHERIT=1 -DHAVE_PTHREAD=1 -DHAVE_LIBZ=1 -DHAVE_LIBCURL=1 
> -DHAVE_APR_POOLS_H=1 -DHAVE_LIBAPR_1=1 -DHAVE_SVN_VERSION_H=1 
> -DHAVE_LIBSVN_SUBR_1=1 -DHAVE_SVN_DELTA_H=1 -DHAVE_LIBSVN_DELTA_1=1 
> -DHAVE_LIBSASL2=1 -DMESOS_HAS_JAVA=1 -DHAVE_PYTHON=\"2.7\" 
> -DMESOS_HAS_PYTHON=1 -I. 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src   -Wall 
> -Werror -DLIBDIR=\"/usr/lib\" -DPKGLIBEXECDIR=\"/usr/libexec/mesos\" 
> -DPKGDATADIR=\"/usr/share/mesos\" 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/include 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/3rdparty/libprocess/include
>  
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/3rdparty/libprocess/3rdparty/stout/include
>  -I../include -I../include/mesos 
> -I../3rdparty/libprocess/3rdparty/boost-1.53.0 
> -I../3rdparty/libprocess/3rdparty/picojson-4f93734 
> -I../3rdparty/libprocess/3rdparty/protobuf-2.5.0/src 
> -I../3rdparty/libprocess/3rdparty/glog-0.3.3/src 
> -I../3rdparty/libprocess/3rdparty/glog-0.3.3/src 
> -I../3rdparty/leveldb/include -I../3rdparty/zookeeper-3.4.5/src/c/include 
> -I../3rdparty/zookeeper-3.4.5/src/c/generated 
> -I../3rdparty/libprocess/3rdparty/protobuf-2.5.0/src 
> -I/usr/include/subversion-1 -I/usr/include/apr-1 -I/usr/include/apr-1.0  
> -pthread -O2 -Wno-unused-local-typedefs -Wno-maybe-uninitialized -std=c++11 
> -MT master/allocator/libmesos_no_3rdparty_la-allocator.lo -MD -MP -MF 
> master/allocator/.deps/libmesos_no_3rdparty_la-allocator.Tpo -c -o 
> master/allocator/libmesos_no_3rdparty_la-allocator.lo `test -f 
> 'master/allocator/allocator.cpp' || echo 
> '/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src/'`master/allocator/allocator.cpp
> libtool: compile:  g++ -DPACKAGE_NAME=\"mesos\" -DPACKAGE_TARNAME=\"mesos\" 
> -DPACKAGE_VERSION=\"0.23.0\" "-DPACKAGE_STRING=\"mesos 0.23.0\"" 
> -DPACKAGE_BUGREPORT=\"\" -DPACKAGE_URL=\"\" -DPACKAGE=\"mesos\" 
> -DVERSION=\"0.23.0\" -DSTDC_HEADERS=1 -DHAVE_SYS_TYPES_H=1 
> -DHAVE_SYS_STAT_H=1 -DHAVE_STDLIB_H=1 -DHAVE_STRING_H=1 -DHAVE_MEMORY_H=1 
> -DHAVE_STRINGS_H=1 -DHAVE_INTTYPES_H=1 -DHAVE_STDINT_H=1 -DHAVE_UNISTD_H=1 
> -DHAVE_DLFCN_H=1 -DLT_OBJDIR=\".libs/\" -DHAVE_PTHREAD_PRIO_INHERIT=1 
> -DHAVE_PTHREAD=1 -DHAVE_LIBZ=1 -DHAVE_LIBCURL=1 -DHAVE_APR_POOLS_H=1 
> -DHAVE_LIBAPR_1=1 -DHAVE_SVN_VERSION_H=1 -DHAVE_LIBSVN_SUBR_1=1 
> -DHAVE_SVN_DELTA_H=1 -DHAVE_LIBSVN_DELTA_1=1 -DHAVE_LIBSASL2=1 
> -DMESOS_HAS_JAVA=1 -DHAVE_PYTHON=\"2.7\" -DMESOS_HAS_PYTHON=1 -I. 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src -Wall 
> -Werror -DLIBDIR=\"/usr/lib\" -DPKGLIBEXECDIR=\"/usr/libexec/mesos\" 
> -DPKGDATADIR=\"/usr/share/mesos\" 
> -I/linked_rep

[jira] [Commented] (MESOS-3809) Expose advertise_ip and advertise_port as command line options in mesos slave

2015-10-28 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14977853#comment-14977853
 ] 

haosdent commented on MESOS-3809:
-

+1 please also update 
https://github.com/apache/mesos/blob/master/docs/configuration.md at the same 
time.

> Expose advertise_ip and advertise_port as command line options in mesos slave
> -
>
> Key: MESOS-3809
> URL: https://issues.apache.org/jira/browse/MESOS-3809
> Project: Mesos
>  Issue Type: Bug
>  Components: slave
>Affects Versions: 0.25.0
>Reporter: Anindya Sinha
>Assignee: Anindya Sinha
>Priority: Minor
>
> advertise_ip and advertise_port are exposed as mesos master command line args 
> (MESOS-809). But the following use case makes it a candidate for adding as 
> command line args in mesos slave as well.
> On Tue, Oct 27, 2015 at 7:43 PM, Xiaodong Zhang <xdzh...@alauda.io> wrote:
> It works! Thanks a lot.
> 发件人: haosdent <haosd...@gmail.com>
> 答复: "u...@mesos.apache.org" <u...@mesos.apache.org>
> 日期: 2015年10月28日 星期三 上午10:23
> 至: user <u...@mesos.apache.org>
> 主题: Re: How to tell master which ip to connect.
> Do you try `export LIBPROCESS_ADVERTISE_IP=xxx` and 
> `LIBPROCESS_ADVERTISE_PORT` when start slave?
> On Wed, Oct 28, 2015 at 10:16 AM, Xiaodong Zhang <xdzh...@alauda.io> wrote:
> Hi teams:
> My scenarios is like this:
> My master nodes were deployed in AWS. My slaves were in AZURE.So they 
> communicate via public ip.
> I got trouble when slaves try to register to master. 
> Now slaves can get master’s public ip address,and can send register 
> request.But they can only send there private ip to master.(Because they don’t 
> know there public ip,thus they can’t not bind a public ip via —ip  flag), 
> thus  masters can’t connect slaves.How can the slave to tell master which ip 
> master should connect(I can’t find any flags like —advertise_ip in master).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3803) system requirenments on debian

2015-10-25 Thread shiyao.ma (JIRA)
shiyao.ma created MESOS-3803:


 Summary: system requirenments on debian
 Key: MESOS-3803
 URL: https://issues.apache.org/jira/browse/MESOS-3803
 Project: Mesos
  Issue Type: Documentation
Reporter: shiyao.ma
Priority: Trivial


ITSM the package libz-dev is missing in the system requirements for ubuntu



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3224) Create a Mesos Contributor Newbie Guide

2015-10-25 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14973318#comment-14973318
 ] 

haosdent commented on MESOS-3224:
-

{noformat}
NewbieContributionOverview.jpg
{noformat}
Seems could not find this image.


{noformat}
* Required installations
+ GTest
+ GLog
{noformat}

Because we bundle gtest and glog in 3rdparty folder, do we still need install 
them?

{noformat}
# Download and Build Mesos
{noformat}

It would be better add get start link at here. 
http://mesos.apache.org/gettingstarted/

> Create a Mesos Contributor Newbie Guide
> ---
>
> Key: MESOS-3224
> URL: https://issues.apache.org/jira/browse/MESOS-3224
> Project: Mesos
>  Issue Type: Documentation
>  Components: documentation
>Reporter: Timothy Chen
>Assignee: Diana Arroyo
>
> Currently the website doesn't have a helpful guide for community users to 
> know how to start learning to contribute to Mesos, understand the concepts 
> and lower the barrier to get involved.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3224) Create a Mesos Contributor Newbie Guide

2015-10-21 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14966771#comment-14966771
 ] 

haosdent commented on MESOS-3224:
-

Move to github pull request or review board?

> Create a Mesos Contributor Newbie Guide
> ---
>
> Key: MESOS-3224
> URL: https://issues.apache.org/jira/browse/MESOS-3224
> Project: Mesos
>  Issue Type: Documentation
>  Components: documentation
>Reporter: Timothy Chen
>Assignee: Diana Arroyo
>
> Currently the website doesn't have a helpful guide for community users to 
> know how to start learning to contribute to Mesos, understand the concepts 
> and lower the barrier to get involved.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-10-21 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14968410#comment-14968410
 ] 

haosdent commented on MESOS-3787:
-

For CommandInfo, I think Mesos already set them when start a docker container 
in mesos-docker-executor 
https://github.com/apache/mesos/blob/master/src/docker/docker.cpp#L421-L422. Do 
this not work for you?

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
> URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998837#comment-14998837
 ] 

haosdent commented on MESOS-3870:
-

Ohoh, got it. Thank you for explanation.

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998700#comment-14998700
 ] 

haosdent edited comment on MESOS-3870 at 11/10/15 3:09 PM:
---

Suppose a Process enqueue to runq twice(Or impossible, seems I could not find 
any code avoid it enqueue multi times) when it receive two events.

And the Process dequeue in different work threads, and not yet running. In work 
thread 1, Process pop event A and not yet running. In work thread 2, Process 
pop event B and start running.

Is this scenario possible?


was (Author: haosd...@gmail.com):
Suppose a Process enqueue to runq twice(Or impossible, seems I could not find 
any code avoid it enqueue multi times) when it receive two events.

And the dequeue in different work threads, and not yet running. In work thread 
1, Process dequeue event A and not yet running. In work thread 2, Process 
dequeue event B and start running.

Is this scenario possible?

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998884#comment-14998884
 ] 

haosdent commented on MESOS-3870:
-

I think have another case make same Process run in different thread. Suppose 
ProcessBase pop event A in thread 1 and change ProcessBase.state to BLOCKED in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L2463
 . And not yet consume event B. Then event B reach and enqueue same Process to 
ProcessManager.runq in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L3008
 . Then ProcessManager dequeue it in thread 2 and pop event B and run event B 
while thread 1 not yet running event A consume function. Is this possible?

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998700#comment-14998700
 ] 

haosdent commented on MESOS-3870:
-

Suppose a Process enqueue to runq twice(Or impossible, seems I could not find 
any code avoid it enqueue multi times) when it receive two events.

And the dequeue in different work threads, and not yet running. In work thread 
1, Process dequeue event A and not yet running. In work thread 2, Process 
dequeue event B and start running.

Is this scenario possible?

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998757#comment-14998757
 ] 

haosdent commented on MESOS-3870:
-

Yes, but ProcessBase.state don't have violate. I am not sure if it will still 
get dirty value while it change in other thread.

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998686#comment-14998686
 ] 

haosdent commented on MESOS-3870:
-

I think ProcessManager could dequeue same Process in different work thread?
{noformat}
ProcessBase* ProcessManager::dequeue()
{
  // TODO(benh): Remove a process from this thread's runq. If there
  // are no processes to run, and this is not a dedicated thread, then
  // steal one from another threads runq.

  ProcessBase* process = NULL;

  synchronized (runq_mutex) {
if (!runq.empty()) {
  process = runq.front();
  runq.pop_front();
  // Increment the running count of processes in order to support
  // the Clock::settle() operation (this must be done atomically
  // with removing the process from the runq).
  running.fetch_add(1);
}
  }

  return process;
}
{noformat}

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998730#comment-14998730
 ] 

haosdent commented on MESOS-3870:
-

Yes, but I think it still could be enqueue twice? Because ProcessBase.state in 
different CPU core caches.

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998884#comment-14998884
 ] 

haosdent edited comment on MESOS-3870 at 11/10/15 4:49 PM:
---

I think have another case make same Process run in different thread. Suppose 
ProcessBase pop event A in thread 1 and change ProcessBase.state to BLOCKED in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L2463
 . And not yet consume event A. Then event B reach and enqueue same Process to 
ProcessManager.runq in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L3008
 . Then ProcessManager dequeue it in thread 2 and pop event B and run event B 
while thread 1 not yet running event A consume function. Is this possible?


was (Author: haosd...@gmail.com):
I think have another case make same Process run in different thread. Suppose 
ProcessBase pop event A in thread 1 and change ProcessBase.state to BLOCKED in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L2463
 . And not yet consume event B. Then event B reach and enqueue same Process to 
ProcessManager.runq in 
https://github.com/apache/mesos/blob/master/3rdparty/libprocess/src/process.cpp#L3008
 . Then ProcessManager dequeue it in thread 2 and pop event B and run event B 
while thread 1 not yet running event A consume function. Is this possible?

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998971#comment-14998971
 ] 

haosdent commented on MESOS-3870:
-

... Looks not possible. Only after consume event A, ProcessBase.state become 
BLOCKED.

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MESOS-3216) virtual memory exhausted:: Cannot allocate memory

2015-11-09 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3216:
---

Assignee: haosdent

> virtual memory exhausted:: Cannot allocate memory
> -
>
> Key: MESOS-3216
> URL: https://issues.apache.org/jira/browse/MESOS-3216
> Project: Mesos
>  Issue Type: Bug
>  Components: build
>Affects Versions: 0.23.0
> Environment: Linux Kudu 3.19.0-25-generic #26-Ubuntu SMP Fri Jul 24 
> 21:17:31 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
> (Ubuntu 15.04)
>Reporter: Samuel Marks
>Assignee: haosdent
>
> After receiving this error when building on a virtual instance, I decided to 
> build a package using https://github.com/deric/mesos-deb-packaging.
> Here is the last little bit of the output after running {{./build_mesos --ref 
> 0.23.0 --build-version p1}}:
> {code}
> mv -f common/.deps/libmesos_no_3rdparty_la-http.Tpo 
> common/.deps/libmesos_no_3rdparty_la-http.Plo
> /bin/bash ../libtool  --tag=CXX   --mode=compile g++ -DPACKAGE_NAME=\"mesos\" 
> -DPACKAGE_TARNAME=\"mesos\" -DPACKAGE_VERSION=\"0.23.0\" 
> -DPACKAGE_STRING=\"mesos\ 0.23.0\" -DPACKAGE_BUGREPORT=\"\" 
> -DPACKAGE_URL=\"\" -DPACKAGE=\"mesos\" -DVERSION=\"0.23.0\" -DSTDC_HEADERS=1 
> -DHAVE_SYS_TYPES_H=1 -DHAVE_SYS_STAT_H=1 -DHAVE_STDLIB_H=1 -DHAVE_STRING_H=1 
> -DHAVE_MEMORY_H=1 -DHAVE_STRINGS_H=1 -DHAVE_INTTYPES_H=1 -DHAVE_STDINT_H=1 
> -DHAVE_UNISTD_H=1 -DHAVE_DLFCN_H=1 -DLT_OBJDIR=\".libs/\" 
> -DHAVE_PTHREAD_PRIO_INHERIT=1 -DHAVE_PTHREAD=1 -DHAVE_LIBZ=1 -DHAVE_LIBCURL=1 
> -DHAVE_APR_POOLS_H=1 -DHAVE_LIBAPR_1=1 -DHAVE_SVN_VERSION_H=1 
> -DHAVE_LIBSVN_SUBR_1=1 -DHAVE_SVN_DELTA_H=1 -DHAVE_LIBSVN_DELTA_1=1 
> -DHAVE_LIBSASL2=1 -DMESOS_HAS_JAVA=1 -DHAVE_PYTHON=\"2.7\" 
> -DMESOS_HAS_PYTHON=1 -I. 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src   -Wall 
> -Werror -DLIBDIR=\"/usr/lib\" -DPKGLIBEXECDIR=\"/usr/libexec/mesos\" 
> -DPKGDATADIR=\"/usr/share/mesos\" 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/include 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/3rdparty/libprocess/include
>  
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/3rdparty/libprocess/3rdparty/stout/include
>  -I../include -I../include/mesos 
> -I../3rdparty/libprocess/3rdparty/boost-1.53.0 
> -I../3rdparty/libprocess/3rdparty/picojson-4f93734 
> -I../3rdparty/libprocess/3rdparty/protobuf-2.5.0/src 
> -I../3rdparty/libprocess/3rdparty/glog-0.3.3/src 
> -I../3rdparty/libprocess/3rdparty/glog-0.3.3/src 
> -I../3rdparty/leveldb/include -I../3rdparty/zookeeper-3.4.5/src/c/include 
> -I../3rdparty/zookeeper-3.4.5/src/c/generated 
> -I../3rdparty/libprocess/3rdparty/protobuf-2.5.0/src 
> -I/usr/include/subversion-1 -I/usr/include/apr-1 -I/usr/include/apr-1.0  
> -pthread -O2 -Wno-unused-local-typedefs -Wno-maybe-uninitialized -std=c++11 
> -MT master/allocator/libmesos_no_3rdparty_la-allocator.lo -MD -MP -MF 
> master/allocator/.deps/libmesos_no_3rdparty_la-allocator.Tpo -c -o 
> master/allocator/libmesos_no_3rdparty_la-allocator.lo `test -f 
> 'master/allocator/allocator.cpp' || echo 
> '/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src/'`master/allocator/allocator.cpp
> libtool: compile:  g++ -DPACKAGE_NAME=\"mesos\" -DPACKAGE_TARNAME=\"mesos\" 
> -DPACKAGE_VERSION=\"0.23.0\" "-DPACKAGE_STRING=\"mesos 0.23.0\"" 
> -DPACKAGE_BUGREPORT=\"\" -DPACKAGE_URL=\"\" -DPACKAGE=\"mesos\" 
> -DVERSION=\"0.23.0\" -DSTDC_HEADERS=1 -DHAVE_SYS_TYPES_H=1 
> -DHAVE_SYS_STAT_H=1 -DHAVE_STDLIB_H=1 -DHAVE_STRING_H=1 -DHAVE_MEMORY_H=1 
> -DHAVE_STRINGS_H=1 -DHAVE_INTTYPES_H=1 -DHAVE_STDINT_H=1 -DHAVE_UNISTD_H=1 
> -DHAVE_DLFCN_H=1 -DLT_OBJDIR=\".libs/\" -DHAVE_PTHREAD_PRIO_INHERIT=1 
> -DHAVE_PTHREAD=1 -DHAVE_LIBZ=1 -DHAVE_LIBCURL=1 -DHAVE_APR_POOLS_H=1 
> -DHAVE_LIBAPR_1=1 -DHAVE_SVN_VERSION_H=1 -DHAVE_LIBSVN_SUBR_1=1 
> -DHAVE_SVN_DELTA_H=1 -DHAVE_LIBSVN_DELTA_1=1 -DHAVE_LIBSASL2=1 
> -DMESOS_HAS_JAVA=1 -DHAVE_PYTHON=\"2.7\" -DMESOS_HAS_PYTHON=1 -I. 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/src -Wall 
> -Werror -DLIBDIR=\"/usr/lib\" -DPKGLIBEXECDIR=\"/usr/libexec/mesos\" 
> -DPKGDATADIR=\"/usr/share/mesos\" 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/include 
> -I/linked_replaced_actual_path/mesos-deb-packaging/mesos-repo/3rdparty/li

[jira] [Commented] (MESOS-3827) Improve compilation speed of GMock tests

2015-11-08 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995690#comment-14995690
 ] 

haosdent commented on MESOS-3827:
-

Sounds great!

> Improve compilation speed of GMock tests
> 
>
> Key: MESOS-3827
> URL: https://issues.apache.org/jira/browse/MESOS-3827
> Project: Mesos
>  Issue Type: Improvement
>Reporter: Neil Conway
>Assignee: Neil Conway
>Priority: Minor
>  Labels: mesosphere, tech-debt, testing
>
> The GMock docs suggest that moving the definition of mock classes' 
> constructors and destructors to a separate compilation unit can improve 
> compile performance: 
> https://code.google.com/p/googlemock/wiki/V1_7_CookBook#Making_the_Compilation_Faster



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3906) post-reviews.py does not correctly upload binary files

2015-11-12 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15003432#comment-15003432
 ] 

haosdent commented on MESOS-3906:
-

Hi, this is not post-reviews.py problem. Please apply this 
https://reviews.reviewboard.org/r/7571/diff/1#index_header to your rbtools.

> post-reviews.py does not correctly upload binary files
> --
>
> Key: MESOS-3906
> URL: https://issues.apache.org/jira/browse/MESOS-3906
> Project: Mesos
>  Issue Type: Bug
>  Components: reviewbot
>Reporter: Benjamin Bannier
>
> I was changing images in `docs/images`, and uploading commits with 
> `support/post-reviews.py`.
> After that the review request appeared with mangled diffs in ReviewBoard. 
> Sadly this was not apparent from the web ui since the images where not 
> displayed.
> I am using RBTools 0.7.2 under OS X 10.10.5.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3915) Upgrade vendored Boost to 1.59

2015-11-13 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15004260#comment-15004260
 ] 

haosdent commented on MESOS-3915:
-

+1

> Upgrade vendored Boost to 1.59
> --
>
> Key: MESOS-3915
> URL: https://issues.apache.org/jira/browse/MESOS-3915
> Project: Mesos
>  Issue Type: Bug
>Reporter: Neil Conway
>Priority: Minor
>  Labels: boost, mesosphere, tech-debt
>
> We should upgrade the vendored version of Boost to a newer version. Benefits:
> * Should properly fix MESOS-688
> * Should fix MESOS-3799
> * Generally speaking, using a more modern version of Boost means we can take 
> advantage of bug fixes, optimizations, and new features.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (MESOS-3928) ROOT tests fail on Mesos 0.26 on Ubuntu/CentOS

2015-11-14 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15005726#comment-15005726
 ] 

haosdent edited comment on MESOS-3928 at 11/15/15 4:55 AM:
---

Do you try disable iptables and change you centos-mesos-dev to a accessible ip 
instead of 127.0.0.1 in /etc/hosts when running 
DockerContainerizerTest.ROOT_DOCKER_Launch_Executor_Bridged? Because when use 
bridged mode, if slave listen on 127.0.0.1, docker container could not 
communicate with it.


was (Author: haosd...@gmail.com):
Do you try disable iptables and change you localhost to a accessible ip instead 
of 127.0.0.1 in /etc/hosts when running 
DockerContainerizerTest.ROOT_DOCKER_Launch_Executor_Bridged? Because when use 
bridged mode, if slave listen on 127.0.0.1, docker container could not 
communicate with it.

> ROOT tests fail on Mesos 0.26 on Ubuntu/CentOS
> --
>
> Key: MESOS-3928
> URL: https://issues.apache.org/jira/browse/MESOS-3928
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.26.0
>Reporter: Marco Massenzio
>Assignee: Bernd Mathiske
> Attachments: ROOT-tests-centos-7.1.log, ROOT-tests-ubuntu-14.04.log
>
>
> Running {{0.26.0-rc1}} on both CentOS 7.1 and Ubuntu 14.04 with {{sudo}} 
> privileges, causes segfaults when running Docker tests.
> Logs attached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3928) ROOT tests fail on Mesos 0.26 on Ubuntu/CentOS

2015-11-14 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15005726#comment-15005726
 ] 

haosdent commented on MESOS-3928:
-

Do you try disable iptables and change you localhost to a accessible ip instead 
of 127.0.0.1 in /etc/hosts when running 
DockerContainerizerTest.ROOT_DOCKER_Launch_Executor_Bridged? Because when use 
bridged mode, if slave listen on 127.0.0.1, docker container could not 
communicate with it.

> ROOT tests fail on Mesos 0.26 on Ubuntu/CentOS
> --
>
> Key: MESOS-3928
> URL: https://issues.apache.org/jira/browse/MESOS-3928
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.26.0
>Reporter: Marco Massenzio
>Assignee: Bernd Mathiske
> Attachments: ROOT-tests-centos-7.1.log, ROOT-tests-ubuntu-14.04.log
>
>
> Running {{0.26.0-rc1}} on both CentOS 7.1 and Ubuntu 14.04 with {{sudo}} 
> privileges, causes segfaults when running Docker tests.
> Logs attached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2262) Adding GPGPU resource into Mesos, so we can know if any GPU/Heterogeneous resource are available from slave

2015-11-15 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15005880#comment-15005880
 ] 

haosdent commented on MESOS-2262:
-

I think you could use the docker image provided by Nvidia 
https://github.com/NVIDIA/nvidia-docker and Marathon constraints operator to 
make sure your jobs only run in those machine have GPGPU resource. For more 
exactly control for GPGPU resource. So far Mesos don't provide this.

> Adding GPGPU resource into Mesos, so we can know if any GPU/Heterogeneous 
> resource are available from slave
> ---
>
> Key: MESOS-2262
> URL: https://issues.apache.org/jira/browse/MESOS-2262
> Project: Mesos
>  Issue Type: Task
>  Components: slave
> Environment: OpenCL support env, such as OS X, Linux, Windows..
>Reporter: chester kuo
>Assignee: chester kuo
>Priority: Minor
>
> Extending Mesos to support Heterogeneous resource such as GPGPU/FPGA..etc as 
> computing resources in the data-center, OpenCL will be first target to add 
> into Mesos (support by all major GPU vendor) , I will reserve to support 
> others such as CUDA in the future.
> In this feature, slave will be supported to do resources discover including 
> but not limited to, 
> (1) Heterogeneous Computing programming model : "OpenCL". "CUDA", "HSA"
> (2) Computing global memory (MB)
> (3) Computing run time version , such as "1.2" , "2.0"
> (4) Computing compute unit (double)
> (5) Computing device type : GPGPU, CPU, Accelerator device.
> (6) Computing (number of devices): (double)
> The Heterogeneous resource isolation will be supported in the framework 
> instead of in the slave devices side, the major reason here is , the 
> ecosystem , such as OpenCL operate on top of private device driver own by 
> vendors, only runtime library (OpenCL) is user-space application, so its hard 
> for us to do like Linux cgroup to have CPU/memory resource isolation. As a 
> result we may use run time library to do device isolation and memory 
> allocation.
> (PS, if anyone know how to do it for GPGPU driver, please drop me a note)
> Meanwhile, some run-time library (such as OpenCL) support to run on top of 
> CPU, so we need to use isolator API to notify this once it allocated.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Issue Comment Deleted] (MESOS-3815) docker executor not works when SSL enable

2015-11-16 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3815:

Comment: was deleted

(was: Patch: 
https://reviews.apache.org/r/39944/
https://reviews.apache.org/r/39945/)

> docker executor not works when SSL enable
> -
>
> Key: MESOS-3815
> URL: https://issues.apache.org/jira/browse/MESOS-3815
> Project: Mesos
>  Issue Type: Bug
>Reporter: haosdent
>Assignee: haosdent
>
> Because docker executor not pass SSL related environment variables, 
> mesos-docker-executor could not works normal when SSL enable. More details 
> could found in http://search-hadoop.com/m/0Vlr6DsslDSvVs72



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3815) docker executor not works when SSL enable

2015-11-16 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15007758#comment-15007758
 ] 

haosdent commented on MESOS-3815:
-

Let me delete my comment.

> docker executor not works when SSL enable
> -
>
> Key: MESOS-3815
> URL: https://issues.apache.org/jira/browse/MESOS-3815
> Project: Mesos
>  Issue Type: Bug
>Reporter: haosdent
>Assignee: haosdent
>
> Because docker executor not pass SSL related environment variables, 
> mesos-docker-executor could not works normal when SSL enable. More details 
> could found in http://search-hadoop.com/m/0Vlr6DsslDSvVs72



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3901) Enable Mesos to be able know when it is hosted behind a proxy with a URL prefix

2015-11-11 Thread Harpreet (JIRA)
Harpreet created MESOS-3901:
---

 Summary: Enable Mesos to be able know when it is hosted behind a 
proxy with a URL prefix
 Key: MESOS-3901
 URL: https://issues.apache.org/jira/browse/MESOS-3901
 Project: Mesos
  Issue Type: Improvement
  Components: webui
Reporter: Harpreet


If Mesos is run behind a proxy with a URL prefix e.g.  
https://:/services/mesos (`/services/mesos` being the URL 
prefix), sandboxes in mesos don't load. This happens because when
  Mesos is accessed through a proxy at 
https://:/services/mesos, Mesos tries to request slave state 
from 
https://:/slave/20151110-232502-218431498-5050-1234-S1/slave(1)/state.json?jsonp=angular.callbacks._4.
 This URL is missing the /services/mesos path prefix, so the request fails. 
Fixing this by rewriting URLs in the body of every response, would not be a 
clean solution and can be error prone.

After searching around a bit we've learned that this is apparently a common 
issue with webapps, because there is no standard specification for making them 
aware of their base URL path. Some will allow you to specify a base path in 
configuration[1], others will respect an X-Forwarded-Path header if a proxy 
provides it[2], and others don't handle this at all. 

It would be great to have explicit support in for this in Mesos.

[1] 
http://search.cpan.org/~bobtfish/Catalyst-TraitFor-Request-ProxyBase-0.05/lib/Catalyst/TraitFor/Request/ProxyBase.pm
[2] https://github.com/mattkenney/feedsquish/blob/master/rupta.py#L94





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3851) Investigate recent crashes in Command Executor

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998453#comment-14998453
 ] 

haosdent commented on MESOS-3851:
-

In the error log, registered and launchTaks have different thread id.
{noformat}
I1110 00:36:30.616987  5169 exec.cpp:306] Executor::launchTask took 160701ns
I1110 00:36:30.621285  5163 exec.cpp:222] Executor::registered took 399555ns
{noformat}

But in local test, these always have same thread id.

{noformat}
I1110 19:34:46.304114  8953 exec.cpp:222] Executor::registered took 182100ns
S1110 19:34:46.304416  8953 exec.cpp:306] Executor::launchTask took 47975ns
{noformat}
{noformat}
R1110 19:34:47.439801  9027 exec.cpp:222] Executor::registered took 257152ns
I1110 19:34:47.440234  9027 exec.cpp:306] Executor::launchTask took 111249ns
{noformat}
{noformat}
I1110 19:34:47.943961  9097 exec.cpp:222] Executor::registered took 271225ns
I1110 19:34:47.944284  9097 exec.cpp:306] Executor::launchTask took 45141ns
{noformat}

> Investigate recent crashes in Command Executor
> --
>
> Key: MESOS-3851
> URL: https://issues.apache.org/jira/browse/MESOS-3851
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Reporter: Anand Mazumdar
>Priority: Blocker
>  Labels: mesosphere
>
> Post https://reviews.apache.org/r/38900 i.e. updating CommandExecutor to 
> support rootfs. There seem to be some tests showing frequent crashes due to 
> assert violations.
> {{FetcherCacheTest.SimpleEviction}} failed due to the following log:
> {code}
> I1107 19:36:46.360908 30657 slave.cpp:1793] Sending queued task '3' to 
> executor ''3' of framework 7d94c7fb-8950-4bcf-80c1-46112292dcd6- at 
> executor(1)@172.17.5.200:33871'
> I1107 19:36:46.363682  1236 exec.cpp:297] 
> I1107 19:36:46.373569  1245 exec.cpp:210] Executor registered on slave 
> 7d94c7fb-8950-4bcf-80c1-46112292dcd6-S0
> @ 0x7f9f5a7db3fa  google::LogMessage::Fail()
> I1107 19:36:46.394081  1245 exec.cpp:222] Executor::registered took 395411ns
> @ 0x7f9f5a7db359  google::LogMessage::SendToLog()
> @ 0x7f9f5a7dad6a  google::LogMessage::Flush()
> @ 0x7f9f5a7dda9e  google::LogMessageFatal::~LogMessageFatal()
> @   0x48d00a  _CheckFatal::~_CheckFatal()
> @   0x49c99d  
> mesos::internal::CommandExecutorProcess::launchTask()
> @   0x4b3dd7  
> _ZZN7process8dispatchIN5mesos8internal22CommandExecutorProcessEPNS1_14ExecutorDriverERKNS1_8TaskInfoES5_S6_EEvRKNS_3PIDIT_EEMSA_FvT0_T1_ET2_T3_ENKUlPNS_11ProcessBaseEE_clESL_
> @   0x4c470c  
> _ZNSt17_Function_handlerIFvPN7process11ProcessBaseEEZNS0_8dispatchIN5mesos8internal22CommandExecutorProcessEPNS5_14ExecutorDriverERKNS5_8TaskInfoES9_SA_EEvRKNS0_3PIDIT_EEMSE_FvT0_T1_ET2_T3_EUlS2_E_E9_M_invokeERKSt9_Any_dataS2_
> @ 0x7f9f5a761b1b  std::function<>::operator()()
> @ 0x7f9f5a749935  process::ProcessBase::visit()
> @ 0x7f9f5a74d700  process::DispatchEvent::visit()
> @   0x48e004  process::ProcessBase::serve()
> @ 0x7f9f5a745d21  process::ProcessManager::resume()
> @ 0x7f9f5a742f52  
> _ZZN7process14ProcessManager12init_threadsEvENKUlRKSt11atomic_boolE_clES3_
> @ 0x7f9f5a74cf2c  
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEE6__callIvIEILm0T_OSt5tupleIIDpT0_EESt12_Index_tupleIIXspT1_EEE
> @ 0x7f9f5a74cedc  
> _ZNSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS3_EEEclIIEvEET0_DpOT_
> @ 0x7f9f5a74ce6e  
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEE9_M_invokeIIEEEvSt12_Index_tupleIIXspT_EEE
> @ 0x7f9f5a74cdc5  
> _ZNSt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS4_EEEvEEclEv
> @ 0x7f9f5a74cd5e  
> _ZNSt6thread5_ImplISt12_Bind_simpleIFSt5_BindIFZN7process14ProcessManager12init_threadsEvEUlRKSt11atomic_boolE_St17reference_wrapperIS6_EEEvEEE6_M_runEv
> @ 0x7f9f5624f1e0  (unknown)
> @ 0x7f9f564a8df5  start_thread
> @ 0x7f9f559b71ad  __clone
> I1107 19:36:46.551370 30656 containerizer.cpp:1257] Executor for container 
> '6553a617-6b4a-418d-9759-5681f45ff854' has exited
> I1107 19:36:46.551429 30656 containerizer.cpp:1074] Destroying container 
> '6553a617-6b4a-418d-9759-5681f45ff854'
> I1107 19:36:46.553869 30656 containerizer.cpp:1257] Executor for container 
> 'd2c1f924-c92a-453e-82b1-c294d09c4873' has exited
> {code}
> The r

[jira] [Commented] (MESOS-2376) Allow libprocess ip and port to be configured

2015-11-10 Thread Dimitri (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998645#comment-14998645
 ] 

Dimitri commented on MESOS-2376:


[~hfaran] What do you mean ? 
I am trying to set the mesos binding ip to something more secure than 0.0.0.0. 
I am running mesos inside a docker container, should I bind in the docker 
environment or on the host ? I have tried both, none did work. I haven't tried 
to set LIBPROCESS_PORT since i am just  trying to change the interface.

I am quite suprirse by the lake of documentation for this feature, which is to 
me something that cannot make mesos usuable to production.

> Allow libprocess ip and port to be configured
> -
>
> Key: MESOS-2376
> URL: https://issues.apache.org/jira/browse/MESOS-2376
> Project: Mesos
>  Issue Type: Improvement
>  Components: java api
>Reporter: Dario Rexin
>Priority: Minor
>
> Currently if we want to configure the ip libprocess uses for communication, 
> we have to set the env var LIBPROCESS_IP, or LIBPROCESS_PORT for the port. 
> For the Java API this means, that the variable has to be set before the JVM 
> is started, because setting env vars from within JAVA is not possible / 
> non-trivial. Therefore it would be great to be able to pass them in to the 
> constructor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3870) Prevent out-of-order libprocess message delivery

2015-11-10 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998605#comment-14998605
 ] 

haosdent commented on MESOS-3870:
-

I think although send could be ordered when only have a connection, but execute 
also could be out-of-order in receiver. ProcessManager would create 8~cpu 
number work threads to handle input messages. Because ProcessManager dispatch 
work by event, same Process would be called in different thread for different 
event.

> Prevent out-of-order libprocess message delivery
> 
>
> Key: MESOS-3870
> URL: https://issues.apache.org/jira/browse/MESOS-3870
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Neil Conway
>Priority: Minor
>  Labels: mesosphere
>
> I was under the impression that {{send()}} provided in-order, unreliable 
> message delivery. So if P1 sends <M1,M2> to P2, P2 might see <>, , , 
> or <M1,M2> — but not <M2,M1>.
> I suspect much of the code makes a similar assumption. However, it appears 
> that this behavior is not guaranteed. slave.cpp:2217 has the following 
> comment:
> {noformat}
>   // TODO(jieyu): Here we assume that CheckpointResourcesMessages are
>   // ordered (i.e., slave receives them in the same order master sends
>   // them). This should be true in most of the cases because TCP
>   // enforces in order delivery per connection. However, the ordering
>   // is technically not guaranteed because master creates multiple
>   // connections to the slave in some cases (e.g., persistent socket
>   // to slave breaks and master uses ephemeral socket). This could
>   // potentially be solved by using a version number and rejecting
>   // stale messages according to the version number.
> {noformat}
> We can improve this situation by _either_: (1) fixing libprocess to guarantee 
> ordered message delivery, e.g., by adding a sequence number, or (2) 
> clarifying that ordered message delivery is not guaranteed, and ideally 
> providing a tool to force messages to be delivered out-of-order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3815) os environment variables not passing to docker-executor environment variables correctly

2015-11-01 Thread haosdent (JIRA)
haosdent created MESOS-3815:
---

 Summary: os environment variables not passing to docker-executor 
environment variables correctly
 Key: MESOS-3815
 URL: https://issues.apache.org/jira/browse/MESOS-3815
 Project: Mesos
  Issue Type: Bug
Reporter: haosdent






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MESOS-3815) os environment variables not passing to docker-executor environment variables correctly

2015-11-01 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3815:
---

Assignee: haosdent

> os environment variables not passing to docker-executor environment variables 
> correctly
> ---
>
> Key: MESOS-3815
> URL: https://issues.apache.org/jira/browse/MESOS-3815
> Project: Mesos
>  Issue Type: Bug
>Reporter: haosdent
>Assignee: haosdent
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3740) LIBPROCESS_IP not passed to Docker containers

2015-11-01 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14984315#comment-14984315
 ] 

haosdent commented on MESOS-3740:
-

Hi, [~cmaloney] A little thing I could not understand here, in which case, we 
need pass LIBPROCESS_IP to docker containers? Because only 
mesos-docker-executor would communicate with agent and docker container is 
managed by mesos-docker-executor. Seems the docker containers not need know 
LIBPROCESS_IP here. Or you refer to launch a seperate docker container to run 
the executor when we specify docker_mesos_image flag?

> LIBPROCESS_IP not passed to Docker containers
> -
>
> Key: MESOS-3740
> URL: https://issues.apache.org/jira/browse/MESOS-3740
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization, docker
>Affects Versions: 0.25.0
> Environment: Mesos 0.24.1
>Reporter: Cody Maloney
>Assignee: Michael Park
>  Labels: mesosphere
>
> Docker containers aren't currently passed all the same environment variables 
> that Mesos Containerizer tasks are. See: 
> https://github.com/apache/mesos/blob/master/src/slave/containerizer/containerizer.cpp#L254
>  for all the environment variables explicitly set for mesos containers.
> While some of them don't necessarily make sense for docker containers, when 
> the docker has inside of it a libprocess process (A mesos framework 
> scheduler) and is using {{--net=host}} the task needs to have LIBPROCESS_IP 
> set otherwise the same sort of problems that happen because of MESOS-3553 can 
> happen (libprocess will try to guess the machine's IP address with likely bad 
> results in a number of operating environment).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (MESOS-3817) Rename offers to outstanding offers

2015-11-02 Thread haosdent (JIRA)
haosdent created MESOS-3817:
---

 Summary: Rename offers to outstanding offers
 Key: MESOS-3817
 URL: https://issues.apache.org/jira/browse/MESOS-3817
 Project: Mesos
  Issue Type: Bug
  Components: webui
Reporter: haosdent


As discussion in http://search-hadoop.com/m/0Vlr6NFAux1DPmxp , we need rename 
offers to outstanding offers in webui to avoid user confuse.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2960) Configure DiscoveryInfo and Visibility per port

2015-11-02 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14985342#comment-14985342
 ] 

haosdent commented on MESOS-2960:
-

ping [~adam-mesos][~mcypark] Could you help review the patch?

> Configure DiscoveryInfo and Visibility per port
> ---
>
> Key: MESOS-2960
> URL: https://issues.apache.org/jira/browse/MESOS-2960
> Project: Mesos
>  Issue Type: Improvement
>  Components: general
>Affects Versions: 0.22.1
>Reporter: Frank Scholten
>Assignee: haosdent
>
> For Mesos Elasticsearch I like to use DiscoveryInfo to advertise the client 
> port (9200) with Visibility=EXTERNAL so it can be discovered by load 
> balancers while advertising the transport port (9300) as Visibility=FRAMEWORK 
> because it is used by nodes to talk too each other and should not be load 
> balanced.
> However, I can only set one DiscoveryInfo and one visibility, instead of one 
> per port. I suggest to allow multiple DiscoveryInfo's to be configured with 
> their own visibility.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3822) Cannot specify multiple masters when slave start up

2015-11-02 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14986760#comment-14986760
 ] 

haosdent commented on MESOS-3822:
-

I think this is just because document not sync with code. Need change to 
{noformat}
 hostname or ip to a master e.g.,
--master=localhost:5050


{noformat}

> Cannot specify multiple masters  when slave start up
> 
>
> Key: MESOS-3822
> URL: https://issues.apache.org/jira/browse/MESOS-3822
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.26.0
>Reporter: Guangya Liu
>  Labels: docuentation
> Fix For: 0.26.0
>
>
> This bug is reported from user list, the document should be updated by 
> removing multiple masters when slave start up.
> /usr/sbin/mesos-slave --master=IP1:5050,IP2:5050,IP3:5050 …. --credential …
> Only if IP1 is the leader, the slave can register to master successfully, Or 
> it will register fail.
> Slave log like this:
> Creating new client SASL connection
> Authentication timed out
> Failed to authenticate with master master@172.31.43.77:5050: Authentication 
> discarded
> Authenticating with master master@172.31.43.77:5050
> Using default CRAM-MD5 authenticatee
> Is this a bug?Or it is designed like this.
> BTW: --master:zk://xxx work well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3821) DOCKER_HOST does not work well with --executor_environment_variables

2015-11-02 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14986758#comment-14986758
 ] 

haosdent commented on MESOS-3821:
-

hmm, let me submit a patch to this later.

> DOCKER_HOST does not work well with --executor_environment_variables
> 
>
> Key: MESOS-3821
> URL: https://issues.apache.org/jira/browse/MESOS-3821
> Project: Mesos
>  Issue Type: Bug
>  Components: docker
>Affects Versions: 0.25.0
> Environment: Docker 1.7.1
> Mesos 0.25.0
>Reporter: Lei Xu
>
> Hi guys,
> I found that DOCKER_HOST does not work now if I set 
> bq. --executor_environment_variables={"DOCKER_HOST":"localhost:2377"}
> but the docker executor always append 
> bq. -H unix:///var/run/docker.sock 
> on each command, it will overwrite the DOCKER_HOST in fact.
> I think it is too strict now, and I could not disable it via some command 
> flags.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (MESOS-3821) DOCKER_HOST does not work well with --executor_environment_variables

2015-11-02 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent reassigned MESOS-3821:
---

Assignee: haosdent

> DOCKER_HOST does not work well with --executor_environment_variables
> 
>
> Key: MESOS-3821
> URL: https://issues.apache.org/jira/browse/MESOS-3821
> Project: Mesos
>  Issue Type: Bug
>  Components: docker
>Affects Versions: 0.25.0
> Environment: Docker 1.7.1
> Mesos 0.25.0
>Reporter: Lei Xu
>Assignee: haosdent
>
> Hi guys,
> I found that DOCKER_HOST does not work now if I set 
> bq. --executor_environment_variables={"DOCKER_HOST":"localhost:2377"}
> but the docker executor always append 
> bq. -H unix:///var/run/docker.sock 
> on each command, it will overwrite the DOCKER_HOST in fact.
> I think it is too strict now, and I could not disable it via some command 
> flags.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3822) Cannot specify multiple masters when slave start up

2015-11-02 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3822:

Labels: docuentation  (was: )

> Cannot specify multiple masters  when slave start up
> 
>
> Key: MESOS-3822
> URL: https://issues.apache.org/jira/browse/MESOS-3822
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.26.0
>Reporter: Guangya Liu
>  Labels: docuentation
> Fix For: 0.26.0
>
>
> This bug is reported from user list, the document should be updated by 
> removing multiple masters when slave start up.
> /usr/sbin/mesos-slave --master=IP1:5050,IP2:5050,IP3:5050 …. --credential …
> Only if IP1 is the leader, the slave can register to master successfully, Or 
> it will register fail.
> Slave log like this:
> Creating new client SASL connection
> Authentication timed out
> Failed to authenticate with master master@172.31.43.77:5050: Authentication 
> discarded
> Authenticating with master master@172.31.43.77:5050
> Using default CRAM-MD5 authenticatee
> Is this a bug?Or it is designed like this.
> BTW: --master:zk://xxx work well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3829) Error on gracefully shutdown task

2015-11-04 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990959#comment-14990959
 ] 

haosdent commented on MESOS-3829:
-

DOCKER_STOP_TIMEOUT is used for 
http://docs.docker.com/engine/reference/commandline/stop/. According docker 
document, you could receive SIGTERM before SIGKILL.

> Error on gracefully shutdown task
> -
>
> Key: MESOS-3829
> URL: https://issues.apache.org/jira/browse/MESOS-3829
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.25.0
> Environment: Marathon: 0.12.0-RC1
> Mesos: 0.25.0
> Docker 1.9.0
>Reporter: Rafael Capucho
>
> Hello,
> I'm suffering from the same error reported here[1]. I have configured my 
> mesos-slave environment as [2] setting DOCKER_STOP_TIMEOUT and 
> EXECUTOR_SHUTDOWN_GRACE_PERIOD.
> When I see the sandbox stdout, I can see in the first line the declaration:
> --stop_timeout="30secs"
> properly configured, but when I click in "Destroy App" in Marathon the stdout 
> keep showing weird things[3] like repeatedly "Killing docker task Shutting 
> down".
> In my code I deal with SIGTERM and it isn't being reached.
> Thank you.
> [1] - 
> https://groups.google.com/forum/?hl=en#!topic/marathon-framework/Oy0dN0Lron0
> [2] - https://paste.ee/r/grRyS
> [3] - https://paste.ee/r/SghOr



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3829) Error on gracefully shutdown task

2015-11-04 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14990960#comment-14990960
 ] 

haosdent commented on MESOS-3829:
-

Maybe because of this https://github.com/docker/docker/pull/3240 ? I think it 
maybe a docker issue.

> Error on gracefully shutdown task
> -
>
> Key: MESOS-3829
> URL: https://issues.apache.org/jira/browse/MESOS-3829
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.25.0
> Environment: Marathon: 0.12.0-RC1
> Mesos: 0.25.0
> Docker 1.9.0
>Reporter: Rafael Capucho
>
> Hello,
> I'm suffering from the same error reported here[1]. I have configured my 
> mesos-slave environment as [2] setting DOCKER_STOP_TIMEOUT and 
> EXECUTOR_SHUTDOWN_GRACE_PERIOD.
> When I see the sandbox stdout, I can see in the first line the declaration:
> --stop_timeout="30secs"
> properly configured, but when I click in "Destroy App" in Marathon the stdout 
> keep showing weird things[3] like repeatedly "Killing docker task Shutting 
> down".
> In my code I deal with SIGTERM and it isn't being reached.
> Thank you.
> [1] - 
> https://groups.google.com/forum/?hl=en#!topic/marathon-framework/Oy0dN0Lron0
> [2] - https://paste.ee/r/grRyS
> [3] - https://paste.ee/r/SghOr



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3829) Error on gracefully shutdown task

2015-11-05 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14991926#comment-14991926
 ] 

haosdent commented on MESOS-3829:
-

Looks correct.

> Error on gracefully shutdown task
> -
>
> Key: MESOS-3829
> URL: https://issues.apache.org/jira/browse/MESOS-3829
> Project: Mesos
>  Issue Type: Bug
>Affects Versions: 0.25.0
> Environment: Marathon: 0.12.0-RC1
> Mesos: 0.25.0
> Docker 1.9.0
>Reporter: Rafael Capucho
>
> Hello,
> I'm suffering from the same error reported here[1]. I have configured my 
> mesos-slave environment as [2] setting DOCKER_STOP_TIMEOUT and 
> EXECUTOR_SHUTDOWN_GRACE_PERIOD.
> When I see the sandbox stdout, I can see in the first line the declaration:
> --stop_timeout="30secs"
> properly configured, but when I click in "Destroy App" in Marathon the stdout 
> keep showing weird things[3] like repeatedly "Killing docker task Shutting 
> down".
> In my code I deal with SIGTERM and it isn't being reached.
> Thank you.
> [1] - 
> https://groups.google.com/forum/?hl=en#!topic/marathon-framework/Oy0dN0Lron0
> [2] - https://paste.ee/r/grRyS
> [3] - https://paste.ee/r/SghOr



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (MESOS-3462) Containerization issues with mesos running on CoreOS

2015-11-07 Thread haosdent (JIRA)

 [ 
https://issues.apache.org/jira/browse/MESOS-3462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

haosdent updated MESOS-3462:

Assignee: (was: haosdent)

> Containerization issues with mesos running on CoreOS
> 
>
> Key: MESOS-3462
> URL: https://issues.apache.org/jira/browse/MESOS-3462
> Project: Mesos
>  Issue Type: Bug
>  Components: containerization
>Affects Versions: 0.24.0
> Environment: CoreOS 801.0.0 64-bit
>Reporter: Francis Chuang
>
> These are the steps to I used to build mesos 0.24.0 on Ubuntu 15.04 64-bit:
> wget http://www.apache.org/dist/mesos/0.24.0/mesos-0.24.0.tar.gz
> wget http://mirror.ventraip.net.au/apache/apr/apr-1.5.2.tar.gz
> wget http://mirror.ventraip.net.au/apache/apr/apr-util-1.5.4.tar.gz
> wget http://mirror.ventraip.net.au/apache/subversion/subversion-1.9.0.tar.gz
> wget http://www.sqlite.org/sqlite-amalgamation-3071501.zip
> wget ftp://ftp.cyrusimap.org/cyrus-sasl/cyrus-sasl-2.1.26.tar.gz
> mkdir /tmp/mesos-build
> cd /tmp/mesos-build
> - Build apr
> tar zxf apr-$APR_VERSION.tar.gz
> cd apr-$APR_VERSION
> ./configure CC=gcc-4.8 --prefix=/tmp/mesos-build/apr
> make
> make install
> cd ..
> - Build apr-util
> tar zxf apr-util-$APR_UTIL_VERSION.tar.gz
> cd apr-util-$APR_UTIL_VERSION
> ./configure CC=gcc-4.8 --prefix=/tmp/mesos-build/apr-util 
> --with-apr=/tmp/mesos-build/apr
> make
> make install
> cd ..
> - Build libsasl2
> tar zxf cyrus-sasl-$SASL_VERSION.tar.gz
> cd cyrus-sasl-$SASL_VERSION
> ./configure CC=gcc-4.8 CPPFLAGS=-I/usr/include/openssl 
> --prefix=/tmp/mesos-build/sasl2 --enable-cram
> make
> make install
> cd ..
> - Build subversion
> tar zxf subversion-$SVN_VERSION.tar.gz
> unzip sqlite-amalgamation-$SQLITE_AMALGATION_VERSION.zip
> mv sqlite-amalgamation-$SQLITE_AMALGATION_VERSION/ 
> subversion-$SVN_VERSION/sqlite-amalgamation/
> cd subversion-$SVN_VERSION
> ./configure CC=gcc-4.8 CXX=g++-4.8 --prefix=/tmp/mesos-build/svn 
> --with-apr=/tmp/mesos-build/apr --with-apr-util=/tmp/mesos-build/apr-util 
> --with-sasl=/tmp/mesos-build/sasl2
> make
> make install
> cd ..
> - Build curl
> tar zxf curl-$CURL_VERSION.tar.gz
> cd curl-$CURL_VERSION
> ./configure CC=gcc-4.8 --prefix=/tmp/mesos-build/curl
> make
> make install
> cd ..
> - Build mesos
> tar zxf mesos-$MESOS_VERSION.tar.gz
> cd mesos-$MESOS_VERSION
> mkdir build
> cd build
> ../configure CC=gcc-4.8 CXX=g++-4.8 
> LD_LIBRARY_PATH=/tmp/mesos-build/sasl2/lib 
> SASL_PATH=/tmp/mesos-build/sasl2/lib/sasl2 --prefix=/tmp/mesos-build/mesos 
> --with-svn=/tmp/mesos-build/svn --with-apr=/tmp/mesos-build/apr 
> --with-sasl=/tmp/mesos-build/sasl2/ --with-curl=/tmp/mesos-build/curl
> make
> make install
> cd ..
> cd ..
> - Copy shared objects into mesos build
> cp apr/lib/libapr-1.so.0.5.2 mesos/lib/libapr-1.so.0
> cp apr-util/lib/libaprutil-1.so.0.5.4 mesos/lib/libaprutil-1.so.0
> cp sasl2/lib/libsasl2.so.3.0.0 mesos/lib/libsasl2.so.3
> cp svn/lib/libsvn_delta-1.so.0.0.0 mesos/lib/libsvn_delta-1.so.0
> cp svn/lib/libsvn_subr-1.so.0.0.0 mesos/lib/libsvn_subr-1.so.0
> I then compress the build into an archive and distributed it onto my CoreOS 
> nodes.
> Once I have the archive extracted on each node, I start the master and slaves:
> /opt/mesos/sbin/mesos-master --zk=zk://192.168.33.10/mesos --quorum=1 
> --hostname=192.168.33.10 --ip=192.168.33.10 
> --webui_dir=/opt/mesos/share/mesos/webui --cluster=mesos
> /opt/mesos/sbin/mesos-slave --hostname=192.168.33.11 --ip=192.168.33.11 
> --master=zk://192.168.33.10/mesos 
> --executor_environment_variables='{"LD_LIBRARY_PATH": "/opt/mesos/lib", 
> "PATH": "/opt/java/bin:/usr/sbin:/usr/bin"}' --containerizers=docker,mesos 
> --executor_registration_timeout=60mins 
> --launcher_dir=/opt/mesos/libexec/mesos/
> In addition, the following environment variables are set:
> LD_LIBRARY_PATH=/opt/mesos/lib/
> JAVA_HOME=/opt/java
> MESOS_NATIVE_JAVA_LIBRARY=/opt/mesos/lib/libmesos.so
> I am finding that when I run meso-hdfs from 
> https://github.com/mesosphere/hdfs, the scheduler starts properly and 
> launches the executors. However, the executors will fail and terminate 
> without writing any error to stderr and stdout.
> I have reproduced the same problem with mesos 0.24, 0.23 and 0.22.1
> If I install mesos onto a Ubuntu machine (tried 14.04 and 15.04 64-bit) using 
> the apt-repositories, this problem does not happen.
> I am not well-versed with mesos internals, but it was pointed out that it's 
> most likely a containerization issue. This issue on github documents the 
> process of trying to get mesos-hdfs to work on my compiled mesos binaries: 
> https://github.com/mesosphere/hdfs/issues/194



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-809) External control of the ip that Mesos components publish to zookeeper

2015-11-06 Thread Dimitri (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14994998#comment-14994998
 ] 

Dimitri commented on MESOS-809:
---

Sorry for interupting. I am trying hard to make mesos bind to a local interface 
instead of the public one. 
I have found this thread where it says that binding with env will do the trick.

LIBPROCESS_ADVERTISE_IP: "10.0.0.3"
CONSUL_IP:   "10.0.0.3"

I have tried both, but none work. 
I am using mesos with PanteraS, It's a bundled docker image for mesos. I have 
tried with the env on the host and in the panteras image, none work. could I 
have some help ?

> External control of the ip that Mesos components publish to zookeeper
> -
>
> Key: MESOS-809
> URL: https://issues.apache.org/jira/browse/MESOS-809
> Project: Mesos
>  Issue Type: Improvement
>  Components: framework, master, slave
>Affects Versions: 0.14.2
>Reporter: Khalid Goudeaux
>Assignee: Anindya Sinha
>Priority: Minor
> Fix For: 0.24.0
>
>
> With tools like Docker making containers more manageable, it's tempting to 
> use containers for all software installation. The CoreOS project is an 
> example of this.
> When an application is run inside a container it sees a different ip/hostname 
> from the host system running the container. That ip is only valid from inside 
> that host, no other machine can see it.
> From inside a container, the Mesos master and slave publish that private ip 
> to zookeeper and as a result they can't find each other if they're on 
> different machines. The --ip option can't help because the public ip isn't 
> available for binding from within a container.
> Essentially, from inside the container, mesos processes don't know the ip 
> they're available at (they may not know the port either).
> It would be nice to bootstrap the processes with the correct ip for them to 
> publish to zookeeper.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-2587) libprocess should allow configuration of ip/port separate from the ones it binds to

2015-11-06 Thread Dimitri (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-2587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14994999#comment-14994999
 ] 

Dimitri commented on MESOS-2587:


Sorry for interupting. I am trying hard to make mesos bind to a local interface 
instead of the public one. 
I have found this thread where it says that binding with env will do the trick.

LIBPROCESS_ADVERTISE_IP: "10.0.0.3"
CONSUL_IP:   "10.0.0.3"

I have tried both, but none work. 
I am using mesos with PanteraS, It's a bundled docker image for mesos. I have 
tried with the env on the host and in the panteras image, none work. could I 
have some help ?

> libprocess should allow configuration of ip/port separate from the ones it 
> binds to
> ---
>
> Key: MESOS-2587
>     URL: https://issues.apache.org/jira/browse/MESOS-2587
> Project: Mesos
>  Issue Type: Bug
>  Components: libprocess
>Reporter: Cosmin Lehene
>
> Currently libprocess will advertise {{LIBPROCESS_IP}}{{LIBPROCESS_PORT}}, but 
> if a framework runs in a container without an an interface that has a 
> publicly accessible IP (e.g. a container in bridge mode) it will advertise an 
> IP that will not be reachable by master.
> With this, we could advertise the external IP (reachable from master) of the 
> bridge from within a container. 
> This should allow frameworks running in containers to work in the safer 
> bridged mode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3815) docker executor not works when SSL enable

2015-11-07 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995123#comment-14995123
 ] 

haosdent commented on MESOS-3815:
-

Use exactly environment variables. https://reviews.apache.org/r/40046/diff/1

> docker executor not works when SSL enable
> -
>
> Key: MESOS-3815
> URL: https://issues.apache.org/jira/browse/MESOS-3815
> Project: Mesos
>  Issue Type: Bug
>Reporter: haosdent
>Assignee: haosdent
>
> Because docker executor not pass SSL related environment variables, 
> mesos-docker-executor could not works normal when SSL enable. More details 
> could found in http://search-hadoop.com/m/0Vlr6DsslDSvVs72



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-11-03 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14988693#comment-14988693
 ] 

haosdent commented on MESOS-3787:
-

Because have two process?

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
> URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
> Attachments: mesos.patch, test-example.json
>
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3787) As a developer, I'd like to be able to expand environment variables through the Docker executor.

2015-11-03 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14988726#comment-14988726
 ] 

haosdent commented on MESOS-3787:
-

I notice the second item in you post about "ps" have already show the variables.

> As a developer, I'd like to be able to expand environment variables through 
> the Docker executor.
> 
>
> Key: MESOS-3787
> URL: https://issues.apache.org/jira/browse/MESOS-3787
> Project: Mesos
>  Issue Type: Wish
>Reporter: John Garcia
>  Labels: mesosphere
> Attachments: mesos.patch, test-example.json
>
>
> We'd like to have expanded variables usable in [the json files used to create 
> a Marathon app, hence] the Task's CommandInfo, so that the executor is able 
> to detect the correct values at runtime.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


<    5   6   7   8   9   10   11   12   13   14   >