> On Oct. 24, 2014, 6:08 p.m., Vinod Kone wrote:
> > src/master/master.cpp, line 2301
> > <https://reviews.apache.org/r/26382/diff/7/?file=727958#file727958line2301>
> >
> >     why is this TASK_ERROR instead of TASK_LOST? A relaunch would likely 
> > succeed here right?
> 
> Dominic Hamon wrote:
>     I thought if the offer id is invalid it likely would be again next time. 
> Is that not the case?

no. they cannot reuse an offer in subsequent calls to launchTasks(). so the 
only possible way to relaunch the task would be to use a new offer.


> On Oct. 24, 2014, 6:08 p.m., Vinod Kone wrote:
> > src/common/http.cpp, line 103
> > <https://reviews.apache.org/r/26382/diff/7/?file=727941#file727941line103>
> >
> >     what happens if this is not set?
> 
> Dominic Hamon wrote:
>     nothing - i thought it was a useful thing to export in the JSON model 
> though.

nothing, as in empty? or does it return a default source (enum 0)? if it's the 
latter it's a problem. you should guard this statement with status.has_source().


> On Oct. 24, 2014, 6:08 p.m., Vinod Kone wrote:
> > include/mesos/mesos.proto, line 636
> > <https://reviews.apache.org/r/26382/diff/7/?file=727940#file727940line636>
> >
> >     I think this will become a crutch. We should know the reasons for why a 
> > task is lost. i.e., kill this.
> 
> Dominic Hamon wrote:
>     it still might be 'unknown' in cases where the caller doesn't set the 
> reason. is that acceptable?

the caller is mesos code (master, slave) which should always know the reason. 
do you have an example?


> On Oct. 24, 2014, 6:08 p.m., Vinod Kone wrote:
> > include/mesos/mesos.proto, lines 624-629
> > <https://reviews.apache.org/r/26382/diff/7/?file=727940#file727940line624>
> >
> >     Pull this one and Reason inside TaskStatus.
> >     
> >     Also kill the SOURCE_ and REASON_ prefixes.
> 
> Dominic Hamon wrote:
>     and not enum TaskState? I was following that model.
>     
>     i'd like to keep the prefixes as it makes it easy to be sure in the 
> callsite that you have the right enum. Autocomplete then works nicely too.

The name "TaskState" is fully qualified and it's easy to understand what it 
represents, whereas a top level "Reason" and "Source" doesn't tell us what it 
is referring to.

Having SOURCE_ prefix inside a Source namespace (enum) seems a bit redundant. 
Callers already do set_*source(), so I don't think it is that ambigous. But 
it's upto you.


- Vinod


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/26382/#review58174
-----------------------------------------------------------


On Oct. 27, 2014, 5:18 p.m., Dominic Hamon wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/26382/
> -----------------------------------------------------------
> 
> (Updated Oct. 27, 2014, 5:18 p.m.)
> 
> 
> Review request for mesos, Vinod Kone and Bill Farner.
> 
> 
> Bugs: MESOS-1830 and MESOS-343
>     https://issues.apache.org/jira/browse/MESOS-1830
>     https://issues.apache.org/jira/browse/MESOS-343
> 
> 
> Repository: mesos-git
> 
> 
> Description
> -------
> 
> Added source and reason, enabled TASK_ERROR, and made the changes necessary 
> throughout the codebase.
> 
> 
> Diffs
> -----
> 
>   include/mesos/mesos.proto 6b93e9000761857c4f335f2a8c8088e155078f54 
>   src/common/protobuf_utils.hpp 212d5124b9a4cc58e61719fa7f07a61cd166e834 
>   src/common/protobuf_utils.cpp a9b65e328c4c62bff7fbf5633dda25d742d79019 
>   src/examples/balloon_framework.cpp b05d5679fe2915142907af0b2dc00c6cd76eb9c1 
>   src/examples/java/TestFramework.java 
> bc593d0abfacb00690b1492b2b82c970f4e4de6d 
>   src/examples/low_level_scheduler_libprocess.cpp 
> 7ef5ea78ade4ed856b97009fdfe31281f0a55c17 
>   src/examples/low_level_scheduler_pthread.cpp 
> 6e233a10117a1c7aa669806b5b430e746e227ee5 
>   src/examples/no_executor_framework.cpp 
> f98a0735b9f287e7f1bf98af6c2e9a47ca6a77b2 
>   src/examples/test_framework.cpp 187a611ebfe35cb13ee48aa5eca934cf55f34dea 
>   src/master/master.cpp 9ebdc357fe679ed5bd54e0f4b9091ab9bcc2b00f 
>   src/scaling/nested_exec.py 17e61706c75e6e849b0c40ae5232d8dc60804c55 
>   src/sched/sched.cpp 0fb8c7bda75545389f8024489b3c76ae115111f4 
>   src/slave/slave.cpp 5e7c107aa8f36aff954fa6a46c431a20c313ebc2 
>   src/tests/common/http_tests.cpp 912653b26615c86cc0204d80f58e6046c4b91a98 
>   src/tests/fault_tolerance_tests.cpp 
> a18a41a3e34ff112e04e693447d757403e5013bd 
>   src/tests/master_authorization_tests.cpp 
> 652e80d0d4567b225c6ffb326725ddfde06f7fd3 
>   src/tests/master_tests.cpp f60e3761e5a558d379c56513937d2f59e898a490 
>   src/tests/reconciliation_tests.cpp 4ba53940951584d9baa2c7aa4e124814471206bc 
>   src/tests/resource_offers_tests.cpp 
> fe66432b1bf75ee25feb73c4bb353e4d4e5b503f 
> 
> Diff: https://reviews.apache.org/r/26382/diff/
> 
> 
> Testing
> -------
> 
> make check
> 
> 
> Thanks,
> 
> Dominic Hamon
> 
>

Reply via email to