Ahem. Yes. I had meant 300,000 ;) Cheers,
Jim On 28 October 2015 at 12:10, Rad Gruchalski <[email protected]> wrote: > However, it should be 300000, not 30000. It’s milliseconds - 5 mins in > milliseconds is 300000. > > Optional. Default: 300000 (5 minutes) > > Kind regards, > Radek Gruchalski > [email protected] <[email protected]> > de.linkedin.com/in/radgruchalski/ > > > *Confidentiality:*This communication is intended for the above-named > person and may be confidential and/or legally privileged. > If it has come to you in error you must take no action based on it, nor > must you copy or show it to anyone; please delete/destroy and inform the > sender immediately. > > On Wednesday, 28 October 2015 at 13:08, Rad Gruchalski wrote: > > Nobody getting those today ;) Good catch. Worth keeping in mind! > > Kind regards, > Radek Gruchalski > [email protected] <[email protected]> > de.linkedin.com/in/radgruchalski/ > > > *Confidentiality:*This communication is intended for the above-named > person and may be confidential and/or legally privileged. > If it has come to you in error you must take no action based on it, nor > must you copy or show it to anyone; please delete/destroy and inform the > sender immediately. > > On Wednesday, 28 October 2015 at 13:06, James Vanns wrote: > > I shall fix my own problem.... it's embarrassing. Top marks to those of > you that notice I supplied 3000 instead of 30000 (which I understand is > actually the default anyway) to task_launch_timeout! > > Jim > > > On 28 October 2015 at 10:21, James Vanns <[email protected]> wrote: > > Hi all. > > Mesos version = 0.23.0-1.0.ubuntu1404 (mesosphere APT repo) > Marathon version = 0.10.1 (mesosphere APT repo) > > Hopefully this is a simple one for someone to answer, though I couldn't > find anything immediately > obvious in the documentation. We're trialling Mesos in a cloud (EC2/GCE) > environment and the one > thing that continues to bite us in the ass is this; continued task > failures until the docker image is > fully downloaded! Why is this!? Some of our images a small (say 200MB), > some much larger (2GB) > due to the nature of the software packages we're containerising. > Regardless of this size, they fail the > first dozen (or more) times until one of the slaves has pulled the image. > Why is there an apparent > hard time-out and how can I avoid it? I don't want the task to register as > a fail - it hasn't even had a > chance to run yet! Up until now we've just been tolerating the bouncing > around of these tasks but it's > now reached a point where it's darn annoying ;) > > I've tried setting executor_registration_timeout to '5mins' but this made > no apparent difference (every > minute the task is killed still). I should note that these tasks are > launched using the Marathon > framework and I've tried setting 'task_launch_timeout' to '3000' and > again, it makes no difference. > > Based on a brief glance of a mesos slave log file it seems the master > instructs the slave to kill the task off after 1 minute. > > Please advise. > > Cheers, > > Jim > > -- > Senior Code Pig > Industrial Light & Magic > > > > > -- > -- > Senior Code Pig > Industrial Light & Magic > > > > -- -- Senior Code Pig Industrial Light & Magic

