Comments inline.

----- Original Message -----
> From: "CCAAT" <cc...@tampabay.rr.com>
> To: user@mesos.apache.org
> Cc: cc...@tampabay.rr.com
> Sent: Monday, September 15, 2014 5:33:08 PM
> Subject: Re: spark and mesos issue
> 
> Hello Brenden/Vinod,
> 
> Is your installation using "systemd" ?
> 
> Has anyone documented systemd configurations/issues for the various
> Linux distros running mesos/spark?
> 
> What if a cluster runs on a mixture of systems that do and do not use
> systemd? Are there any issues related to systemd and mesos/spark?

Yes, and I'll have patches posted today; I'm still debugging. Basically, you
need the init kickers and to re-parent the cgroup code.

> 
> Has anyone tried using Ftrace/trace-cmd/kernelshark to trace or
> optimize the Linux kernel on machines dedicated to
> mesos/spark?
> 
> Are there any published (kernel) .config files, covering key kernel
> resources, dedicated to optimizing for mesos/spark?
> 
> 
> curiously,
> James
> 
> 
> 
> 
> On 09/15/14 16:13, Brenden Matthews wrote:
> > I started hitting a similar problem, and it seems to be related to
> > memory overhead and tasks getting OOM killed.  I filed a ticket here:
> >
> > https://issues.apache.org/jira/browse/SPARK-3535
> >
> > On Wed, Jul 16, 2014 at 5:27 AM, Ray Rodriguez <rayrod2...@gmail.com
> > <mailto:rayrod2...@gmail.com>> wrote:
> >
> >     I'll set some time aside today to gather and post some logs and
> >     details about this issue from our end.
> >
> >
> >     On Wed, Jul 16, 2014 at 2:05 AM, Vinod Kone <vinodk...@gmail.com
> >     <mailto:vinodk...@gmail.com>> wrote:
> >
> >
> >
> >
> >         On Tue, Jul 15, 2014 at 11:02 PM, Vinod Kone <vi...@twitter.com
> >         <mailto:vi...@twitter.com>> wrote:
> >
> >
> >             On Fri, Jul 4, 2014 at 2:05 AM, Gurvinder Singh
> >             <gurvinder.si...@uninett.no
> >             <mailto:gurvinder.si...@uninett.no>> wrote:
> >
> >                 ERROR storage.BlockManagerMasterActor: Got two different
> >                 block manager
> >                 registrations on 201407031041-1227224054-5050-24004-0
> >
> >                 Googling it suggests that mesos is starting slaves at
> >                 the same time
> >                 and giving them the same id. So maybe a bug in mesos?
> >
> >
> >             Has this issue been resolved? We need more information to
> >             triage this. Maybe some logs that show the lifecycle of the
> >             duplicate instances?
> >
> >
> >             @vinodkone
> >
> >
> >
> >
> 
> 

-- 
Cheers,
Timothy St. Clair
Red Hat Inc.
