Yes you can create an issue, or actually contribute a patch to update it :)
Sorry the docs are a bit light, I'm going to make them more complete along the way.

Tim

On Fri, Sep 11, 2015 at 11:11 AM, Tom Waterhouse (tomwater) <tomwa...@cisco.com> wrote:

> Tim,
>
> Thank you for the explanation. You are correct, my Mesos experience is
> very light, and I haven’t deployed anything via Marathon yet. What you
> have stated here makes sense, and I will look into doing this.
>
> Adding this info to the docs would be great. Is the appropriate action to
> create an issue regarding improvement of the docs? For those of us who are
> gaining the experience, having such a pointer is very helpful.
>
> Tom
>
> From: Tim Chen <t...@mesosphere.io>
> Date: Thursday, September 10, 2015 at 10:25 AM
> To: Tom Waterhouse <tomwa...@cisco.com>
> Cc: "user@spark.apache.org" <user@spark.apache.org>
> Subject: Re: Spark on Mesos with Jobs in Cluster Mode Documentation
>
> Hi Tom,
>
> Sorry the documentation isn't really rich, since it probably assumes
> users understand how Mesos and frameworks work.
>
> First I need to explain the rationale for creating the dispatcher. If
> you're not familiar with Mesos yet, each node in your datacenter has a
> Mesos slave installed, which is responsible for publishing resources and
> running/watching tasks, and the Mesos master is responsible for taking the
> aggregated resources and scheduling them among frameworks.
>
> Frameworks are not managed by Mesos: the Mesos master and slaves don't
> launch or maintain frameworks, but assume they are launched and kept
> running on their own. The existing frameworks in the ecosystem therefore
> all have their own ways to deploy, provide HA, and persist state (e.g.
> Aurora, Marathon, etc.).
>
> Therefore, to introduce cluster mode with Mesos, we must create a
> long-running framework that runs in your datacenter and can handle
> launching Spark drivers on demand, handle HA, etc. This is what the
> dispatcher is all about.
>
> So the idea is that you should launch the dispatcher not on the client,
> but on a machine in your datacenter. In Mesosphere's DCOS we launch all
> frameworks and long-running services with Marathon, and you can use
> Marathon to launch the Spark dispatcher.
>
> Then all clients, instead of specifying the Mesos master URL (e.g.
> mesos://mesos.master:2181), just talk to the dispatcher
> (mesos://spark-dispatcher.mesos:7077), and the dispatcher will then start
> and watch the driver for you.
>
> Tim
>
> On Thu, Sep 10, 2015 at 10:13 AM, Tom Waterhouse (tomwater) <tomwa...@cisco.com> wrote:
>
>> After spending most of yesterday scouring the Internet for sources of
>> documentation for submitting Spark jobs in cluster mode to a Spark cluster
>> managed by Mesos, I was able to do just that, but I am not convinced that
>> how I have things set up is correct.
>>
>> I used the Mesos published
>> <https://open.mesosphere.com/getting-started/datacenter/install/>
>> instructions for setting up my Mesos cluster. I have three Zookeeper
>> instances, three Mesos master instances, and three Mesos slave instances.
>> This is all running in Openstack.
>>
>> The documentation on the Spark documentation site states that “To use
>> cluster mode, you must start the MesosClusterDispatcher in your cluster via
>> the sbin/start-mesos-dispatcher.sh script, passing in the Mesos master
>> url (e.g: mesos://host:5050).” That is it, no more information than
>> that. So that is what I did: I have one machine that I use as the Spark
>> client for submitting jobs. I started the Mesos dispatcher with the script
>> as described, and submitted the job using the client machine’s IP address
>> and port as the target.
>>
>> The job is currently running in Mesos as expected. This is not, however,
>> how I would have expected to configure the system. As it stands, there is
>> one instance of the Spark Mesos dispatcher running outside of Mesos, so it
>> is not part of the sphere of Mesos resource management.
>>
>> I used the following Stack Overflow posts as guidelines:
>> http://stackoverflow.com/questions/31164725/spark-mesos-dispatcher
>> http://stackoverflow.com/questions/31294515/start-spark-via-mesos
>>
>> There must be better documentation on how to deploy Spark in Mesos with
>> jobs able to be deployed in cluster mode.
>>
>> I can follow up with more specific information regarding my deployment
>> if necessary.
>>
>> Tom
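
For readers following the thread, the setup discussed here (a dispatcher running inside the cluster under Marathon, with clients submitting to the dispatcher rather than to the Mesos master) might look roughly like the Python sketch below. It only builds and prints a Marathon app definition and a spark-submit command line; it does not run anything. Several details are assumptions and not from the thread: running the dispatcher in the foreground through bin/spark-class (on the guess that Marathon wants a foreground process rather than the daemonizing sbin script), the Marathon app id and resource sizes, the relative paths to the Spark scripts, and the example class name and jar URL. The two mesos:// addresses are the examples quoted in the thread.

```python
import json
import shlex

# Example addresses quoted in the thread; replace them for a real cluster.
MESOS_MASTER = "mesos://host:5050"
DISPATCHER_URL = "mesos://spark-dispatcher.mesos:7077"


def dispatcher_command(mesos_master):
    """Build a command that runs the dispatcher in the foreground.

    Assumption: the dispatcher class is started through bin/spark-class
    instead of sbin/start-mesos-dispatcher.sh, so that a supervisor such
    as Marathon can watch the process directly.
    """
    return [
        "bin/spark-class",
        "org.apache.spark.deploy.mesos.MesosClusterDispatcher",
        "--master", mesos_master,
    ]


def marathon_app(mesos_master):
    """Build a Marathon app definition for the dispatcher.

    The id and the cpus/mem values are hypothetical placeholders.
    """
    cmd = " ".join(shlex.quote(arg) for arg in dispatcher_command(mesos_master))
    return {
        "id": "/spark-dispatcher",
        "cmd": cmd,
        "cpus": 0.5,
        "mem": 1024,
        "instances": 1,
    }


def submit_command(dispatcher_url, main_class, app_jar):
    """Build a spark-submit command that targets the dispatcher.

    In cluster mode the driver is started inside the cluster, so app_jar
    is assumed to be a location the Mesos slaves can fetch.
    """
    return [
        "bin/spark-submit",
        "--deploy-mode", "cluster",
        "--master", dispatcher_url,
        "--class", main_class,
        app_jar,
    ]


if __name__ == "__main__":
    # JSON body that could be posted to Marathon to launch the dispatcher.
    print(json.dumps(marathon_app(MESOS_MASTER), indent=2))
    # Command a client would run; class name and jar URL are hypothetical.
    print(" ".join(submit_command(
        DISPATCHER_URL, "com.example.MyApp", "http://example.com/my-app.jar")))
```

The point of the split is the one made in the thread: only the dispatcher needs to know the Mesos master URL, while clients only need the dispatcher's address.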