Spark UI port
Hello all, I am running PySpark Job (v2.0.2) with checkpoint enabled in Mesos cluster and am using Marathon for orchestration. When the job is restarted using Marathon, Spark UI is not getting started at the port specified by Marathon. Instead, it is picking port from the checkpoint. Is there a way we can make spark job to use the port assigned by Marathon instead of Spark job picking the configuration from the Checkpoint? -- Thanks, Sunil
Re: Spark UI port issue when deploying Spark driver on YARN in yarn-cluster mode on EMR
On YARN, spark does not manage the cluster, but YARN does. Usually the cluster manager UI is under http://:9026/cluster. I believe that it chooses the port for the spark driver UI randomly, but an easy way of accessing it is by clicking on the "Application Master" link under the "Tracking UI" column in the cluster manager UI. Note that by default on EMR, this link will use the amazon internal ip, so you need to set up a vpn to view these kind of links from your browser. Tomer On Tue, Dec 23, 2014 at 6:04 PM, Roberto Coluccio < roberto.coluc...@gmail.com> wrote: > Hello folks, > > I'm trying to deploy a Spark driver on Amazon EMR in yarn-cluster mode > expecting to be able to access the Spark UI from the :4040 > address (default port). The problem here is that the Spark UI port is > always defined randomly at runtime, although I also tried to specify it in > the spark-defaults.conf file: in order to do so, I used this: > https://github.com/awslabs/emr-bootstrap-actions/tree/master/spark#3-utilize-an-emr-step-to-configure-the-spark-default-configuration-optional > , setting the spark.ui.port to a static known value. No luck, every time I > launch (using the spark-submit script from the yarn-master node) a Spark > driver, the UI port is chose randomly. > > Is there any configurations I'm missing out here? > > Thank you very much. > > Roberto >
Spark UI port issue when deploying Spark driver on YARN in yarn-cluster mode on EMR
Hello folks, I'm trying to deploy a Spark driver on Amazon EMR in yarn-cluster mode expecting to be able to access the Spark UI from the :4040 address (default port). The problem here is that the Spark UI port is always defined randomly at runtime, although I also tried to specify it in the spark-defaults.conf file: in order to do so, I used this: https://github.com/awslabs/emr-bootstrap-actions/tree/master/spark#3-utilize-an-emr-step-to-configure-the-spark-default-configuration-optional , setting the spark.ui.port to a static known value. No luck, every time I launch (using the spark-submit script from the yarn-master node) a Spark driver, the UI port is chose randomly. Is there any configurations I'm missing out here? Thank you very much. Roberto