Spark UI port

2017-09-10 Thread Sunil Kalyanpur
Hello all,

I am running PySpark Job (v2.0.2) with checkpoint enabled in Mesos cluster
and am using Marathon for orchestration.

When the job is restarted using Marathon, Spark UI is not getting started
at the port specified by Marathon. Instead, it is picking port from the
checkpoint.

Is there a way we can make spark job to use the port assigned by Marathon
instead of Spark job picking the configuration from the Checkpoint?

-- 
Thanks,
Sunil


Re: Spark UI port issue when deploying Spark driver on YARN in yarn-cluster mode on EMR

2014-12-23 Thread Tomer Benyamini
On YARN, spark does not manage the cluster, but YARN does. Usually the
cluster manager UI is under http://:9026/cluster. I believe
that it chooses the port for the spark driver UI randomly, but an easy way
of accessing it is by clicking on the "Application Master" link under the
"Tracking UI" column in the cluster manager UI. Note that by default on
EMR, this link will use the amazon internal ip, so you need to set up a vpn
to view these kind of links from your browser.

Tomer

On Tue, Dec 23, 2014 at 6:04 PM, Roberto Coluccio <
roberto.coluc...@gmail.com> wrote:

> Hello folks,
>
> I'm trying to deploy a Spark driver on Amazon EMR in yarn-cluster mode
> expecting to be able to access the Spark UI from the :4040
> address (default port). The problem here is that the Spark UI port is
> always defined randomly at runtime, although I also tried to specify it in
> the spark-defaults.conf file: in order to do so, I used this:
> https://github.com/awslabs/emr-bootstrap-actions/tree/master/spark#3-utilize-an-emr-step-to-configure-the-spark-default-configuration-optional
> , setting the spark.ui.port to a static known value. No luck, every time I
> launch (using the spark-submit script from the yarn-master node) a Spark
> driver, the UI port is chose randomly.
>
> Is there any configurations I'm missing out here?
>
> Thank you very much.
>
> Roberto
>


Spark UI port issue when deploying Spark driver on YARN in yarn-cluster mode on EMR

2014-12-23 Thread Roberto Coluccio
Hello folks,

I'm trying to deploy a Spark driver on Amazon EMR in yarn-cluster mode
expecting to be able to access the Spark UI from the :4040
address (default port). The problem here is that the Spark UI port is
always defined randomly at runtime, although I also tried to specify it in
the spark-defaults.conf file: in order to do so, I used this:
https://github.com/awslabs/emr-bootstrap-actions/tree/master/spark#3-utilize-an-emr-step-to-configure-the-spark-default-configuration-optional
, setting the spark.ui.port to a static known value. No luck, every time I
launch (using the spark-submit script from the yarn-master node) a Spark
driver, the UI port is chose randomly.

Is there any configurations I'm missing out here?

Thank you very much.

Roberto