KaiXu created SPARK-27289:
-----------------------------

             Summary: spark-submit explicit configuration does not take effect 
but Spark UI shows it's effective
                 Key: SPARK-27289
                 URL: https://issues.apache.org/jira/browse/SPARK-27289
             Project: Spark
          Issue Type: Bug
          Components: Deploy, Documentation, Spark Submit, Web UI
    Affects Versions: 2.3.3
            Reporter: KaiXu


The [doc 
|https://spark.apache.org/docs/latest/submitting-applications.html]says that  
"In general, configuration values explicitly set on a {{SparkConf}} take the 
highest precedence, then flags passed to {{spark-submit}}, then values in the 
defaults file", but when setting spark.local.dir through --conf with 
spark-submit, it still uses the values from 
${SPARK_HOME}/conf/spark-defaults.conf, what's more, the Spark runtime UI 
environment variables shows the value from --conf, which is really misleading.

e.g.

I set submit my application through the command:

/opt/spark233/bin/spark-submit --properties-file /opt/spark.conf --conf 
spark.local.dir=/tmp/spark_local -v --class 
org.apache.spark.examples.mllib.SparseNaiveBayes --master 
spark://bdw-slave20:7077 
/opt/sparkbench/assembly/target/sparkbench-assembly-7.1-SNAPSHOT-dist.jar 
hdfs://bdw-slave20:8020/Bayes/Input

 

the spark.local.dir in ${SPARK_HOME}/conf/spark-defaults.conf is:

spark.local.dir=/mnt/nvme1/spark_local

when the application is running, I found the intermediate shuffle data was 
wrote to /mnt/nvme1/spark_local, which is set through 
${SPARK_HOME}/conf/spark-defaults.conf, but the Web UI shows that the 
environment value spark.local.dir=/tmp/spark_local.

The spark-submit verbose also shows spark.local.dir=/tmp/spark_local, it's 
misleading. 

 

!image-2019-03-27-10-59-38-377.png!

spark-submit verbose:

XXXX

Spark properties used, including those specified through
 --conf and those from the properties file /opt/spark.conf:
 (spark.local.dir,/tmp/spark_local)
 (spark.default.parallelism,132)
 (spark.driver.memory,10g)
 (spark.executor.memory,352g)

XXXXX



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to