Thomas Graves created SPARK-15405:
-------------------------------------

             Summary: YARN uploading the same __spark_conf__.zip twice
                 Key: SPARK-15405
                 URL: https://issues.apache.org/jira/browse/SPARK-15405
             Project: Spark
          Issue Type: Bug
          Components: YARN
    Affects Versions: 2.0.0
            Reporter: Thomas Graves


I was running 2.0 and noticed we are now uploading what appears to be the same 
__spark_conf__.zip file twice.

This was introduced when we changed how the cache files are handled:
https://github.com/apache/spark/commit/f47dbf27fa034629fab12d0f3c89ab75edb03f86

If they are truly the same we should be able to just use the same zip file:

16/05/19 14:31:22 INFO Client: Uploading resource 
file:/tmp/spark-ad014dac-9682-4d83-af7a-53b16e5d6423/__spark_conf__717768860288979034.zip
 -> 
hdfs://axonitered-nn1.red.ygrid.yahoo.com:8020/user/tgraves/.sparkStaging/application_1463551738094_11599/__spark_conf__717768860288979034.zip




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to