Patrick Liu created SPARK-3875: ---------------------------------- Summary: Add TEMP DIRECTORY configuration Key: SPARK-3875 URL: https://issues.apache.org/jira/browse/SPARK-3875 Project: Spark Issue Type: Improvement Components: Spark Core Affects Versions: 1.1.0 Reporter: Patrick Liu
Currently, the Spark uses "java.io.tmpdir" to find the /tmp/ directory. Then, the /tmp/ directory is used to 1. Setup the HTTP File Server 2. Broadcast directory 3. Fetch Dependency files or jars by Executors The size of the /tmp/ directory will keep growing. The free space of the system disk will be less. I think we could add a configuration "spark.tmp.dir" in conf/spark-env.sh or conf/spark-defaults.conf to set this particular directory. Let's say, set the directory to a data disk. If "spark.tmp.dir" is not set, use the default "java.io.tmpdir" -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org