[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2015-02-17 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/spark/pull/2729


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2015-02-06 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-73221480
  
It sounds like this is a wont-fix, given the discussion. Do you mind 
closing this PR?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-20 Thread tgravescs
Github user tgravescs commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-59756273
  
Yes, as @mridulm pointed out. This should not be settable by the users on 
yarn.  It should automatically use the yarn approved directories. We have logic 
in there for setting the java.io.tmpdir in ClientBase.  If this is added we 
would need to do something similar and not let the user override it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread srowen
Github user srowen commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-58473598
  
`Utils.getLocalDir` already does basically this. I agree that files should 
not have to go to `/tmp` since this is rarely a good place for lots of stuff on 
servers. But I do wonder whether this local dir is in fact the standard and 
right place for all of these things rather than another temp dir setting. 
Distributions already correctly configure where `Utils.getLocalDir` looks so 
this is going to be much better if possible.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread srowen
Github user srowen commented on a diff in the pull request:

https://github.com/apache/spark/pull/2729#discussion_r18630676
  
--- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala ---
@@ -233,7 +233,11 @@ private[spark] object Utils extends Logging {
   }
 
   /** Create a temporary directory inside the given parent directory */
-  def createTempDir(root: String = System.getProperty(java.io.tmpdir)): 
File = {
+  def createTempDir(): File = {
+val root: String = SparkEnv.get.conf.contains(spark.tmp.dir) match {
--- End diff --

PS this can all be one line if you set the default value to `... 
.get(spark.tmp.dir, System.getProperty(java.io.tmpdir))`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread mridulm
Github user mridulm commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-58473957
  
At least for yarn, this will create issues if overridden from default.
Not sure about mesos.

Why not use std java property and define it for local and standalone mode 
where relevant.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread kelepi
Github user kelepi commented on a diff in the pull request:

https://github.com/apache/spark/pull/2729#discussion_r18630974
  
--- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala ---
@@ -233,7 +233,11 @@ private[spark] object Utils extends Logging {
   }
 
   /** Create a temporary directory inside the given parent directory */
-  def createTempDir(root: String = System.getProperty(java.io.tmpdir)): 
File = {
+  def createTempDir(): File = {
+val root: String = SparkEnv.get.conf.contains(spark.tmp.dir) match {
--- End diff --

Thanks @srowen for the comment. That's more clear!
Currently I am running Spark 1.1 in Standalone mode. And I have set the 
'SPARK_LOCAL_DIRS' to data disk,  which will store the broadcast Files in the 
target directory. However, the dependencies for executors will be fetched and 
stored in the /tmp/ directory.
And the dependencies will not be removed.
Like the snappy dependency, for example:
snappy-1.0.5.3-f4880c9f-95d9-4ab6-b1c8-8686d0b88f42-libsnappyjava.so
So the /tmp/ directory will grow large.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread kelepi
Github user kelepi commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-58474600
  
@mridulm Using std java property is fine. 
Just add a more specific configuration argument.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: [SPARK-3875] Add TEMP DIRECTORY configuration

2014-10-09 Thread mridulm
Github user mridulm commented on the pull request:

https://github.com/apache/spark/pull/2729#issuecomment-58479810
  
There is a java property which controls this ... java.io.tmpdir
On 09-Oct-2014 1:22 pm, 刘钰帆 notificati...@github.com wrote:

 @mridulm https://github.com/mridulm Using std java property is fine.
 Just add a more specific configuration argument.

 —
 Reply to this email directly or view it on GitHub
 https://github.com/apache/spark/pull/2729#issuecomment-58474600.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org