[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated SPARK-6869:
-
Priority: Blocker  (was: Minor)

 Add pyspark archives path to PYTHONPATH
 ---

 Key: SPARK-6869
 URL: https://issues.apache.org/jira/browse/SPARK-6869
 Project: Spark
  Issue Type: Bug
  Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Blocker

 From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
 assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
 executors through YARN with --py-files. The pyspark archive name must contain 
 spark-pyspark.
 1st: zip pyspark to spark-pyspark_2.10.zip
 2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
 spark-pyspark_2.10.zip app.py args
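
The two steps above can be sketched in Python as follows. This is a minimal illustration, not the patch attached to the issue: it assumes the pyspark package sits in a directory such as the Spark distribution's python/ folder, and the helper names zip_pyspark and build_submit_command are hypothetical, not Spark APIs. Only the archive name, the naming rule, and the spark-submit flags come from the description.

```python
import os
import zipfile

# Archive name taken from the issue description; it must contain "spark-pyspark".
ARCHIVE_NAME = "spark-pyspark_2.10.zip"


def zip_pyspark(python_dir, archive_path):
    """Step 1 (hypothetical helper): zip the pyspark package under python_dir."""
    package_dir = os.path.join(python_dir, "pyspark")
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for root, _dirs, files in os.walk(package_dir):
            for name in sorted(files):
                full_path = os.path.join(root, name)
                # Store entries relative to python_dir so they start with "pyspark/".
                zf.write(full_path, os.path.relpath(full_path, python_dir))
    return archive_path


def build_submit_command(master, archive_path, app, app_args=()):
    """Step 2 (hypothetical helper): build the spark-submit argument list."""
    if "spark-pyspark" not in os.path.basename(archive_path):
        raise ValueError("archive name must contain 'spark-pyspark'")
    return ["./bin/spark-submit", "--master", master,
            "--py-files", archive_path, app] + list(app_args)
```

The returned list could be passed to subprocess.call from the Spark home directory, with master set to yarn-client or yarn-cluster as in the description.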



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated SPARK-6869:
-
Assignee: Lianhui Wang

 Add pyspark archives path to PYTHONPATH
 ---

 Key: SPARK-6869
 URL: https://issues.apache.org/jira/browse/SPARK-6869
 Project: Spark
  Issue Type: Bug
  Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Assignee: Lianhui Wang
Priority: Blocker
 Fix For: 1.4.0


 From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
 assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
 executors through YARN with --py-files. The pyspark archive name must contain 
 spark-pyspark.
 1st: zip pyspark to spark-pyspark_2.10.zip
 2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
 spark-pyspark_2.10.zip app.py args






[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated SPARK-6869:
-
Target Version/s: 1.4.0

 Add pyspark archives path to PYTHONPATH
 ---

 Key: SPARK-6869
 URL: https://issues.apache.org/jira/browse/SPARK-6869
 Project: Spark
  Issue Type: Bug
  Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Blocker
 Fix For: 1.4.0


 From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
 assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
 executors through YARN with --py-files. The pyspark archive name must contain 
 spark-pyspark.
 1st: zip pyspark to spark-pyspark_2.10.zip
 2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
 spark-pyspark_2.10.zip app.py args






[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-05-08 Thread Thomas Graves (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Graves updated SPARK-6869:
-
Issue Type: Bug  (was: Improvement)

 Add pyspark archives path to PYTHONPATH
 ---

 Key: SPARK-6869
 URL: https://issues.apache.org/jira/browse/SPARK-6869
 Project: Spark
  Issue Type: Bug
  Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Minor

 From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
 assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
 executors through YARN with --py-files. The pyspark archive name must contain 
 spark-pyspark.
 1st: zip pyspark to spark-pyspark_2.10.zip
 2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
 spark-pyspark_2.10.zip app.py args






[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH

2015-04-16 Thread Weizhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Weizhong updated SPARK-6869:

Description: 
From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
executors through YARN with --py-files. The pyspark archive name must contain 
spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
spark-pyspark_2.10.zip app.py args

  was: From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work 
when the assembly jar is packaged with JDK 1.7+, so pass the PYTHONPATH (set in 
spark-env.sh) to the executors so that the executor Python process can read the 
pyspark files from the local file system rather than from the assembly jar.

Summary: Add pyspark archives path to PYTHONPATH  (was: Pass PYTHONPATH 
to executor, so that executor can read pyspark file from local file system on 
executor node)

 Add pyspark archives path to PYTHONPATH
 ---

 Key: SPARK-6869
 URL: https://issues.apache.org/jira/browse/SPARK-6869
 Project: Spark
  Issue Type: Improvement
  Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Minor

 From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the 
 assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the 
 executors through YARN with --py-files. The pyspark archive name must contain 
 spark-pyspark.
 1st: zip pyspark to spark-pyspark_2.10.zip
 2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files 
 spark-pyspark_2.10.zip app.py args


