[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated SPARK-6869:

Priority: Blocker (was: Minor)

Add pyspark archives path to PYTHONPATH
---

Key: SPARK-6869
URL: https://issues.apache.org/jira/browse/SPARK-6869
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Blocker

From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
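The two steps in the description above can be sketched in Python. This is only an illustration, not the patch attached to the issue: the helper names `zip_pyspark` and `build_submit_command` are hypothetical, and the sketch assumes that the `pyspark` package sits directly under the directory passed as `python_dir`.

```python
import os
import zipfile


def zip_pyspark(python_dir, archive_path):
    """Step 1 (hypothetical helper): zip the pyspark package under python_dir."""
    package_dir = os.path.join(python_dir, "pyspark")
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for root, _dirs, files in os.walk(package_dir):
            for name in files:
                full = os.path.join(root, name)
                # Store paths relative to python_dir so that the archive
                # root contains the top-level pyspark/ directory.
                zf.write(full, os.path.relpath(full, python_dir))
    return archive_path


def build_submit_command(master, archive_path, app, app_args=()):
    """Step 2 (hypothetical helper): build the spark-submit argument list."""
    return ["./bin/spark-submit", "--master", master,
            "--py-files", archive_path, app, *app_args]
```

The returned list can be passed to `subprocess.call` from the Spark installation directory, with `master` set to either `yarn-client` or `yarn-cluster`.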
[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated SPARK-6869:

Assignee: Lianhui Wang

Add pyspark archives path to PYTHONPATH
---

Key: SPARK-6869
URL: https://issues.apache.org/jira/browse/SPARK-6869
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Assignee: Lianhui Wang
Priority: Blocker
Fix For: 1.4.0

From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args
[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated SPARK-6869:

Target Version/s: 1.4.0

Add pyspark archives path to PYTHONPATH
---

Key: SPARK-6869
URL: https://issues.apache.org/jira/browse/SPARK-6869
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Blocker
Fix For: 1.4.0

From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args
[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thomas Graves updated SPARK-6869:

Issue Type: Bug (was: Improvement)

Add pyspark archives path to PYTHONPATH
---

Key: SPARK-6869
URL: https://issues.apache.org/jira/browse/SPARK-6869
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Minor

From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args
[jira] [Updated] (SPARK-6869) Add pyspark archives path to PYTHONPATH
[ https://issues.apache.org/jira/browse/SPARK-6869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Weizhong updated SPARK-6869:

Description:
From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args

was:
From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so pass the PYTHONPATH (set in spark-env.sh) to the executor so that the executor's Python process can read the pyspark files from the local file system rather than from the assembly jar.

Summary: Add pyspark archives path to PYTHONPATH (was: Pass PYTHONPATH to executor, so that executor can read pyspark file from local file system on executor node)

Add pyspark archives path to PYTHONPATH
---

Key: SPARK-6869
URL: https://issues.apache.org/jira/browse/SPARK-6869
Project: Spark
Issue Type: Improvement
Components: PySpark
Affects Versions: 1.0.0
Reporter: Weizhong
Priority: Minor

From SPARK-1920 and SPARK-1520 we know that PySpark on YARN cannot work when the assembly jar is packaged with JDK 1.7+, so ship the pyspark archives to the executors through YARN with --py-files. The pyspark archive name must contain spark-pyspark.

1st: zip pyspark to spark-pyspark_2.10.zip
2nd: ./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args
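The naming rule in the description (the archive name must contain spark-pyspark) suggests how such archives could be picked out of a --py-files value and placed on PYTHONPATH. The following is a minimal sketch of that idea only, not the code from the actual fix: the function names and the `marker` parameter are hypothetical, and it assumes --py-files is a comma-separated list of paths.

```python
import os


def pyspark_archives(py_files, marker="spark-pyspark"):
    """Return the --py-files entries whose file name contains marker."""
    return [p for p in py_files.split(",")
            if p and marker in os.path.basename(p)]


def prepend_to_pythonpath(archives, env):
    """Return a copy of env with the archives placed first on PYTHONPATH."""
    existing = env.get("PYTHONPATH", "")
    parts = list(archives) + ([existing] if existing else [])
    updated = dict(env)
    updated["PYTHONPATH"] = os.pathsep.join(parts)
    return updated
```

Python can import packages directly from a zip file listed on PYTHONPATH, which is why placing the archive path there would be enough for the executor's Python process to find the pyspark package.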