[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15734082#comment-15734082 ]

liyunzhang_intel edited comment on SPARK-13955 at 12/9/16 2:40 AM:
---

[~jerryshao]: the following are the detailed steps I used with "spark.yarn.archive":
1. Zip all jars: zip spark-archive.zip $SPARK_HOME/jars/*
2. Upload the zip to HDFS: hadoop fs -copyFromLocal spark-archive.zip hdfs://bdpe42:8020/
3. Modify spark-defaults.conf: spark.yarn.archive=hdfs://bdpe42:8020/spark-archive.zip
4. Run SparkPi in yarn-client mode:
{code}
./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 $spark_example_jar > sparkPi.log 2>&1
{code}
The exception in the container log is:
{code}
Error: Could not find or load main class org.apache.spark.deploy.yarn.ExecutorLauncher
{code}
The Spark version is 2.0.2.

was (Author: kellyzly):
Test SparkPi in yarn-client mode by using "spark.yarn.archive".
[~jerryshao]: the following are the detailed steps I used with "spark.yarn.archive":
1. Zip all jars: zip spark-archive.zip $SPARK_HOME/jars/*
2. Upload the zip to HDFS: hadoop fs -copyFromLocal spark-archive.zip hdfs://bdpe42:8020/
3. Modify spark-defaults.conf: spark.yarn.archive=hdfs://bdpe42:8020/spark-archive.zip
4. Run SparkPi in yarn-client mode:
{code}
./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 $spark_example_jar > sparkPi.log 2>&1
{code}
The exception in the container log is:
{code}
Error: Could not find or load main class org.apache.spark.deploy.yarn.ExecutorLauncher
{code}
The Spark version is 2.0.2.
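One thing worth checking in the steps above is the layout inside spark-archive.zip. Running `zip spark-archive.zip $SPARK_HOME/jars/*` stores each jar under its directory path, while the Running on YARN documentation says the archive given to spark.yarn.archive should contain the jar files in its root directory. Whether this explains the failure reported here is not confirmed in this thread. The following is a minimal sketch, using only the Python standard library, that builds a flat archive and checks that every entry sits at the root. The helper names `build_flat_archive` and `entries_at_root` are hypothetical and are not part of Spark.

```python
import zipfile
from pathlib import Path


def build_flat_archive(jars_dir, archive_path):
    """Zip every *.jar found directly in jars_dir, placing entries at the archive root.

    Hypothetical helper for illustration; returns the entry names it wrote.
    """
    jars = sorted(Path(jars_dir).glob("*.jar"))
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for jar in jars:
            # arcname=jar.name drops the directory part, similar to `zip -j`.
            zf.write(jar, arcname=jar.name)
    return [jar.name for jar in jars]


def entries_at_root(archive_path):
    """Return True if no entry in the archive has a directory component."""
    with zipfile.ZipFile(archive_path) as zf:
        return all("/" not in name for name in zf.namelist())
```

A possible use is to call `entries_at_root("spark-archive.zip")` before uploading the archive to HDFS; a result of False would suggest rebuilding the archive without directory paths.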
> Spark in yarn mode fails
>
> Key: SPARK-13955
> URL: https://issues.apache.org/jira/browse/SPARK-13955
> Project: Spark
> Issue Type: Bug
> Components: YARN
> Affects Versions: 2.0.0
> Reporter: Jeff Zhang
> Assignee: Marcelo Vanzin
> Fix For: 2.0.0
>
> I ran spark-shell in yarn-client mode, but from the logs it seems the Spark assembly jar is not uploaded to HDFS. This may be a known issue in the process of SPARK-11157; I created this ticket to track it. [~vanzin]
> {noformat}
> 16/03/17 17:57:48 INFO Client: Will allocate AM container, with 896 MB memory including 384 MB overhead
> 16/03/17 17:57:48 INFO Client: Setting up container launch context for our AM
> 16/03/17 17:57:48 INFO Client: Setting up the launch environment for our AM container
> 16/03/17 17:57:48 INFO Client: Preparing resources for our AM container
> 16/03/17 17:57:48 WARN Client: Neither spark.yarn.jars nor spark.yarn.archive is set, falling back to uploading libraries under SPARK_HOME.
> 16/03/17 17:57:48 INFO Client: Uploading resource file:/Users/jzhang/github/spark/lib/apache-rat-0.10.jar -> hdfs://localhost:9000/user/jzhang/.sparkStaging/application_1458187008455_0006/apache-rat-0.10.jar
> 16/03/17 17:57:49 INFO Client: Uploading resource file:/Users/jzhang/github/spark/lib/apache-rat-0.11.jar -> hdfs://localhost:9000/user/jzhang/.sparkStaging/application_1458187008455_0006/apache-rat-0.11.jar
> 16/03/17 17:57:49 INFO Client: Uploading resource file:/private/var/folders/dp/hmchg5dd3vbcvds26q91spdwgp/T/spark-abed04bf-6ac2-448b-91a9-dcc1c401a18f/__spark_conf__4163776487351314654.zip -> hdfs://localhost:9000/user/jzhang/.sparkStaging/application_1458187008455_0006/__spark_conf__4163776487351314654.zip
> 16/03/17 17:57:49 INFO SecurityManager: Changing view acls to: jzhang
> 16/03/17 17:57:49 INFO SecurityManager: Changing modify acls to: jzhang
> 16/03/17 17:57:49 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(jzhang); users with modify permissions: Set(jzhang)
> 16/03/17 17:57:49 INFO Client: Submitting application 6 to ResourceManager
> {noformat}
> message in AM container
> {noformat}
> Error: Could not find or load main class org.apache.spark.deploy.yarn.ExecutorLauncher
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
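The observation in the issue description, that the client log shows no Spark jar being uploaded, can be checked mechanically against a raw client log (without the "> " quoting markers). The sketch below is a hypothetical helper, not part of Spark: it extracts the source of every "Uploading resource" line and then applies an assumed heuristic (file name starts with "spark-" and ends with ".jar") to pick out Spark jars.

```python
import re

# Matches "Uploading resource <source> -> <destination>" in the YARN client log.
UPLOAD_RE = re.compile(r"Uploading resource\s+(\S+)\s+->\s+(\S+)")


def uploaded_resources(log_text):
    """Return the source URI of every uploaded resource found in the log text."""
    # Collapse all whitespace so that wrapped log lines still match.
    flat = " ".join(log_text.split())
    return [src for src, _dst in UPLOAD_RE.findall(flat)]


def uploaded_spark_jars(log_text):
    """Return uploaded sources whose file name looks like a Spark jar.

    The "spark-*.jar" naming pattern is an assumption made for this sketch.
    """
    result = []
    for src in uploaded_resources(log_text):
        name = src.rsplit("/", 1)[-1]
        if name.startswith("spark-") and name.endswith(".jar"):
            result.append(src)
    return result
```

Applied to the log in the issue description, `uploaded_resources` would list the two apache-rat jars and the `__spark_conf__` zip, and `uploaded_spark_jars` would return an empty list, which matches the reporter's reading of the log.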
[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15731574#comment-15731574 ]

liyunzhang_intel edited comment on SPARK-13955 at 12/8/16 8:54 AM:
---

[~jerryshao] & [~tzachz]: when I tried "spark.yarn.jars", I found that the following works.
Append the following to conf/spark-defaults.conf. You need to specify the location of every jar in $SPARK_HOME/jars in spark.yarn.jars, like this (I don't paste them all here because the list is too long; the separator between jars is ","):
{code}
spark.yarn.jars=/home/zly/spark-2.0.0-bin-hadoop2-without-hive/jars/activation-1.1.1.jar,
{code}
The [document|http://spark.apache.org/docs/latest/running-on-yarn.html] describes "spark.yarn.jars":
{code}
List of libraries containing Spark code to distribute to YARN containers. By default, Spark on YARN will use Spark jars installed locally, but the Spark jars can also be in a world-readable location on HDFS. This allows YARN to cache it on nodes so that it doesn't need to be distributed each time an application runs. To point to jars on HDFS, for example, set this configuration to hdfs:///some/path. Globs are allowed.
{code}
But when I try to use "spark.yarn.archive" like above, it fails.

was (Author: kellyzly):
[~jerryshao] & [~tzachz]: when I tried "spark.yarn.jars", I found that the following works.
Append the following to conf/spark-defaults.conf. You need to specify the location of every jar in $SPARK_HOME/jars in spark.yarn.jars, like this (I don't paste them all here because the list is too long; the separator between jars is ","):
{code}
spark.yarn.jars=/home/zly/spark-2.0.0-bin-hadoop2-without-hive/jars/activation-1.1.1.jar,
{code}
When I try to use "spark.yarn.archive" like above, it fails.
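Since the comment above lists every jar by hand, the comma-separated value can also be generated. The sketch below is a hypothetical helper (the name `spark_yarn_jars_line` is not part of Spark) that assumes all jars sit directly in one directory and emits a single line suitable for conf/spark-defaults.conf. The documentation quoted above also says that globs are allowed, so a glob over the jars directory may be a shorter alternative; that variant was not tried in this thread.

```python
from pathlib import Path


def spark_yarn_jars_line(jars_dir):
    """Build a 'spark.yarn.jars=...' line listing every *.jar in jars_dir.

    Paths are joined with "," as described in the comment above.
    """
    jars = sorted(str(p) for p in Path(jars_dir).glob("*.jar"))
    return "spark.yarn.jars=" + ",".join(jars)
```

For example, `print(spark_yarn_jars_line("/home/zly/spark-2.0.0-bin-hadoop2-without-hive/jars"))` would print one line that can be appended to spark-defaults.conf.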
[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15199927#comment-15199927 ]

Marcelo Vanzin edited comment on SPARK-13955 at 3/17/16 4:59 PM:
-

How did you build Spark? Did you do "mvn package" or "sbt assembly"?

I see the code is trying to upload "file:/Users/jzhang/github/spark/lib/". Do you have a file called RELEASE in your repo's root? (That triggers a different code path, since that file is only expected to exist in a Spark distribution, not in a Spark source repo.)

was (Author: vanzin):
How did you build Spark? Did you do "mvn package" or "sbt assembly"?

I see the code is trying to uploade "file:/Users/jzhang/github/spark/lib/", do you have a file called RELEASE on your repo's root? (That triggers a different code path, since that file is only expected to exist in a Spark distribution, not in a Spark source repo.)
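The question about the RELEASE file can be answered with a quick check of the directory that SPARK_HOME points to. The sketch below is a hypothetical helper that only tests whether a file named RELEASE exists at the root; it does not reproduce the logic Spark itself uses to choose the code path.

```python
from pathlib import Path


def looks_like_distribution(spark_home):
    """Return True if a file named RELEASE exists at the root of spark_home.

    Per the comment above, that file is only expected in a Spark
    distribution, not in a Spark source repo.
    """
    return (Path(spark_home) / "RELEASE").is_file()
```

Calling `looks_like_distribution("/Users/jzhang/github/spark")` on the reporter's checkout would show whether the distribution code path could have been triggered.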
[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202411#comment-15202411 ]

Jeff Zhang edited comment on SPARK-13955 at 3/19/16 12:28 AM:
--

I can reproduce it with both sbt and mvn. The scenario is same that spark assembly jar is not uploaded to hdfs.
And there's no RELEASE file in my repo

was (Author: zjffdu):
I can reproduce it with both sbt and mvn. The scenario is same that spark assembly jar is not uploaded to hdfs.
[jira] [Comment Edited] (SPARK-13955) Spark in yarn mode fails
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202411#comment-15202411 ]

Jeff Zhang edited comment on SPARK-13955 at 3/19/16 12:29 AM:
--

I can reproduce it with both sbt and mvn. The scenario is the same that spark assembly jar is not uploaded to hdfs.

was (Author: zjffdu):
I can reproduce it with both sbt and mvn. The scenario is same that spark assembly jar is not uploaded to hdfs.
And there's no RELEASE file in my repo