Hello, Sorry for the delay. The issue you're running into is because most HBase classes are in the system class path, while jars added with "--jars" are only visible to the application class loader created by Spark. So classes in the system class path cannot see them.
You can work around this by setting "--driver-classpath /opt/.../htrace-core-3.1.0-incubating.jar" and "--conf spark.executor.extraClassPath= /opt/.../htrace-core-3.1.0-incubating.jar" in your spark-submit command line. (You can also add those configs to your spark-defaults.conf to avoid having to type them all the time; and don't forget to include any other jars that might be needed.) On Mon, May 18, 2015 at 11:14 PM, Fengyun RAO <raofeng...@gmail.com> wrote: > Thanks, Marcelo! > > > Below is the full log, > > > SLF4J: Class path contains multiple SLF4J bindings. > SLF4J: Found binding in > [jar:file:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/jars/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/jars/avro-tools-1.7.6-cdh5.4.0.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an > explanation. > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] > 15/05/19 14:08:58 INFO yarn.ApplicationMaster: Registered signal handlers for > [TERM, HUP, INT] > 15/05/19 14:08:59 INFO yarn.ApplicationMaster: ApplicationAttemptId: > appattempt_1432015548391_0003_000001 > 15/05/19 14:09:00 INFO spark.SecurityManager: Changing view acls to: > nobody,raofengyun > 15/05/19 14:09:00 INFO spark.SecurityManager: Changing modify acls to: > nobody,raofengyun > 15/05/19 14:09:00 INFO spark.SecurityManager: SecurityManager: authentication > disabled; ui acls disabled; users with view permissions: Set(nobody, > raofengyun); users with modify permissions: Set(nobody, raofengyun) > 15/05/19 14:09:00 INFO yarn.ApplicationMaster: Starting the user application > in a separate Thread > 15/05/19 14:09:00 INFO yarn.ApplicationMaster: Waiting for spark context > initialization > 15/05/19 14:09:00 INFO yarn.ApplicationMaster: Waiting for spark context > initialization ... > 15/05/19 14:09:00 INFO spark.SparkContext: Running Spark version 1.3.0 > 15/05/19 14:09:00 INFO spark.SecurityManager: Changing view acls to: > nobody,raofengyun > 15/05/19 14:09:00 INFO spark.SecurityManager: Changing modify acls to: > nobody,raofengyun > 15/05/19 14:09:00 INFO spark.SecurityManager: SecurityManager: authentication > disabled; ui acls disabled; users with view permissions: Set(nobody, > raofengyun); users with modify permissions: Set(nobody, raofengyun) > 15/05/19 14:09:01 INFO slf4j.Slf4jLogger: Slf4jLogger started > 15/05/19 14:09:01 INFO Remoting: Starting remoting > 15/05/19 14:09:01 INFO Remoting: Remoting started; listening on addresses > :[akka.tcp://sparkDriver@gs-server-v-127:7191] > 15/05/19 14:09:01 INFO Remoting: Remoting now listens on addresses: > [akka.tcp://sparkDriver@gs-server-v-127:7191] > 15/05/19 14:09:01 INFO util.Utils: Successfully started service 'sparkDriver' > on port 7191. > 15/05/19 14:09:01 INFO spark.SparkEnv: Registering MapOutputTracker > 15/05/19 14:09:01 INFO spark.SparkEnv: Registering BlockManagerMaster > 15/05/19 14:09:01 INFO storage.DiskBlockManager: Created local directory at > /data1/cdh/yarn/nm/usercache/raofengyun/appcache/application_1432015548391_0003/blockmgr-3250910b-693e-46ff-b057-26d552fd8abd > 15/05/19 14:09:01 INFO storage.MemoryStore: MemoryStore started with capacity > 259.7 MB > 15/05/19 14:09:01 INFO spark.HttpFileServer: HTTP File server directory is > /data1/cdh/yarn/nm/usercache/raofengyun/appcache/application_1432015548391_0003/httpd-5bc614bc-d8b1-473d-a807-4d9252eb679d > 15/05/19 14:09:01 INFO spark.HttpServer: Starting HTTP Server > 15/05/19 14:09:01 INFO server.Server: jetty-8.y.z-SNAPSHOT > 15/05/19 14:09:01 INFO server.AbstractConnector: Started > SocketConnector@0.0.0.0:9349 > 15/05/19 14:09:01 INFO util.Utils: Successfully started service 'HTTP file > server' on port 9349. > 15/05/19 14:09:01 INFO spark.SparkEnv: Registering OutputCommitCoordinator > 15/05/19 14:09:01 INFO ui.JettyUtils: Adding filter: > org.apache.hadoop.yarn.server.webproxy.amfilter.AmIpFilter > 15/05/19 14:09:01 INFO server.Server: jetty-8.y.z-SNAPSHOT > 15/05/19 14:09:01 INFO server.AbstractConnector: Started > SelectChannelConnector@0.0.0.0:63023 > 15/05/19 14:09:01 INFO util.Utils: Successfully started service 'SparkUI' on > port 63023. > 15/05/19 14:09:01 INFO ui.SparkUI: Started SparkUI at > http://gs-server-v-127:63023 > 15/05/19 14:09:02 INFO cluster.YarnClusterScheduler: Created > YarnClusterScheduler > 15/05/19 14:09:02 INFO netty.NettyBlockTransferService: Server created on > 33526 > 15/05/19 14:09:02 INFO storage.BlockManagerMaster: Trying to register > BlockManager > 15/05/19 14:09:02 INFO storage.BlockManagerMasterActor: Registering block > manager gs-server-v-127:33526 with 259.7 MB RAM, BlockManagerId(<driver>, > gs-server-v-127, 33526) > 15/05/19 14:09:02 INFO storage.BlockManagerMaster: Registered BlockManager > 15/05/19 14:09:02 INFO scheduler.EventLoggingListener: Logging events to > hdfs://gs-server-v-127:8020/user/spark/applicationHistory/application_1432015548391_0003 > 15/05/19 14:09:02 INFO yarn.ApplicationMaster: Listen to driver: > akka.tcp://sparkDriver@gs-server-v-127:7191/user/YarnScheduler > 15/05/19 14:09:02 INFO cluster.YarnClusterSchedulerBackend: ApplicationMaster > registered as Actor[akka://sparkDriver/user/YarnAM#1902752386] > 15/05/19 14:09:02 INFO client.RMProxy: Connecting to ResourceManager at > gs-server-v-127/10.200.200.56:8030 > 15/05/19 14:09:02 INFO yarn.YarnRMClient: Registering the ApplicationMaster > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Will request 2 executor > containers, each with 1 cores and 4480 MB memory including 384 MB overhead > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Container request (host: Any, > capability: <memory:4480, vCores:1>) > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Container request (host: Any, > capability: <memory:4480, vCores:1>) > 15/05/19 14:09:03 INFO yarn.ApplicationMaster: Started progress reporter > thread - sleep time : 5000 > 15/05/19 14:09:03 INFO impl.AMRMClientImpl: Received new token for : > gs-server-v-127:8041 > 15/05/19 14:09:03 INFO impl.AMRMClientImpl: Received new token for : > gs-server-v-129:8041 > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Launching container > container_1432015548391_0003_01_000002 for on host gs-server-v-127 > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Launching ExecutorRunnable. > driverUrl: > akka.tcp://sparkDriver@gs-server-v-127:7191/user/CoarseGrainedScheduler, > executorHostname: gs-server-v-127 > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Launching container > container_1432015548391_0003_01_000003 for on host gs-server-v-129 > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Starting Executor Container > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Launching ExecutorRunnable. > driverUrl: > akka.tcp://sparkDriver@gs-server-v-127:7191/user/CoarseGrainedScheduler, > executorHostname: gs-server-v-129 > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Starting Executor Container > 15/05/19 14:09:03 INFO yarn.YarnAllocator: Received 2 containers from YARN, > launching executors on 2 of them. > 15/05/19 14:09:03 INFO impl.ContainerManagementProtocolProxy: > yarn.client.max-cached-nodemanagers-proxies : 0 > 15/05/19 14:09:03 INFO impl.ContainerManagementProtocolProxy: > yarn.client.max-cached-nodemanagers-proxies : 0 > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up > ContainerLaunchContext > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up > ContainerLaunchContext > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Preparing Local resources > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Preparing Local resources > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Prepared Local resources > Map(__app__.jar -> resource { scheme: "hdfs" host: "gs-server-v-127" port: > 8020 file: > "/user/raofengyun/.sparkStaging/application_1432015548391_0003/spark-wd-etl-1.0-jar-with-dependencies.jar" > } size: 10759465 timestamp: 1432015733920 type: FILE visibility: PRIVATE, > htrace-core-3.1.0-incubating.jar -> resource { scheme: "hdfs" host: > "gs-server-v-127" port: 8020 file: > "/user/raofengyun/.sparkStaging/application_1432015548391_0003/htrace-core-3.1.0-incubating.jar" > } size: 1475955 timestamp: 1432015734434 type: FILE visibility: PRIVATE) > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Prepared Local resources > Map(__app__.jar -> resource { scheme: "hdfs" host: "gs-server-v-127" port: > 8020 file: > "/user/raofengyun/.sparkStaging/application_1432015548391_0003/spark-wd-etl-1.0-jar-with-dependencies.jar" > } size: 10759465 timestamp: 1432015733920 type: FILE visibility: PRIVATE, > htrace-core-3.1.0-incubating.jar -> resource { scheme: "hdfs" host: > "gs-server-v-127" port: 8020 file: > "/user/raofengyun/.sparkStaging/application_1432015548391_0003/htrace-core-3.1.0-incubating.jar" > } size: 1475955 timestamp: 1432015734434 type: FILE visibility: PRIVATE) > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up executor with > environment: Map(CLASSPATH -> > {{PWD}}<CPS>/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/assembly/lib/spark-assembly-1.3.0-cdh5.4.0-hadoop2.6.0-cdh5.4.0.jar<CPS>$HADOOP_CLIENT_CONF_DIR<CPS>$HADOOP_CONF_DIR<CPS>$HADOOP_COMMON_HOME/*<CPS>$HADOOP_COMMON_HOME/lib/*<CPS>$HADOOP_HDFS_HOME/*<CPS>$HADOOP_HDFS_HOME/lib/*<CPS>$HADOOP_YARN_HOME/*<CPS>$HADOOP_YARN_HOME/lib/*<CPS>$HADOOP_MAPRED_HOME/*<CPS>$HADOOP_MAPRED_HOME/lib/*<CPS>$MR2_CLASSPATH<CPS>/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/bin/../lib/hadoop/client/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/conf/yarn-conf:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/./:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/.//*:/usr/lib/hadoop-mapreduce//.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hive/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/flume-ng/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../parquet/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../avro/*:/opt/cloudera/parcels/GPLEXTRAS-5.2.0-1.cdh5.2.0.p0.20/lib/hadoop/lib/*, > SPARK_LOG_URL_STDERR -> > http://gs-server-v-127:8042/node/containerlogs/container_1432015548391_0003_01_000002/raofengyun/stderr?start=0, > SPARK_DIST_CLASSPATH -> > /opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/bin/../lib/hadoop/client/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/conf/yarn-conf:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/./:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/.//*:/usr/lib/hadoop-mapreduce//.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hive/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/flume-ng/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../parquet/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../avro/*:/opt/cloudera/parcels/GPLEXTRAS-5.2.0-1.cdh5.2.0.p0.20/lib/hadoop/lib/*, > SPARK_YARN_STAGING_DIR -> .sparkStaging/application_1432015548391_0003, > SPARK_YARN_CACHE_FILES_FILE_SIZES -> 10759465,1475955, SPARK_USER -> > raofengyun, SPARK_YARN_CACHE_FILES_VISIBILITIES -> PRIVATE,PRIVATE, > SPARK_YARN_MODE -> true, SPARK_YARN_CACHE_FILES_TIME_STAMPS -> > 1432015733920,1432015734434, SPARK_LOG_URL_STDOUT -> > http://gs-server-v-127:8042/node/containerlogs/container_1432015548391_0003_01_000002/raofengyun/stdout?start=0, > SPARK_YARN_CACHE_FILES -> > hdfs://gs-server-v-127:8020/user/raofengyun/.sparkStaging/application_1432015548391_0003/spark-wd-etl-1.0-jar-with-dependencies.jar#__app__.jar,hdfs://gs-server-v-127:8020/user/raofengyun/.sparkStaging/application_1432015548391_0003/htrace-core-3.1.0-incubating.jar#htrace-core-3.1.0-incubating.jar) > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up executor with > environment: Map(CLASSPATH -> > {{PWD}}<CPS>/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/assembly/lib/spark-assembly-1.3.0-cdh5.4.0-hadoop2.6.0-cdh5.4.0.jar<CPS>$HADOOP_CLIENT_CONF_DIR<CPS>$HADOOP_CONF_DIR<CPS>$HADOOP_COMMON_HOME/*<CPS>$HADOOP_COMMON_HOME/lib/*<CPS>$HADOOP_HDFS_HOME/*<CPS>$HADOOP_HDFS_HOME/lib/*<CPS>$HADOOP_YARN_HOME/*<CPS>$HADOOP_YARN_HOME/lib/*<CPS>$HADOOP_MAPRED_HOME/*<CPS>$HADOOP_MAPRED_HOME/lib/*<CPS>$MR2_CLASSPATH<CPS>/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/bin/../lib/hadoop/client/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/conf/yarn-conf:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/./:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/.//*:/usr/lib/hadoop-mapreduce//.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hive/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/flume-ng/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../parquet/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../avro/*:/opt/cloudera/parcels/GPLEXTRAS-5.2.0-1.cdh5.2.0.p0.20/lib/hadoop/lib/*, > SPARK_LOG_URL_STDERR -> > http://gs-server-v-129:8042/node/containerlogs/container_1432015548391_0003_01_000003/raofengyun/stderr?start=0, > SPARK_DIST_CLASSPATH -> > /opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/bin/../lib/hadoop/client/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/spark/conf/yarn-conf:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/./:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-hdfs/.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/libexec/../../hadoop-yarn/.//*:/usr/lib/hadoop-mapreduce//.//*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hive/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/flume-ng/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../parquet/lib/*:/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/../avro/*:/opt/cloudera/parcels/GPLEXTRAS-5.2.0-1.cdh5.2.0.p0.20/lib/hadoop/lib/*, > SPARK_YARN_STAGING_DIR -> .sparkStaging/application_1432015548391_0003, > SPARK_YARN_CACHE_FILES_FILE_SIZES -> 10759465,1475955, SPARK_USER -> > raofengyun, SPARK_YARN_CACHE_FILES_VISIBILITIES -> PRIVATE,PRIVATE, > SPARK_YARN_MODE -> true, SPARK_YARN_CACHE_FILES_TIME_STAMPS -> > 1432015733920,1432015734434, SPARK_LOG_URL_STDOUT -> > http://gs-server-v-129:8042/node/containerlogs/container_1432015548391_0003_01_000003/raofengyun/stdout?start=0, > SPARK_YARN_CACHE_FILES -> > hdfs://gs-server-v-127:8020/user/raofengyun/.sparkStaging/application_1432015548391_0003/spark-wd-etl-1.0-jar-with-dependencies.jar#__app__.jar,hdfs://gs-server-v-127:8020/user/raofengyun/.sparkStaging/application_1432015548391_0003/htrace-core-3.1.0-incubating.jar#htrace-core-3.1.0-incubating.jar) > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up executor with > commands: > List(LD_LIBRARY_PATH="/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/lib/native:$LD_LIBRARY_PATH", > {{JAVA_HOME}}/bin/java, -server, -XX:OnOutOfMemoryError='kill %p', > -Xms4096m, -Xmx4096m, -Djava.io.tmpdir={{PWD}}/tmp, > '-Dspark.shuffle.service.port=7337', '-Dspark.driver.port=7191', > '-Dspark.ui.port=0', -Dspark.yarn.app.container.log.dir=<LOG_DIR>, > org.apache.spark.executor.CoarseGrainedExecutorBackend, --driver-url, > akka.tcp://sparkDriver@gs-server-v-127:7191/user/CoarseGrainedScheduler, > --executor-id, 2, --hostname, gs-server-v-129, --cores, 1, --app-id, > application_1432015548391_0003, --user-class-path, file:$PWD/__app__.jar, > --user-class-path, file:$PWD/htrace-core-3.1.0-incubating.jar, 1>, > <LOG_DIR>/stdout, 2>, <LOG_DIR>/stderr) > 15/05/19 14:09:03 INFO yarn.ExecutorRunnable: Setting up executor with > commands: > List(LD_LIBRARY_PATH="/opt/cloudera/parcels/CDH-5.4.0-1.cdh5.4.0.p0.27/lib/hadoop/lib/native:$LD_LIBRARY_PATH", > {{JAVA_HOME}}/bin/java, -server, -XX:OnOutOfMemoryError='kill %p', > -Xms4096m, -Xmx4096m, -Djava.io.tmpdir={{PWD}}/tmp, > '-Dspark.shuffle.service.port=7337', '-Dspark.driver.port=7191', > '-Dspark.ui.port=0', -Dspark.yarn.app.container.log.dir=<LOG_DIR>, > org.apache.spark.executor.CoarseGrainedExecutorBackend, --driver-url, > akka.tcp://sparkDriver@gs-server-v-127:7191/user/CoarseGrainedScheduler, > --executor-id, 1, --hostname, gs-server-v-127, --cores, 1, --app-id, > application_1432015548391_0003, --user-class-path, file:$PWD/__app__.jar, > --user-class-path, file:$PWD/htrace-core-3.1.0-incubating.jar, 1>, > <LOG_DIR>/stdout, 2>, <LOG_DIR>/stderr) > 15/05/19 14:09:03 INFO impl.ContainerManagementProtocolProxy: Opening proxy : > gs-server-v-127:8041 > 15/05/19 14:09:03 INFO impl.ContainerManagementProtocolProxy: Opening proxy : > gs-server-v-129:8041 > 15/05/19 14:09:07 INFO cluster.YarnClusterSchedulerBackend: Registered > executor: > Actor[akka.tcp://sparkExecutor@gs-server-v-127:22773/user/Executor#-351658265] > with ID 1 > 15/05/19 14:09:07 INFO storage.BlockManagerMasterActor: Registering block > manager gs-server-v-127:40594 with 2.1 GB RAM, BlockManagerId(1, > gs-server-v-127, 40594) > 15/05/19 14:09:09 INFO cluster.YarnClusterSchedulerBackend: Registered > executor: > Actor[akka.tcp://sparkExecutor@gs-server-v-129:44560/user/Executor#-89679559] > with ID 2 > 15/05/19 14:09:09 INFO cluster.YarnClusterSchedulerBackend: SchedulerBackend > is ready for scheduling beginning after reached minRegisteredResourcesRatio: > 0.8 > 15/05/19 14:09:09 INFO cluster.YarnClusterScheduler: > YarnClusterScheduler.postStartHook done > 15/05/19 14:09:09 INFO storage.BlockManagerMasterActor: Registering block > manager gs-server-v-129:2745 with 2.1 GB RAM, BlockManagerId(2, > gs-server-v-129, 2745) > 15/05/19 14:09:09 INFO storage.MemoryStore: ensureFreeSpace(285833) called > with curMem=0, maxMem=272357130 > 15/05/19 14:09:09 INFO storage.MemoryStore: Block broadcast_0 stored as > values in memory (estimated size 279.1 KB, free 259.5 MB) > 15/05/19 14:09:10 INFO storage.MemoryStore: ensureFreeSpace(22334) called > with curMem=285833, maxMem=272357130 > 15/05/19 14:09:10 INFO storage.MemoryStore: Block broadcast_0_piece0 stored > as bytes in memory (estimated size 21.8 KB, free 259.4 MB) > 15/05/19 14:09:10 INFO storage.BlockManagerInfo: Added broadcast_0_piece0 in > memory on gs-server-v-127:33526 (size: 21.8 KB, free: 259.7 MB) > 15/05/19 14:09:10 INFO storage.BlockManagerMaster: Updated info of block > broadcast_0_piece0 > 15/05/19 14:09:10 INFO spark.SparkContext: Created broadcast 0 from > newAPIHadoopRDD at WdEtl.scala:56 > 15/05/19 14:09:10 INFO spark.SparkContext: Starting job: foreach at > WdEtl.scala:74 > 15/05/19 14:09:10 INFO input.FileInputFormat: Total input paths to process : 1 > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Registering RDD 1 (flatMap at > WdEtl.scala:62) > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Got job 0 (foreach at > WdEtl.scala:74) with 4 output partitions (allowLocal=false) > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Final stage: Stage 1(foreach > at WdEtl.scala:74) > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Parents of final stage: > List(Stage 0) > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Missing parents: List(Stage 0) > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Submitting Stage 0 > (MapPartitionsRDD[1] at flatMap at WdEtl.scala:62), which has no missing > parents > 15/05/19 14:09:10 INFO storage.MemoryStore: ensureFreeSpace(3928) called with > curMem=308167, maxMem=272357130 > 15/05/19 14:09:10 INFO storage.MemoryStore: Block broadcast_1 stored as > values in memory (estimated size 3.8 KB, free 259.4 MB) > 15/05/19 14:09:10 INFO storage.MemoryStore: ensureFreeSpace(2212) called with > curMem=312095, maxMem=272357130 > 15/05/19 14:09:10 INFO storage.MemoryStore: Block broadcast_1_piece0 stored > as bytes in memory (estimated size 2.2 KB, free 259.4 MB) > 15/05/19 14:09:10 INFO storage.BlockManagerInfo: Added broadcast_1_piece0 in > memory on gs-server-v-127:33526 (size: 2.2 KB, free: 259.7 MB) > 15/05/19 14:09:10 INFO storage.BlockManagerMaster: Updated info of block > broadcast_1_piece0 > 15/05/19 14:09:10 INFO spark.SparkContext: Created broadcast 1 from broadcast > at DAGScheduler.scala:839 > 15/05/19 14:09:10 INFO scheduler.DAGScheduler: Submitting 1 missing tasks > from Stage 0 (MapPartitionsRDD[1] at flatMap at WdEtl.scala:62) > 15/05/19 14:09:10 INFO cluster.YarnClusterScheduler: Adding task set 0.0 with > 1 tasks > 15/05/19 14:09:10 INFO scheduler.TaskSetManager: Starting task 0.0 in stage > 0.0 (TID 0, gs-server-v-127, NODE_LOCAL, 1356 bytes) > 15/05/19 14:09:11 INFO storage.BlockManagerInfo: Added broadcast_1_piece0 in > memory on gs-server-v-127:40594 (size: 2.2 KB, free: 2.1 GB) > 15/05/19 14:09:12 INFO storage.BlockManagerInfo: Added broadcast_0_piece0 in > memory on gs-server-v-127:40594 (size: 21.8 KB, free: 2.1 GB) > 15/05/19 14:10:38 INFO scheduler.TaskSetManager: Finished task 0.0 in stage > 0.0 (TID 0) in 87219 ms on gs-server-v-127 (1/1) > 15/05/19 14:10:38 INFO cluster.YarnClusterScheduler: Removed TaskSet 0.0, > whose tasks have all completed, from pool > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: Stage 0 (flatMap at > WdEtl.scala:62) finished in 87.274 s > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: looking for newly runnable > stages > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: running: Set() > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: waiting: Set(Stage 1) > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: failed: Set() > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: Missing parents for Stage 1: > List() > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: Submitting Stage 1 > (MapPartitionsRDD[3] at mapPartitionsWithIndex at WdEtl.scala:64), which is > now runnable > 15/05/19 14:10:38 INFO storage.MemoryStore: ensureFreeSpace(4728) called with > curMem=314307, maxMem=272357130 > 15/05/19 14:10:38 INFO storage.MemoryStore: Block broadcast_2 stored as > values in memory (estimated size 4.6 KB, free 259.4 MB) > 15/05/19 14:10:38 INFO storage.MemoryStore: ensureFreeSpace(2594) called with > curMem=319035, maxMem=272357130 > 15/05/19 14:10:38 INFO storage.MemoryStore: Block broadcast_2_piece0 stored > as bytes in memory (estimated size 2.5 KB, free 259.4 MB) > 15/05/19 14:10:38 INFO storage.BlockManagerInfo: Added broadcast_2_piece0 in > memory on gs-server-v-127:33526 (size: 2.5 KB, free: 259.7 MB) > 15/05/19 14:10:38 INFO storage.BlockManagerMaster: Updated info of block > broadcast_2_piece0 > 15/05/19 14:10:38 INFO spark.SparkContext: Created broadcast 2 from broadcast > at DAGScheduler.scala:839 > 15/05/19 14:10:38 INFO scheduler.DAGScheduler: Submitting 4 missing tasks > from Stage 1 (MapPartitionsRDD[3] at mapPartitionsWithIndex at WdEtl.scala:64) > 15/05/19 14:10:38 INFO cluster.YarnClusterScheduler: Adding task set 1.0 with > 4 tasks > 15/05/19 14:10:38 INFO scheduler.TaskSetManager: Starting task 0.0 in stage > 1.0 (TID 1, gs-server-v-129, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:38 INFO scheduler.TaskSetManager: Starting task 1.0 in stage > 1.0 (TID 2, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:38 INFO storage.BlockManagerInfo: Added broadcast_2_piece0 in > memory on gs-server-v-127:40594 (size: 2.5 KB, free: 2.1 GB) > 15/05/19 14:10:38 INFO spark.MapOutputTrackerMasterActor: Asked to send map > output locations for shuffle 0 to sparkExecutor@gs-server-v-127:22773 > 15/05/19 14:10:38 INFO spark.MapOutputTrackerMaster: Size of output statuses > for shuffle 0 is 148 bytes > 15/05/19 14:10:38 INFO storage.BlockManagerInfo: Added broadcast_2_piece0 in > memory on gs-server-v-129:2745 (size: 2.5 KB, free: 2.1 GB) > 15/05/19 14:10:38 INFO spark.MapOutputTrackerMasterActor: Asked to send map > output locations for shuffle 0 to sparkExecutor@gs-server-v-129:44560 > 15/05/19 14:10:40 INFO scheduler.TaskSetManager: Starting task 2.0 in stage > 1.0 (TID 3, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:40 WARN scheduler.TaskSetManager: Lost task 1.0 in stage 1.0 > (TID 2, gs-server-v-127): java.io.IOException: > java.lang.reflect.InvocationTargetException > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:240) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:218) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:119) > at com.gridsum.spark.wd.SessionHandler.<init>(SessionHandler.scala:59) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:65) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:64) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:35) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:277) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:244) > at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61) > at org.apache.spark.scheduler.Task.run(Task.scala:64) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > at java.lang.reflect.Constructor.newInstance(Constructor.java:526) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:238) > ... 16 more > Caused by: java.lang.NoClassDefFoundError: org/apache/htrace/Trace > at > org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:218) > at org.apache.hadoop.hbase.zookeeper.ZKUtil.checkExists(ZKUtil.java:481) > at > org.apache.hadoop.hbase.zookeeper.ZKClusterId.readClusterIdZNode(ZKClusterId.java:65) > at > org.apache.hadoop.hbase.client.ZooKeeperRegistry.getClusterId(ZooKeeperRegistry.java:86) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.retrieveClusterId(ConnectionManager.java:850) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.<init>(ConnectionManager.java:635) > ... 21 more > Caused by: java.lang.ClassNotFoundException: org.apache.htrace.Trace > at java.net.URLClassLoader$1.run(URLClassLoader.java:366) > at java.net.URLClassLoader$1.run(URLClassLoader.java:355) > at java.security.AccessController.doPrivileged(Native Method) > at java.net.URLClassLoader.findClass(URLClassLoader.java:354) > at java.lang.ClassLoader.loadClass(ClassLoader.java:425) > at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308) > at java.lang.ClassLoader.loadClass(ClassLoader.java:358) > ... 27 more > > 15/05/19 14:10:41 INFO scheduler.TaskSetManager: Starting task 1.1 in stage > 1.0 (TID 4, gs-server-v-129, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:41 INFO scheduler.TaskSetManager: Lost task 0.0 in stage 1.0 > (TID 1) on executor gs-server-v-129: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 1] > 15/05/19 14:10:42 INFO scheduler.TaskSetManager: Starting task 0.1 in stage > 1.0 (TID 5, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:42 INFO scheduler.TaskSetManager: Lost task 2.0 in stage 1.0 > (TID 3) on executor gs-server-v-127: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 2] > 15/05/19 14:10:43 INFO scheduler.TaskSetManager: Starting task 2.1 in stage > 1.0 (TID 6, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:43 INFO scheduler.TaskSetManager: Lost task 0.1 in stage 1.0 > (TID 5) on executor gs-server-v-127: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 3] > 15/05/19 14:10:43 INFO scheduler.TaskSetManager: Starting task 0.2 in stage > 1.0 (TID 7, gs-server-v-129, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:43 INFO scheduler.TaskSetManager: Lost task 1.1 in stage 1.0 > (TID 4) on executor gs-server-v-129: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 4] > 15/05/19 14:10:44 INFO scheduler.TaskSetManager: Starting task 1.2 in stage > 1.0 (TID 8, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:44 INFO scheduler.TaskSetManager: Lost task 2.1 in stage 1.0 > (TID 6) on executor gs-server-v-127: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 5] > 15/05/19 14:10:45 INFO scheduler.TaskSetManager: Starting task 2.2 in stage > 1.0 (TID 9, gs-server-v-129, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:45 INFO scheduler.TaskSetManager: Lost task 0.2 in stage 1.0 > (TID 7) on executor gs-server-v-129: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 6] > 15/05/19 14:10:46 INFO scheduler.TaskSetManager: Starting task 0.3 in stage > 1.0 (TID 10, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:46 INFO scheduler.TaskSetManager: Lost task 1.2 in stage 1.0 > (TID 8) on executor gs-server-v-127: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 7] > 15/05/19 14:10:46 INFO scheduler.TaskSetManager: Starting task 1.3 in stage > 1.0 (TID 11, gs-server-v-129, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:46 INFO scheduler.TaskSetManager: Lost task 2.2 in stage 1.0 > (TID 9) on executor gs-server-v-129: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 8] > 15/05/19 14:10:47 INFO scheduler.TaskSetManager: Starting task 2.3 in stage > 1.0 (TID 12, gs-server-v-127, PROCESS_LOCAL, 1056 bytes) > 15/05/19 14:10:47 INFO scheduler.TaskSetManager: Lost task 0.3 in stage 1.0 > (TID 10) on executor gs-server-v-127: java.io.IOException > (java.lang.reflect.InvocationTargetException) [duplicate 9] > 15/05/19 14:10:47 ERROR scheduler.TaskSetManager: Task 0 in stage 1.0 failed > 4 times; aborting job > 15/05/19 14:10:47 INFO cluster.YarnClusterScheduler: Cancelling stage 1 > 15/05/19 14:10:47 INFO cluster.YarnClusterScheduler: Stage 1 was cancelled > 15/05/19 14:10:47 INFO scheduler.DAGScheduler: Job 0 failed: foreach at > WdEtl.scala:74, took 96.765394 s > 15/05/19 14:10:47 ERROR yarn.ApplicationMaster: User class threw exception: > Job aborted due to stage failure: Task 0 in stage 1.0 failed 4 times, most > recent failure: Lost task 0.3 in stage 1.0 (TID 10, gs-server-v-127): > java.io.IOException: java.lang.reflect.InvocationTargetException > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:240) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:218) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:119) > at com.gridsum.spark.wd.SessionHandler.<init>(SessionHandler.scala:59) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:65) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:64) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:35) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:277) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:244) > at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61) > at org.apache.spark.scheduler.Task.run(Task.scala:64) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > at java.lang.reflect.Constructor.newInstance(Constructor.java:526) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:238) > ... 16 more > Caused by: java.lang.NoClassDefFoundError: org/apache/htrace/Trace > at > org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:218) > at org.apache.hadoop.hbase.zookeeper.ZKUtil.checkExists(ZKUtil.java:481) > at > org.apache.hadoop.hbase.zookeeper.ZKClusterId.readClusterIdZNode(ZKClusterId.java:65) > at > org.apache.hadoop.hbase.client.ZooKeeperRegistry.getClusterId(ZooKeeperRegistry.java:86) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.retrieveClusterId(ConnectionManager.java:850) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.<init>(ConnectionManager.java:635) > ... 21 more > > Driver stacktrace: > org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in > stage 1.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1.0 > (TID 10, gs-server-v-127): java.io.IOException: > java.lang.reflect.InvocationTargetException > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:240) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:218) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:119) > at com.gridsum.spark.wd.SessionHandler.<init>(SessionHandler.scala:59) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:65) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:64) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:35) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:277) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:244) > at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61) > at org.apache.spark.scheduler.Task.run(Task.scala:64) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > at java.lang.reflect.Constructor.newInstance(Constructor.java:526) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:238) > ... 16 more > Caused by: java.lang.NoClassDefFoundError: org/apache/htrace/Trace > at > org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:218) > at org.apache.hadoop.hbase.zookeeper.ZKUtil.checkExists(ZKUtil.java:481) > at > org.apache.hadoop.hbase.zookeeper.ZKClusterId.readClusterIdZNode(ZKClusterId.java:65) > at > org.apache.hadoop.hbase.client.ZooKeeperRegistry.getClusterId(ZooKeeperRegistry.java:86) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.retrieveClusterId(ConnectionManager.java:850) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.<init>(ConnectionManager.java:635) > ... 21 more > > Driver stacktrace: > at > org.apache.spark.scheduler.DAGScheduler.org$apache$spark$scheduler$DAGScheduler$$failJobAndIndependentStages(DAGScheduler.scala:1203) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1192) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$abortStage$1.apply(DAGScheduler.scala:1191) > at > scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59) > at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:47) > at > org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:1191) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693) > at > org.apache.spark.scheduler.DAGScheduler$$anonfun$handleTaskSetFailed$1.apply(DAGScheduler.scala:693) > at scala.Option.foreach(Option.scala:236) > at > org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:693) > at > org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1393) > at > org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:1354) > at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:48) > 15/05/19 14:10:47 INFO yarn.ApplicationMaster: Final app status: FAILED, > exitCode: 15, (reason: User class threw exception: Job aborted due to stage > failure: Task 0 in stage 1.0 failed 4 times, most recent failure: Lost task > 0.3 in stage 1.0 (TID 10, gs-server-v-127): java.io.IOException: > java.lang.reflect.InvocationTargetException > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:240) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:218) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:119) > at com.gridsum.spark.wd.SessionHandler.<init>(SessionHandler.scala:59) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:65) > at com.gridsum.spark.wd.WdEtl$$anonfun$main$3.apply(WdEtl.scala:64) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at org.apache.spark.rdd.RDD$$anonfun$15.apply(RDD.scala:647) > at > org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:35) > at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:277) > at org.apache.spark.rdd.RDD.iterator(RDD.scala:244) > at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:61) > at org.apache.spark.scheduler.Task.run(Task.scala:64) > at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:203) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) > at java.lang.Thread.run(Thread.java:745) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) > at > sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:57) > at > sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) > at java.lang.reflect.Constructor.newInstance(Constructor.java:526) > at > org.apache.hadoop.hbase.client.ConnectionFactory.createConnection(ConnectionFactory.java:238) > ... 16 more > Caused by: java.lang.NoClassDefFoundError: org/apache/htrace/Trace > at > org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:218) > at org.apache.hadoop.hbase.zookeeper.ZKUtil.checkExists(ZKUtil.java:481) > at > org.apache.hadoop.hbase.zookeeper.ZKClusterId.readClusterIdZNode(ZKClusterId.java:65) > at > org.apache.hadoop.hbase.client.ZooKeeperRegistry.getClusterId(ZooKeeperRegistry.java:86) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.retrieveClusterId(ConnectionManager.java:850) > at > org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.<init>(ConnectionManager.java:635) > ... 21 more > > Driver stacktrace:) > 15/05/19 14:10:47 INFO yarn.ApplicationMaster: Invoking sc stop from shutdown > hook > 15/05/19 14:10:47 WARN scheduler.TaskSetManager: Lost task 2.3 in stage 1.0 > (TID 12, gs-server-v-127): TaskKilled (killed intentionally) > 15/05/19 14:10:47 INFO cluster.YarnClusterScheduler: Removed TaskSet 1.0, > whose tasks have all completed, from pool > 15/05/19 14:10:47 WARN scheduler.TaskSetManager: Lost task 1.3 in stage 1.0 > (TID 11, gs-server-v-129): TaskKilled (killed intentionally) > 15/05/19 14:10:47 INFO cluster.YarnClusterScheduler: Removed TaskSet 1.0, > whose tasks have all completed, from pool > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/metrics/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/stage/kill,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/static,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/executors/threadDump/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/executors/threadDump,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/executors/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/executors,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/environment/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/environment,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/storage/rdd/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/storage/rdd,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/storage/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/storage,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/pool/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/pool,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/stage/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/stage,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/stages,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/jobs/job/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/jobs/job,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/jobs/json,null} > 15/05/19 14:10:47 INFO handler.ContextHandler: stopped > o.s.j.s.ServletContextHandler{/jobs,null} > 15/05/19 14:10:47 INFO ui.SparkUI: Stopped Spark web UI at > http://gs-server-v-127:63023 > 15/05/19 14:10:47 INFO scheduler.DAGScheduler: Stopping DAGScheduler > 15/05/19 14:10:47 INFO cluster.YarnClusterSchedulerBackend: Shutting down all > executors > 15/05/19 14:10:47 INFO cluster.YarnClusterSchedulerBackend: Asking each > executor to shut down > 15/05/19 14:10:47 INFO > scheduler.OutputCommitCoordinator$OutputCommitCoordinatorActor: > OutputCommitCoordinator stopped! > 15/05/19 14:10:47 INFO spark.MapOutputTrackerMasterActor: > MapOutputTrackerActor stopped! > 15/05/19 14:10:47 INFO storage.MemoryStore: MemoryStore cleared > 15/05/19 14:10:47 INFO storage.BlockManager: BlockManager stopped > 15/05/19 14:10:47 INFO storage.BlockManagerMaster: BlockManagerMaster stopped > 15/05/19 14:10:47 INFO remote.RemoteActorRefProvider$RemotingTerminator: > Shutting down remote daemon. > 15/05/19 14:10:47 INFO remote.RemoteActorRefProvider$RemotingTerminator: > Remote daemon shut down; proceeding with flushing remote transports. > 15/05/19 14:10:47 INFO spark.SparkContext: Successfully stopped SparkContext > > > 2015-05-19 1:12 GMT+08:00 Marcelo Vanzin <van...@cloudera.com>: > >> On Sun, May 17, 2015 at 3:53 PM, Wilfred Spiegelenburg < >> wspiegelenb...@cloudera.com> wrote: >> >>> When you run the driver in the cluster the application really runs from >>> the cluster and the client goes away. If the driver does not have access to >>> the jars, i.e. if they are not on the cluster available somewhere, this >>> will happen. >>> >> >> That's not true. Files specified in "--jars" and "--files" are uploaded >> to the cluster before the app starts (unless they have the "local:" >> prefix). The visible effect on the configuration is that these files will >> show up in "spark.yarn.secondary.jars" as Fengyun mentioned in one of >> his messages. >> >> Fengyun, woule you mind sharing more than just a partial stack trace? >> e.g., the full driver logs would help in figuring out what's going on with >> that file. >> >> -- >> Marcelo >> >> -- >> >> --- >> You received this message because you are subscribed to the Google Groups >> "CDH Users" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to cdh-user+unsubscr...@cloudera.org. >> For more options, visit https://groups.google.com/a/cloudera.org/d/optout >> . >> > > -- Marcelo