Alejandro Fernandez created AMBARI-9954:
-------------------------------------------

             Summary: Spark on Tez apps fail: tez.tar.gz needs to be copied to HDFS
                 Key: AMBARI-9954
                 URL: https://issues.apache.org/jira/browse/AMBARI-9954
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server
    Affects Versions: 2.0.0
            Reporter: Alejandro Fernandez
            Assignee: Alejandro Fernandez
             Fix For: 2.0.0


Spark on Tez apps fail because tez.tar.gz needs to be copied to HDFS.
Currently, only the Pig Service Check and Hive START copy it to HDFS.

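Until the Spark service scripts provision the tarball the way the Pig Service Check and Hive START code paths do, the file can be staged by hand. A minimal workaround sketch, assuming the tarball ships at the standard HDP client location (the local path /usr/hdp/2.2.2.0-2538/tez/lib/tez.tar.gz is an assumption) and that tez.lib.uris points at the HDFS path shown in the log below; the ownership and permissions mirror what the existing Hive/Pig provisioning is expected to set and are also an assumption:

{noformat}
# Run as the HDFS superuser. The local source path is an assumption based on the
# usual HDP 2.2.2.0-2538 layout; the target matches the tez.lib.uris value from the log.
sudo -u hdfs hdfs dfs -mkdir -p /hdp/apps/2.2.2.0-2538/tez
sudo -u hdfs hdfs dfs -put /usr/hdp/2.2.2.0-2538/tez/lib/tez.tar.gz \
    /hdp/apps/2.2.2.0-2538/tez/tez.tar.gz
sudo -u hdfs hdfs dfs -chown -R hdfs:hadoop /hdp/apps/2.2.2.0-2538/tez
sudo -u hdfs hdfs dfs -chmod 444 /hdp/apps/2.2.2.0-2538/tez/tez.tar.gz
{noformat}

The failing spark-submit run: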
{noformat}
$ /usr/hdp/current/spark-client/bin/spark-submit \
    --class org.apache.spark.examples.SparkPi \
    --master execution-context:org.apache.spark.tez.TezJobExecutionContext \
    /usr/hdp/current/spark-client/lib/spark-examples-1.2.1.2.2.2.0-2538-hadoop2.6.0.2.2.2.0-2538.jar \
    3

tput: No value for $TERM and no -T specified
Spark assembly has been built with Hive, including Datanucleus jars on classpath
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:file:/grid/0/hdp/2.2.2.0-2538/spark/lib/spark-examples-1.2.1.2.2.2.0-2538-hadoop2.6.0.2.2.2.0-2538.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in 
[jar:file:/grid/0/hdp/2.2.2.0-2538/spark/external/spark-native-yarn/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
15/03/04 09:27:53 INFO spark.SecurityManager: Changing view acls to: hrt_qa
15/03/04 09:27:53 INFO spark.SecurityManager: Changing modify acls to: hrt_qa
15/03/04 09:27:53 INFO spark.SecurityManager: SecurityManager: authentication 
disabled; ui acls disabled; users with view permissions: Set(hrt_qa); users 
with modify permissions: Set(hrt_qa)
15/03/04 09:27:54 INFO slf4j.Slf4jLogger: Slf4jLogger started
15/03/04 09:27:54 INFO Remoting: Starting remoting
15/03/04 09:27:54 INFO Remoting: Remoting started; listening on addresses 
:[akka.tcp://sparkDriver@ip-172-31-47-166.ec2.internal:34628]
15/03/04 09:27:54 INFO util.Utils: Successfully started service 'sparkDriver' 
on port 34628.
15/03/04 09:27:54 INFO spark.SparkEnv: Registering MapOutputTracker
15/03/04 09:27:54 INFO spark.SparkEnv: Registering BlockManagerMaster
15/03/04 09:27:54 INFO storage.DiskBlockManager: Created local directory at 
/tmp/spark-c3fe89f7-4117-41fc-8a62-01f0451c9060/spark-a209539b-07ae-42ad-a83d-9ad53b1c6adc
15/03/04 09:27:54 INFO storage.MemoryStore: MemoryStore started with capacity 
265.4 MB
15/03/04 09:27:55 WARN util.NativeCodeLoader: Unable to load native-hadoop 
library for your platform... using builtin-java classes where applicable
15/03/04 09:27:55 INFO spark.HttpFileServer: HTTP File server directory is 
/tmp/spark-995afa1c-3d9a-453b-a84c-b02b73aab8d7/spark-57d211b1-32a5-453c-b2d0-5f010b4cf74a
15/03/04 09:27:55 INFO spark.HttpServer: Starting HTTP Server
15/03/04 09:27:55 INFO server.Server: jetty-8.y.z-SNAPSHOT
15/03/04 09:27:55 INFO server.AbstractConnector: Started 
SocketConnector@0.0.0.0:52966
15/03/04 09:27:55 INFO util.Utils: Successfully started service 'HTTP file 
server' on port 52966.
15/03/04 09:27:55 INFO server.Server: jetty-8.y.z-SNAPSHOT
15/03/04 09:27:55 INFO server.AbstractConnector: Started 
SelectChannelConnector@0.0.0.0:4040
15/03/04 09:27:55 INFO util.Utils: Successfully started service 'SparkUI' on 
port 4040.
15/03/04 09:27:55 INFO ui.SparkUI: Started SparkUI at 
http://ip-172-31-47-166.ec2.internal:4040
15/03/04 09:27:55 INFO spark.SparkContext: Will use custom job execution 
context org.apache.spark.tez.TezJobExecutionContext
15/03/04 09:27:56 INFO tez.TezJobExecutionContext: Config dir: /etc/hadoop/conf
15/03/04 09:27:56 INFO tez.TezJobExecutionContext: FileSystem: 
hdfs://ip-172-31-47-165.ec2.internal:8020
15/03/04 09:27:56 INFO tez.TezJobExecutionContext: Error while accessing 
configuration. Possible cause - 'version missmatch'
org.apache.hadoop.conf.Configuration is loaded from 
file:/grid/0/hdp/2.2.2.0-2538/spark/external/spark-native-yarn/lib/hadoop-common-2.6.0.2.2.2.0-2538.jar
org.apache.tez.dag.api.TezConfiguration is loaded from 
file:/grid/0/hdp/2.2.2.0-2538/spark/external/spark-native-yarn/lib/tez-api-0.5.2.2.2.2.0-2538.jar
15/03/04 09:27:57 WARN shortcircuit.DomainSocketFactory: The short-circuit 
local reads feature cannot be used because libhadoop cannot be loaded.
15/03/04 09:27:57 INFO netty.NettyBlockTransferService: Server created on 58099
15/03/04 09:27:57 INFO storage.BlockManagerMaster: Trying to register 
BlockManager
15/03/04 09:27:57 INFO storage.BlockManagerMasterActor: Registering block 
manager ip-172-31-47-166.ec2.internal:58099 with 265.4 MB RAM, 
BlockManagerId(<driver>, ip-172-31-47-166.ec2.internal, 58099)
15/03/04 09:27:57 INFO storage.BlockManagerMaster: Registered BlockManager
15/03/04 09:27:58 INFO tez.TezDelegate: Job: Spark Pi will be submitted to the 
following YARN cluster:
15/03/04 09:27:58 INFO tez.TezDelegate: Default FS Address: 
hdfs://ip-172-31-47-165.ec2.internal:8020
15/03/04 09:27:58 INFO tez.TezDelegate: RM Host Name: 
ip-172-31-47-165.ec2.internal
15/03/04 09:27:58 INFO tez.TezDelegate: RM Address: 
ip-172-31-47-165.ec2.internal:8050
15/03/04 09:27:58 INFO tez.TezDelegate: RM Scheduler Address: 
ip-172-31-47-165.ec2.internal:8030
15/03/04 09:27:58 INFO tez.TezDelegate: RM Resource Tracker Address: null
15/03/04 09:27:58 INFO tez.TezDelegate: Application classpath dir is: 
hdfs://ip-172-31-47-165.ec2.internal:8020/user/hrt_qa/Spark Pi/app-classpath
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-api-0.5.2.2.2.2.0-2538.jar since Tez libraries are already provisioned
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-common-0.5.2.2.2.2.0-2538.jar since Tez libraries are already provisioned
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-dag-0.5.2.2.2.2.0-2538.jar since Tez libraries are already provisioned
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-mapreduce-0.5.2.2.2.2.0-2538.jar since Tez libraries are already provisioned
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-runtime-internals-0.5.2.2.2.2.0-2538.jar since Tez libraries are already 
provisioned
15/03/04 09:28:08 INFO utils.HadoopUtils: Skipping provisioning of 
tez-runtime-library-0.5.2.2.2.2.0-2538.jar since Tez libraries are already 
provisioned
15/03/04 09:28:09 INFO client.TezClient: Tez Client Version: [ 
component=tez-api, version=0.5.2.2.2.2.0-2538, 
revision=2d3c6b639d5b1048bd20aad5736823a35edd2485, 
SCM-URL=scm:git:https://git-wip-us.apache.org/repos/asf/tez.git, 
buildTIme=20150304-0248 ]
15/03/04 09:28:09 INFO impl.TimelineClientImpl: Timeline service address: 
http://ip-172-31-47-165.ec2.internal:8188/ws/v1/timeline/
15/03/04 09:28:10 INFO client.RMProxy: Connecting to ResourceManager at 
ip-172-31-47-165.ec2.internal/172.31.47.165:8050
15/03/04 09:28:10 INFO tez.Utils: STAGE Result: Stage 0 vertex: 0
15/03/04 09:28:10 INFO tez.Utils: DAG: {0=(stage: 0; vertex:0; input:[])}
15/03/04 09:28:10 INFO tez.DAGBuilder: Submitting generated DAG to YARN/Tez 
cluster
15/03/04 09:28:10 INFO client.TezClient: Submitting DAG application with id: 
application_1425459626716_0002
15/03/04 09:28:10 INFO client.TezClientUtils: Using tez.lib.uris value from 
configuration: /hdp/apps/2.2.2.0-2538/tez/tez.tar.gz
java.io.FileNotFoundException: File does not exist: 
/hdp/apps/2.2.2.0-2538/tez/tez.tar.gz
at 
org.apache.hadoop.hdfs.DistributedFileSystem$19.doCall(DistributedFileSystem.java:1140)
at 
org.apache.hadoop.hdfs.DistributedFileSystem$19.doCall(DistributedFileSystem.java:1132)
at 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
at 
org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1132)
at org.apache.hadoop.fs.FileSystem.resolvePath(FileSystem.java:750)
at org.apache.tez.client.TezClientUtils.getLRFileStatus(TezClientUtils.java:127)
at 
org.apache.tez.client.TezClientUtils.setupTezJarsLocalResources(TezClientUtils.java:178)
at org.apache.tez.client.TezClient.getTezJarResources(TezClient.java:721)
at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:689)
at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:667)
at org.apache.tez.client.TezClient.submitDAG(TezClient.java:353)
at org.apache.spark.tez.DAGBuilder.run(DAGBuilder.java:164)
at org.apache.spark.tez.DAGBuilder.access$000(DAGBuilder.java:60)
at org.apache.spark.tez.DAGBuilder$1.execute(DAGBuilder.java:148)
at org.apache.spark.tez.TezDelegate.submitApplication(TezDelegate.scala:82)
at 
org.apache.spark.tez.TezJobExecutionContext.runJob(TezJobExecutionContext.scala:168)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:1292)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:1358)
at org.apache.spark.rdd.RDD.reduce(RDD.scala:882)
at org.apache.spark.examples.SparkPi$.main(SparkPi.scala:41)
at org.apache.spark.examples.SparkPi.main(SparkPi.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.spark.deploy.SparkSubmit$.launch(SparkSubmit.scala:367)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:77)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Error: application failed with exception
java.lang.IllegalStateException: Failed to execute DAG
at org.apache.spark.tez.DAGBuilder.run(DAGBuilder.java:177)
at org.apache.spark.tez.DAGBuilder.access$000(DAGBuilder.java:60)
at org.apache.spark.tez.DAGBuilder$1.execute(DAGBuilder.java:148)
at org.apache.spark.tez.TezDelegate.submitApplication(TezDelegate.scala:82)
at 
org.apache.spark.tez.TezJobExecutionContext.runJob(TezJobExecutionContext.scala:168)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:1292)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:1358)
at org.apache.spark.rdd.RDD.reduce(RDD.scala:882)
at org.apache.spark.examples.SparkPi$.main(SparkPi.scala:41)
at org.apache.spark.examples.SparkPi.main(SparkPi.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.spark.deploy.SparkSubmit$.launch(SparkSubmit.scala:367)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:77)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
Caused by: java.io.FileNotFoundException: File does not exist: 
/hdp/apps/2.2.2.0-2538/tez/tez.tar.gz
at 
org.apache.hadoop.hdfs.DistributedFileSystem$19.doCall(DistributedFileSystem.java:1140)
at 
org.apache.hadoop.hdfs.DistributedFileSystem$19.doCall(DistributedFileSystem.java:1132)
at 
org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
at 
org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1132)
at org.apache.hadoop.fs.FileSystem.resolvePath(FileSystem.java:750)
at org.apache.tez.client.TezClientUtils.getLRFileStatus(TezClientUtils.java:127)
at 
org.apache.tez.client.TezClientUtils.setupTezJarsLocalResources(TezClientUtils.java:178)
at org.apache.tez.client.TezClient.getTezJarResources(TezClient.java:721)
at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:689)
at org.apache.tez.client.TezClient.submitDAGApplication(TezClient.java:667)
at org.apache.tez.client.TezClient.submitDAG(TezClient.java:353)
at org.apache.spark.tez.DAGBuilder.run(DAGBuilder.java:164)
... 16 more
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in 
[jar:file:/grid/0/hdp/2.2.2.0-2538/spark/lib/spark-examples-1.2.1.2.2.2.0-2538-hadoop2.6.0.2.2.2.0-2538.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in 
[jar:file:/grid/0/hdp/2.2.2.0-2538/spark/external/spark-native-yarn/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
{noformat}

Here is the log location:
http://qelog.hortonworks.com/log/ec2-amb-s11-3-us-1425455713-spark


