Satish Subhashrao Saley created OOZIE-2787:
----------------------------------------------

             Summary: Oozie distributes application jar twice making the spark 
job fail
                 Key: OOZIE-2787
                 URL: https://issues.apache.org/jira/browse/OOZIE-2787
             Project: Oozie
          Issue Type: Bug
            Reporter: Satish Subhashrao Saley
            Assignee: Satish Subhashrao Saley


Oozie adds the application jar to the list of files to be uploaded to 
distributed cache. Since this gets added twice, the job fails. This is observed 
from spark 2.1.0 which introduces a check for same file and fails the job.

{code}
--master
yarn
--deploy-mode
cluster
--name
oozieSparkStarter
--class
ScalaWordCount
--queue 
default
--conf
spark.executor.extraClassPath=$PWD/*
--conf
spark.driver.extraClassPath=$PWD/*
--conf
spark.executor.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties
--conf
spark.driver.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties
--conf
spark.yarn.security.tokens.hive.enabled=false
--conf
spark.yarn.security.tokens.hbase.enabled=false
--files
hdfs://mycluster.com/user/saley/oozie/apps/sparkapp/lib/spark-example.jar
--properties-file
spark-defaults.conf
--verbose
spark-example.jar
samplefile.txt
output
{code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to