Satish Subhashrao Saley created OOZIE-2787: ----------------------------------------------
Summary: Oozie distributes application jar twice making the spark job fail Key: OOZIE-2787 URL: https://issues.apache.org/jira/browse/OOZIE-2787 Project: Oozie Issue Type: Bug Reporter: Satish Subhashrao Saley Assignee: Satish Subhashrao Saley Oozie adds the application jar to the list of files to be uploaded to distributed cache. Since this gets added twice, the job fails. This is observed from spark 2.1.0 which introduces a check for same file and fails the job. {code} --master yarn --deploy-mode cluster --name oozieSparkStarter --class ScalaWordCount --queue default --conf spark.executor.extraClassPath=$PWD/* --conf spark.driver.extraClassPath=$PWD/* --conf spark.executor.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties --conf spark.driver.extraJavaOptions=-Dlog4j.configuration=spark-log4j.properties --conf spark.yarn.security.tokens.hive.enabled=false --conf spark.yarn.security.tokens.hbase.enabled=false --files hdfs://mycluster.com/user/saley/oozie/apps/sparkapp/lib/spark-example.jar --properties-file spark-defaults.conf --verbose spark-example.jar samplefile.txt output {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346)