GitHub user ajithme opened a pull request:
https://github.com/apache/spark/pull/22120
[SPARK-25131] Event logs missing applicationAttemptId for
SparkListenerApplicationStart
When master=yarn and deploy-mode=client, event logs do not contain the
applicationAttemptId for SparkListenerApplicationStart. The cause is in
org.apache.spark.scheduler.cluster.YarnClientSchedulerBackend#start, where we
call bindToYarn(client.submitApplication(), None), which sets appAttemptId to
None. We can, however, get the appAttemptId after waitForApplication() returns
and set it then.
I have tested this manually and verified that it works.
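
The control flow described above can be sketched roughly as follows. This is a simplified, self-contained model and not the actual patch: AppId, AttemptId, StubClient, SketchBackend and currentAttemptId are hypothetical stand-ins, and only the names bindToYarn, submitApplication, waitForApplication and start are taken from the description.

```scala
// Hypothetical stand-ins; these are NOT the real Spark or YARN classes.
final case class AppId(id: Int)
final case class AttemptId(appId: AppId, attempt: Int)

// Stub for the YARN client used by the scheduler backend.
class StubClient {
  def submitApplication(): AppId = AppId(1)

  // Assumption: the current attempt id can be looked up once the app runs.
  def currentAttemptId(appId: AppId): AttemptId = AttemptId(appId, 1)
}

class SketchBackend(client: StubClient) {
  private var appId: Option[AppId] = None
  private var attemptId: Option[AttemptId] = None

  private def bindToYarn(id: AppId, attempt: Option[AttemptId]): Unit = {
    appId = Some(id)
    attemptId = attempt
  }

  // Stub: the real method blocks until the application is running.
  private def waitForApplication(): Unit = ()

  def start(): Unit = {
    // Existing behaviour: the attempt id is bound as None at submission time.
    bindToYarn(client.submitApplication(), None)
    waitForApplication()
    // Proposed behaviour: fill in the attempt id once it is available.
    attemptId = appId.map(client.currentAttemptId)
  }

  // The value that would be reported with SparkListenerApplicationStart.
  def applicationAttemptId(): Option[String] = attemptId.map(_.attempt.toString)
}
```

In this model, applicationAttemptId() is empty before start() and defined afterwards, which is the property the event log depends on.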
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ajithme/spark missingAttemptId
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/22120.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #22120
commit cc7625c06609c1092ca26d6d6b4f522b9f844710
Author: Ajith
Date: 2018-08-16T12:54:06Z
Set application attempt id in yarn client mode