GitHub user noodle-fb opened a pull request:

    https://github.com/apache/spark/pull/17422

    Attach accumulators / metrics to 'TaskKilled' end reason

    ## What changes were proposed in this pull request?
    
    The goal is for `onTaskEnd` listeners to receive metrics when a task is 
killed intentionally; that data is currently thrown away. This is already done 
for `ExceptionFailure`, so this change copies the same approach for 
`TaskKilled`.
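
As a rough illustration of the idea (not the actual patch), the sketch below models an end reason that carries accumulator updates, so that a listener can read metrics for killed tasks the same way it does for failed ones. Every type here (`AccumUpdate`, the simplified `TaskEndReason` hierarchy, `TaskEndMetrics.metricsFor`) is a hypothetical stand-in written for this example; none of them are Spark's real classes or signatures.

```scala
// Simplified stand-ins for illustration only; these are NOT Spark's real classes.
final case class AccumUpdate(name: String, value: Long)

sealed trait TaskEndReason
case object Success extends TaskEndReason

// Modeled on the idea that a failure reason already carries accumulator updates.
final case class ExceptionFailure(description: String, accumUpdates: Seq[AccumUpdate])
    extends TaskEndReason

// The proposed idea: a kill reason carries accumulator updates too
// (defaulting to empty when none were collected).
final case class TaskKilled(reason: String, accumUpdates: Seq[AccumUpdate] = Seq.empty)
    extends TaskEndReason

object TaskEndMetrics {
  // What a hypothetical listener could extract from the end reason.
  def metricsFor(reason: TaskEndReason): Map[String, Long] = reason match {
    case ExceptionFailure(_, updates) => toMap(updates)
    case TaskKilled(_, updates)       => toMap(updates)
    case Success                      => Map.empty // no updates on the reason in this sketch
  }

  private def toMap(updates: Seq[AccumUpdate]): Map[String, Long] =
    updates.map(u => u.name -> u.value).toMap
}
```

With this shape, a killed speculative attempt that read some records would still surface them: `TaskEndMetrics.metricsFor(TaskKilled("killed", Seq(AccumUpdate("recordsRead", 42L))))` yields a map containing `recordsRead`.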
    
    ## How was this patch tested?
    
    The unit test in DAGSchedulerSuite that covers this for ExceptionFailure was 
modified to cover TaskKilled as well. I also re-ran all the unit tests modified 
by the [last change to 
TaskKilled](https://github.com/apache/spark/commit/8e558041aa0c41ba9fb2ce242daaf6d6ed4d85b7) 
and made sure they all still pass.
    
    For integration tests, I ran a query that caused a speculative task retry 
on our deployment, and verified that the metrics showed up in our logging for 
that retry when it was killed.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/noodle-fb/spark task-killed-metrics

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/17422.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #17422
    
----
commit 0625dc3187c9c8fa8d507adc0da75747c30f0ebc
Author: Charles Lewis <noo...@fb.com>
Date:   2017-03-22T20:33:55Z

    report metrics for killed tasks

commit ee883b2f3da10a4e4a48f4a98910ccadceac461c
Author: Charles Lewis <noo...@fb.com>
Date:   2017-03-24T19:06:46Z

    add task killed to exception accum test

commit 25ffbf49b2779d4fa795d754ee20fbe3542dd57d
Author: Charles Lewis <noo...@fb.com>
Date:   2017-03-24T23:20:59Z

    extra fixes for task killed reason merge

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org
