GitHub user misutoth opened a pull request: https://github.com/apache/spark/pull/22331
Tests for idempotency of FileStreamSink - Work in Progress ## What changes were proposed in this pull request? Reproduce File Sink duplication in driver failure scenario to help understanding the situation. ## How was this patch tested? This is a test addition only that was run and the last 2 tests failed showing there is a problem. You can merge this pull request into a Git repository by running: $ git pull https://github.com/misutoth/spark file-sink-dupe Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/22331.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #22331 ---- commit 0a5c6c45a4b90fc2ea8bd2647b6d3d3dfd8bd1a4 Author: Mihaly Toth <misutoth@...> Date: 2018-09-03T11:47:52Z Tests for idempotency of FileStreamSink ---- --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org