Github user CodingCat commented on the pull request:
https://github.com/apache/incubator-spark/pull/626#issuecomment-35841703
@pwendell Thanks for the comments, I also considered what you mentioned,
but will that prevent other components like Spark Streaming from doing the
right job? (I'm not familiar with streaming, but it seems that it will
overwrite the existing directory...)
Also how to prevent the situation that the user occasionally run the job
over the same directory for two times, but with different partition number (the
second running has smaller value); eventually, the directory will contain the
results from two runnings.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. To do so, please top-post your response.
If your project does not have this feature enabled and wishes so, or if the
feature is enabled but not working, please contact infrastructure at
[email protected] or file a JIRA ticket with INFRA.
---