[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-11-27 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-558976957 > IMHO, the core problem is the compact metadata log grows bigger and bigger, and it

[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-11-27 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-558976957 > IMHO, the core problem is the compact metadata log grows bigger and bigger, and it

[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-11-21 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-557398360 UPDATE: SPARK-29995 is just filed from other end user which denotes same issues SPAR

[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-03-18 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-474149223 This is a commit based on master for the last alternative (exclude old output files

[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-03-18 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-474153829 Rebased to the approach: applying retention. Also updated JIRA and PR as well.

[GitHub] [spark] HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files

2019-03-18 Thread GitBox
HeartSaVioR edited a comment on issue #24128: [SPARK-27188][SS] FileStreamSink: provide a new option to have retention on output files URL: https://github.com/apache/spark/pull/24128#issuecomment-474150802 > We shouldn't implement ad-hoc changes with unclear behavior and semantics just to