[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-18 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @vanzin I will close this PR, but I can't completely agree with your point. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-16 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/16142 > #16165 has supported deleting too old in-progress job logs. So I think it is OK in this case. It's *not* OK. That change uses a heuristic to decide when to delete "inprogress" logs: if

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-16 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @srowen There is real demand for it. I have received reports from our users: they care about the space consumed by job event logs, but not about log age, no matter how old the logs are. If you guys

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-16 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 > The current scan code does not make one request to the NameNode per log file in the directory. Your code does. That should be avoided. Makes sense; the current implementation can be optimized

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-16 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/16142 I personally don't see much value in deleting by 'size' because there's no particular reason to believe that files that happen to be bigger contain less important info. If you want to delete more

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-15 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/16142 > I just add a new clean-up mode, but not add the cleaner itself. But that's kinda the point. How many different ways of cleaning need to be added? Will this one be enough? Will people ask

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-15 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 Besides, the unit test has proved that the older file will be cleaned first.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-15 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @vanzin > For the feature, it feels like it's trying to make the SHS more like a "log management system" than a history server. Sorry, I do not get it. I just add a new clean-up

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-15 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/16142 I'm not such a big fan of this feature, but mostly I'm not a big fan of the current implementation. For the feature, it feels like it's trying to make the SHS more like a "log management

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-13 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 cc @vanzin

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-13 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @AmplabJenkins retest please

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-13 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 retest it please

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70066/ Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Merged build finished. Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #70066 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70066/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 retest please

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #70059 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70059/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/70059/ Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Merged build finished. Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #70059 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/70059/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-12 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @vanzin I have removed the related changes in `EventLoggingListener` and provided a new clean-up mode, i.e. a `space`-based mode. Please take a look.
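
The PR's code is not shown in this thread, so the following is only a minimal sketch of what a space-based clean-up could look like, assuming the oldest logs are removed first (as a later comment in the thread says the unit test verifies). The names `LogFile`, `select_logs_to_delete` and `max_total_bytes` are hypothetical and are not taken from Spark. The function works on metadata gathered in a single directory listing, so it would not need one file-system request per log file.

```python
from typing import List, NamedTuple


class LogFile(NamedTuple):
    """Hypothetical record for one event log, filled from a single directory listing."""
    path: str
    mtime: float  # last-modified time, seconds since the epoch
    size: int     # size in bytes


def select_logs_to_delete(logs: List[LogFile], max_total_bytes: int) -> List[LogFile]:
    """Pick the oldest logs until the remaining total fits within max_total_bytes."""
    total = sum(log.size for log in logs)
    to_delete = []
    # Walk from oldest to newest so that recent applications are kept longest.
    for log in sorted(logs, key=lambda entry: entry.mtime):
        if total <= max_total_bytes:
            break
        to_delete.append(log)
        total -= log.size
    return to_delete
```

A caller would delete the returned files and leave the rest untouched; whether in-progress logs should be eligible at all is a separate question that the thread also debates.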

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-07 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @vanzin I found you provided this base work, could you please take a look? Thank you.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @srowen If I have understood you correctly, **"log rotation"** is different from **"job event log clean up"**. The "job event log" is replayed to build the Spark history UI.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/16142 Yes, but the alternative is reimplementing an ad-hoc log rotation system here, which isn't great either. Are you saying the history server already manages logs? Pardon, I don't know it at all.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @srowen The Spark History Server may do the clean-up work, provided that we start it and keep it running. Besides, if many applications run constantly, the event log may still take up

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/16142 Hm, does Spark generally manage log rotation? I confess ignorance. It's something you often leave to a cron job or something to archive and clean up.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 cc @srowen

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69668/ Test PASSed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Merged build finished. Test PASSed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #69668 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69668/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #69668 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69668/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 @SparkQA Test it again

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread uncleGen
Github user uncleGen commented on the issue: https://github.com/apache/spark/pull/16142 retest it please

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Merged build finished. Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #69666 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69666/consoleFull)** for PR 16142 at commit

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16142 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/69666/ Test FAILed.

[GitHub] spark issue #16142: [SPARK-18716][CORE] Restrict the disk usage of spark eve...

2016-12-05 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16142 **[Test build #69666 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/69666/consoleFull)** for PR 16142 at commit