> On Apr 24, 2018, at 5:01 PM, Greg Stein <[email protected]> wrote:
> 
> Let's go back to the start: stuff older than six months will be deleted.
> What could possibly need to be retained?

        - Not every job runs every day.  Some are extremely situational.

        - Some users may have deliberately marked certain data to be retained 
for very specific reasons.

        In my case, I marked some logs not to be deleted because I was 
using them to debug the systemic Jenkins build node crashes. I want to keep the 
data to see whether the usage numbers, etc., go down over time.

        So yes, there may be some value to some of that data that will not be 
obvious to an outside observer.

> Assume all jobs will be touched.

        … which is why a directory listing of just the base directory 
would be useful: it would show who needs to look. If INFRA is unwilling to 
provide that data, then keep any directories that reference:

        - precommit
        - hadoop
        - yarn
        - hdfs
        - mapreduce
        - hbase
        - yetus
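
        A rough sketch of that keep rule, in case it helps whoever writes the 
cleanup script. It assumes the match is a case-insensitive substring check on 
the directory name; that interpretation, and the function names below, are 
hypothetical and not anything INFRA actually runs:

```python
# Keywords from the list above; a directory is kept if its name contains any of them.
KEEP_KEYWORDS = ("precommit", "hadoop", "yarn", "hdfs", "mapreduce", "hbase", "yetus")


def should_keep(dir_name: str) -> bool:
    """Return True if the directory name references any of the keywords."""
    lowered = dir_name.lower()  # assumption: matching is case-insensitive
    return any(keyword in lowered for keyword in KEEP_KEYWORDS)


def split_by_retention(dir_names):
    """Partition directory names into (keep, candidates_for_deletion)."""
    keep = [name for name in dir_names if should_keep(name)]
    candidates = [name for name in dir_names if not should_keep(name)]
    return keep, candidates
```

        Feeding it the base directory listing would give the set to leave 
alone and the set that is fair game, which could be reviewed before anything is 
actually removed.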

Thanks!
