Imran Rashid created SPARK-28165:
------------------------------------

             Summary: SHS does not delete old inprogress files until 
cleaner.maxAge after SHS start time
                 Key: SPARK-28165
                 URL: https://issues.apache.org/jira/browse/SPARK-28165
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.4.3, 2.3.3
            Reporter: Imran Rashid


The SHS will not delete inprogress files until 
{{spark.history.fs.cleaner.maxAge}} time after it has started (7 days by 
default), regardless of when the last modification to the file was.  This is 
particularly problematic if the SHS gets restarted regularly, as then you'll 
end up never deleting old files.

There might not be much we can do about this -- we can't really trust the 
modification time of the file, as that isn't always updated reliably.

We could take the last time of any event from the file, but then we'd have to 
turn off the optimization of SPARK-6951, to avoid reading the entire file just 
for the listing.

*WORKAROUND*: have the SHS save state across restarts to local disk by 
specifying a path in {{spark.history.store.path}}.  It'll still take 7 days 
from when you add that config for the cleaning to happen, but then going for 
the cleaning should happen reliably.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to