That’s true, spill dirs don’t get cleaned up when something goes wrong. We are are restarting long running jobs once in a while for cleanups and have spark.cleaner.ttl set to a lower value than the default.
> On 14.04.2015, at 17:57, Guillaume Pitel <guillaume.pi...@exensa.com> wrote: > > Right, I remember now, the only problematic case is when things go bad and > the cleaner is not executed. > > Also, it can be a problem when reusing the same sparkcontext for many runs. > > Guillaume >> It cleans the work dir, and SPARK_LOCAL_DIRS should be cleaned >> automatically. From the source code comments: >> // SPARK_LOCAL_DIRS environment variable, and deleted by the Worker when the >> // application finishes. >> >> >>> On 13.04.2015, at 11:26, Guillaume Pitel <guillaume.pi...@exensa.com >>> <mailto:guillaume.pi...@exensa.com>> wrote: >>> >>> Does it also cleanup spark local dirs ? I thought it was only cleaning >>> $SPARK_HOME/work/ >>> >>> Guillaume >>>> I have set SPARK_WORKER_OPTS in spark-env.sh for that. For example: >>>> >>>> export SPARK_WORKER_OPTS="-Dspark.worker.cleanup.enabled=true >>>> -Dspark.worker.cleanup.appDataTtl=<seconds>" >>>> >>>>> On 11.04.2015, at 00:01, Wang, Ningjun (LNG-NPV) >>>>> <ningjun.w...@lexisnexis.com <mailto:ningjun.w...@lexisnexis.com>> wrote: >>>>> >>>>> Does anybody have an answer for this? >>>>> >>>>> Thanks >>>>> Ningjun >>>>> >>>>> From: Wang, Ningjun (LNG-NPV) >>>>> Sent: Thursday, April 02, 2015 12:14 PM >>>>> To: user@spark.apache.org <mailto:user@spark.apache.org> >>>>> Subject: Is the disk space in SPARK_LOCAL_DIRS cleanned up? >>>>> >>>>> I set SPARK_LOCAL_DIRS to C:\temp\spark-temp. When RDDs are shuffled, >>>>> spark writes to this folder. I found that the disk space of this folder >>>>> keep on increase quickly and at certain point I will run out of disk >>>>> space. >>>>> >>>>> I wonder does spark clean up the disk space in this folder once the >>>>> shuffle operation is done? If not, I need to write a job to clean it up >>>>> myself. But how do I know which sub folders there can be removed? >>>>> >>>>> Ningjun >>>> >>> >>> >>> -- >>> <exensa_logo_mail.png> >>> Guillaume PITEL, Président >>> +33(0)626 222 431 >>> >>> eXenSa S.A.S. <http://www.exensa.com/> >>> 41, rue Périer - 92120 Montrouge - FRANCE >>> Tel +33(0)184 163 677 / Fax +33(0)972 283 705 >> > > > -- > <exensa_logo_mail.png> > Guillaume PITEL, Président > +33(0)626 222 431 > > eXenSa S.A.S. <http://www.exensa.com/> > 41, rue Périer - 92120 Montrouge - FRANCE > Tel +33(0)184 163 677 / Fax +33(0)972 283 705