[ 
https://issues.apache.org/jira/browse/SPARK-28712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16909910#comment-16909910
 ] 

Jungtaek Lim commented on SPARK-28712:
--------------------------------------

[~yangcong3643]

Looks like this is not due to file handler leak as "lsof" gives nothing.

I think you're hitting SPARK-28025 where crc files are not cleaned up. The crc 
file itself is pretty tiny but it consumes some size (bigger than origin size) 
according to the filesystem, and it will consume nontrivial space if your batch 
number goes to 10000s, even 100000s.

As a workaround you can follow the configuration suggested as a workaround in 
Jira comment, but I still think it should be fixed.

> spark structured stream with kafka don't really delete temp files in spark 
> standalone cluster
> ---------------------------------------------------------------------------------------------
>
>                 Key: SPARK-28712
>                 URL: https://issues.apache.org/jira/browse/SPARK-28712
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 2.4.3
>         Environment: redhat 7
> jdk 1.8
> scala 2.11.12
>  spark standalone cluster 2.4.3
>  kafka 0.10.2.1
>  
>            Reporter: 凭落
>            Priority: Major
>
> the folder in  Driver
> {noformat}
> /tmp/temporary-xxxxxxxx{noformat}
>  takes up all the space in /tmp after runing spark structured stream job a 
> long time.
> it is mainly under the offsets and commits folders.but when I watch it by us 
> command
> {noformat}
> du -sh offsets     du -sh commits{noformat}
> it got more than 600M,but when We  use command
> {noformat}
> ll -h offsets       ll -h commits{noformat}
> it got 400K.
> I think it is because when the file is deleted,it is still used in job.
> It wasn't released only if the job is stopped.
> How can I solve it?
> We use 
> {code}
> df.writeStream.trigger(ProcessingTime("1 seconds"))
> {code}
> not
> {code}
> df.writeStream.trigger(Continuous("1 seconds"))
> {code}
> Is there something wrong here?



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to