I have a Spark Structured Streaming job that dumps data into a parquet sink. To keep the parquet output from growing indefinitely, I want to discard data older than 3 months. Does Spark Structured Streaming support this, or do I need to stop the streaming job, trim the parquet files, and restart it? Thanks for any hints.
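For context, the workaround I have in mind if Spark cannot do this natively: partition the sink by a date column (the `event_date=YYYY-MM-DD` layout below is an assumption, not my actual schema) and run a periodic cleanup outside the stream that deletes partition directories past the retention window, so the streaming job itself never has to stop. A minimal sketch:

```python
import shutil
from datetime import date, timedelta
from pathlib import Path

RETENTION_DAYS = 90  # roughly three months

def trim_old_partitions(sink_dir: str, today: date) -> list:
    """Delete date-partitioned subdirectories (event_date=YYYY-MM-DD)
    that fall outside the retention window.

    Returns the names of the removed partitions, sorted.
    """
    cutoff = today - timedelta(days=RETENTION_DAYS)
    removed = []
    for part in Path(sink_dir).glob("event_date=*"):
        # Partition directory names encode the date after the '='
        day = date.fromisoformat(part.name.split("=", 1)[1])
        if day < cutoff:
            shutil.rmtree(part)
            removed.append(part.name)
    return sorted(removed)
```

I am not sure whether deleting partition directories out from under a running file-sink query is safe with respect to the sink's `_spark_metadata` log, which is part of what I am asking.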
- retention policy for Spark Structured Streaming dataset (Lian Jiang)