Streaming File Sink - Parquet File Writer

2019-10-29 Thread Vinay Patil
Hi, I am not able to roll the files based on file size as the bulkFormat has onCheckpointRollingPolicy. One way is to write CustomStreamingFileSink and provide RollingPolicy like RowFormatBuilder. Is this the correct way to go ahead ? Another way is to write ParquetEncoder and use RowFormatBuild

Re: Streaming File Sink - Parquet File Writer

2019-10-30 Thread Kostas Kloudas
Hi Vinay, You are correct when saying that the bulk formats only support onCheckpointRollingPolicy. The reason for this has to do with the fact that currently Flink relies on the Hadoop writer for Parquet. Bulk formats keep important details about how they write the actual data (such as compress