Re: Flink S3 sink unable to compress data

2019-10-18 Thread Ravi Bhushan Ratnakar
Hi,

As per my understanding, Encoder's encode method is called for each and
every message and hence it is not logical to create compressor around given
output stream which will lead into unpredictable erroneous situation.
Encode responsibility is to encode the given object, not to compress. It
seems like at the moment RowFormat does not support compression.

https://issues.apache.org/jira/plugins/servlet/mobile#issue/FLINK-11794

If you want to write compressed output,  you could have your own
implementation for BulkFormat

Regards,
Ravi


On Fri 18 Oct, 2019, 20:30 amran dean,  wrote:

> Hello,
> I am writing a custom S3 object encoder (code here:
> https://pastebin.com/raw/9Ag4ZVpX) used via:
>
> StreamingFileSink dataSink = StreamingFileSink
>   .forRowFormat(new Path("s3a://some_path"), new RecordDataSerializer())
>
>
> During execution, it does not produce any data in S3. This pipeline works
> if the data is not compressed beforehand, and no exceptions are thrown in
> Flink logs or in the webUI. I am confused what is happening and would
> appreciate any help.
>


Flink S3 sink unable to compress data

2019-10-18 Thread amran dean
Hello,
I am writing a custom S3 object encoder (code here:
https://pastebin.com/raw/9Ag4ZVpX) used via:

StreamingFileSink dataSink = StreamingFileSink
  .forRowFormat(new Path("s3a://some_path"), new RecordDataSerializer())


During execution, it does not produce any data in S3. This pipeline works
if the data is not compressed beforehand, and no exceptions are thrown in
Flink logs or in the webUI. I am confused what is happening and would
appreciate any help.