Re: How to write dataset as parquet format

2017-11-23 Thread ebru
Flavio and Fabian thanks for your quick answers, it was very helpful. -Ebru > On 22 Nov 2017, at 20:47, Flavio Pompermaier wrote: > > I usually refer to this: > > https://github.com/FelixNeutatz/parquet-flinktacular > > > On 22 Nov 2017 1

Re: How to write dataset as parquet format

2017-11-22 Thread Flavio Pompermaier
I usually refer to this: https://github.com/FelixNeutatz/parquet-flinktacular On 22 Nov 2017 18:29, "Fabian Hueske" wrote: > Hi Ebru, > > AvroParquetOutputFormat seems to implement Hadoop's OutputFormat interface. > Flink provides a wrapper for Hadoop's OutputFormat [1], so you can try to > wra

Re: How to write dataset as parquet format

2017-11-22 Thread Fabian Hueske
Hi Ebru, AvroParquetOutputFormat seems to implement Hadoop's OutputFormat interface. Flink provides a wrapper for Hadoop's OutputFormat [1], so you can try to wrap AvroParquetOutputFormat in Flink's HadoopOutputFormat. Hope this helps, Fabian [1] https://ci.apache.org/projects/flink/flink-docs-r

How to write dataset as parquet format

2017-11-22 Thread ebru
Hello all, We are trying to write dataset as parquet format, we use AvroParquetOutputFormat but it is not compatible with Flinkā€™s FileOutputFormat. Is there a way to write dataset as parquet? -Ebru