[
https://issues.apache.org/jira/browse/ARROW-7625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kouhei Sutou updated ARROW-7625:
--------------------------------
Summary: [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying
compression type (was: Parquet GLib and Red Parquet (Ruby) do not allow
specifying compression type)
> [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying
> compression type
> -----------------------------------------------------------------------------------
>
> Key: ARROW-7625
> URL: https://issues.apache.org/jira/browse/ARROW-7625
> Project: Apache Arrow
> Issue Type: Improvement
> Components: GLib
> Environment: red-arrow 0.15.1
> red-parquet 0.15.1
> libarrow 0.15.1
> libparquet 0.15.1
> Reporter: Keith Gable
> Assignee: Kouhei Sutou
> Priority: Major
>
> It seems that the ArrowFileWriter being used by parquet-glib just uses the
> default writer properties
> (https://github.com/apache/arrow/blob/master/c_glib/parquet-glib/arrow-file-writer.cpp#L184),
> and does not offer the user the ability to override this. As a consumer of
> the GLib API in Ruby (red-parquet), I therefore have no way of compressing
> Parquet columns. Of course, I can compress the entire file by doing something
> like {{t.save('...', format: 'parquet', compression: 'GZIP')}}, but most tools
> cannot read a file compressed this way, and it isn't the correct way of
> compressing a Parquet file.
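
To illustrate the compatibility point in the description, here is a minimal sketch using only the Ruby standard library (no red-parquet calls). It assumes that whole-file compression wraps the finished Parquet bytes in a gzip stream, as the reporter describes. A Parquet file starts and ends with the 4-byte magic "PAR1", and readers check for it. The helper names fake_parquet_bytes and parquet_magic? are hypothetical, and the payload is placeholder data, not valid Parquet content.

```ruby
require "zlib"

PARQUET_MAGIC = "PAR1".b

# Hypothetical stand-in for a Parquet file: only the leading and trailing
# magic bytes are modeled; the middle is filler, not real Parquet data.
def fake_parquet_bytes
  PARQUET_MAGIC + ("\x00".b * 64) + PARQUET_MAGIC
end

# Returns true when the bytes start and end with the Parquet magic number.
def parquet_magic?(bytes)
  bytes.bytesize >= 8 &&
    bytes.byteslice(0, 4) == PARQUET_MAGIC &&
    bytes.byteslice(-4, 4) == PARQUET_MAGIC
end

plain = fake_parquet_bytes
# Assumption: this mimics what a file-level GZIP option does to the output.
gzipped = Zlib.gzip(plain)

puts "plain file has Parquet magic:   #{parquet_magic?(plain)}"
puts "gzipped file has Parquet magic: #{parquet_magic?(gzipped)}"
puts "after gunzip has Parquet magic: #{parquet_magic?(Zlib.gunzip(gzipped))}"
```

The gzipped bytes begin with the gzip header rather than "PAR1", so a reader that expects a Parquet file has to be told to decompress the file first. Column-level compression set through the writer properties keeps the Parquet framing intact, which is what this issue asks to expose.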
--
This message was sent by Atlassian Jira
(v8.3.4#803005)