I responded on the JIRA issue, but in general this doesn't seem to impose much implementation burden, and it could be useful.
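
For anyone following along who hasn't looked at the encoding, here is a minimal Python sketch of what BYTE_STREAM_SPLIT would do on a FIXED_LEN_BYTE_ARRAY column. It assumes the same layout as the existing float/double encoding (byte k of every value goes into stream k, and the streams are concatenated). The function names are made up for illustration and are not taken from any Parquet implementation.

```python
import struct


def byte_stream_split_encode(values: list[bytes], width: int) -> bytes:
    """Scatter byte k of every value into stream k, then concatenate the streams."""
    if any(len(v) != width for v in values):
        raise ValueError("every value must be exactly `width` bytes")
    data = b"".join(values)
    # data[k::width] selects byte k of each value, which is exactly stream k.
    return b"".join(data[k::width] for k in range(width))


def byte_stream_split_decode(encoded: bytes, width: int) -> list[bytes]:
    """Inverse of the encoder: gather byte i from each stream to rebuild value i."""
    if width <= 0 or len(encoded) % width != 0:
        raise ValueError("encoded length must be a multiple of a positive width")
    count = len(encoded) // width
    streams = [encoded[k * count:(k + 1) * count] for k in range(width)]
    return [bytes(stream[i] for stream in streams) for i in range(count)]


# Hypothetical example: three half-floats stored as 2-byte fixed-length values.
halves = [struct.pack("<e", x) for x in (1.0, 1.5, 2.0)]
encoded = byte_stream_split_encode(halves, width=2)
print(encoded.hex())  # -> 0000003c3e40
```

The low-order bytes (all zero here) end up adjacent, which is the kind of regularity a general-purpose compressor can pick up on afterwards. Nothing in the transform depends on the width being 4 or 8.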

On Thu, Dec 14, 2023 at 12:59 PM Antoine Pitrou <anto...@python.org> wrote:
>
> Hello,
>
> Just a heads up here so as to reach a wider audience: I've posted a
> format addition proposal in
> https://issues.apache.org/jira/browse/PARQUET-2414
>
> Excerpt:
> """
> This issue proposed to widen the types supported by the
> BYTE_STREAM_SPLIT. By allowing the BYTE_STREAM_SPLIT on any
> FIXED_LEN_BYTE_ARRAY column, we can automatically improve compression
> efficiency on various column types including:
>
> half-float data
> fixed-width decimal data
>
> [etc.]
> """
>
> Feel free to comment here or on the JIRA issue.
>
> Regards
>
> Antoine.