RE: [DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread wish maple
I have two parquet related bug fixes and I wonder if we can release them in 12.0.1 1. https://github.com/apache/arrow/pull/35428 2. https://github.com/apache/arrow/pull/35520 Patch 1 can cause BYTE_STREAM_SPLIT unable to be read if the previous parquet page is larger than the incoming one. Patch 2

Re: [DISCUSS][Gandiva] changes in bundled double-conversion

2023-05-18 Thread Sutou Kouhei
Great!!! In "Re: [DISCUSS][Gandiva] changes in bundled double-conversion" on Thu, 18 May 2023 09:28:26 -0400, Ian Cook wrote: > I upstreamed the changes to our vendored double-conversion in [1]. > These changes are now released in double-conversion v3.3.0 [2]. We can > remove our patches w

Re: [DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread Will Jones
Thanks for bringing this up Weston. Joris has already created a 12.0.1 milestone that contains several fixes that are candidates for backport [1], including this one. I think this is the most severe issue though. As a maintainer of the Python deltalake package, which uses the PyArrow Parquet writ

Re: [DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread Matt Topol
I think it's worthwhile enough to justify the work for the patch. If we do end up doing the patch, then we should also include this [1] change for the Go side which, while significant, I didn't believe to be significant enough to warrant a patch on its own. But it is definitely a good idea to inclu

Re: [DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread Ian Cook
There is also a major issue with the 12.0.0 R package that has now been fixed in the repo [2] and needs to be resubmitted to CRAN soon. The R package developers are supportive of a 12.0.1 patch release happening soon so that the resubmission of the R package to CRAN can also include the fix for the

[DISCUSS] Interest in a 12.0.1 patch?

2023-05-18 Thread Weston Pace
Regrettabl, 12.0.0 had a significant performance regression (I'll take the blame for not thinking through all the use cases), most easily exposed when writing datasets from pandas / numpy data, which is being addressed in [1]. I believe this to be a fairly common use case and it may warrant a 12.0

Re: [VOTE][RUST] Release Apache Arrow Rust Object Store 0.6.0 RC1

2023-05-18 Thread L. C. Hsieh
+1 (binding) Verified on M1 Mac. Thanks Raphael. On Thu, May 18, 2023 at 3:31 AM Andrew Lamb wrote: > > +1 (binding) > > I ran the release verification script (Mac x86_64) and reviewed the > changelog . Looks like a good release. > > Thank you, > Andrew > > On Thu, May 18, 2023 at 5:25 AM Rapha

Re: [DISCUSS][Gandiva] changes in bundled double-conversion

2023-05-18 Thread Ian Cook
I upstreamed the changes to our vendored double-conversion in [1]. These changes are now released in double-conversion v3.3.0 [2]. We can remove our patches when we upgrade to v3.3.0 [3]. Ian [1] https://github.com/google/double-conversion/pull/195 [2] https://github.com/google/double-conversion/

Re: [VOTE][RUST] Release Apache Arrow Rust Object Store 0.6.0 RC1

2023-05-18 Thread Andrew Lamb
+1 (binding) I ran the release verification script (Mac x86_64) and reviewed the changelog . Looks like a good release. Thank you, Andrew On Thu, May 18, 2023 at 5:25 AM Raphael Taylor-Davies wrote: > Hi, > > I would like to propose a release of Apache Arrow Rust Object > Store Implementation,

[VOTE][RUST] Release Apache Arrow Rust Object Store 0.6.0 RC1

2023-05-18 Thread Raphael Taylor-Davies
Hi, I would like to propose a release of Apache Arrow Rust Object Store Implementation, version 0.6.0. This release candidate is based on commit: ec7706c1f2aeef5a289e46d1df7785e5c93e6bfb [1] The proposed release tarball and signatures are hosted at [2]. The changelog is located at [3]. Plea