Re: Fwd: [C++] Parquet and Arrow overlap

2024-05-10 Thread Matt Topol
I just wanted to also poke the question of non-Java developers who have worked on the other parquet implementations potentially being recognized as committers or otherwise on the Parquet project (speaking as the primary developer of the Go parquet implementation which also lives in the Arrow reposi

Re: Fwd: [C++] Parquet and Arrow overlap

2024-05-10 Thread Jacob Wujciak
Thank you, that sounds great! On first glance some seem to be rather old and probably don't apply anymore. > BTW, do we really need to make a full copy of them to have a mirror in the Arrow GitHub issues? I am not sure I understand what you mean? A full copy of the open/closed/all issues? I'd say

Re: Fwd: [C++] Parquet and Arrow overlap

2024-05-10 Thread Gang Wu
I can initiate the vote. But before the vote, I think we need to revisit the states of all unresolved tickets and close some as needed. BTW, do we really need to make a full copy of them to have a mirror in the Arrow GitHub issues? I'd like to seek a consensus here before sending the vote. Best,

Re: Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Bruno Murino
Hi, It’s my first time here on this mailing list as well. Regarding the EC2 instance size, I wonder if you’re hitting the IOPS limits of the T instance, give the large volumes of data coming out? I could be way off, though, but that’s where my mind went. Cheers, Bruno Murino > On 10 May 2024

Re: Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Bryce Mecum
Hi Christian, welcome. Your code looks reasonable to me at first glance. It does seem possible you're resource-constrained with that t2.micro instance. You might try using a larger instance or reducing the batch size in your call to iter_batches [1] to some very small number. [1] https://arrow.a

Re: [VOTE][RUST] Release Apache Arrow Rust Object Store 0.10.1 RC1

2024-05-10 Thread L. C. Hsieh
+1 (binding) Verified on M3 Mac. Thanks Raphael. On Fri, May 10, 2024 at 10:31 AM Raphael Taylor-Davies wrote: > > Hi, > > I would like to propose a release of Apache Arrow Rust Object > Store Implementation, version 0.10.1. > > This is primarily motivated by a major bug introduced by 0.10.0 [1

Re: [VOTE] Release Apache Arrow 16.1.0 - RC1

2024-05-10 Thread Jacob Wujciak
+1 (non-binding) I tested C++ sources on pop_os 22.04 Am Fr., 10. Mai 2024 um 17:47 Uhr schrieb Raúl Cumplido : > +1 (binding) > > I have tested both SOURCES and BINARIES successfully on Ubuntu 22.04: > TEST_DEFAULT=0 TEST_SOURCE=1 dev/release/verify-release-candidate.sh > 16.1.0 1 > TEST_DEFAUL

[VOTE][RUST] Release Apache Arrow Rust Object Store 0.10.1 RC1

2024-05-10 Thread Raphael Taylor-Davies
Hi, I would like to propose a release of Apache Arrow Rust Object Store Implementation, version 0.10.1. This is primarily motivated by a major bug introduced by 0.10.0 [1] This release candidate is based on commit: 3d3ddb2108502854da98654ada85364d5627ef21 [2] The proposed release tarball and

Re: [VOTE] Release Apache Arrow 16.1.0 - RC1

2024-05-10 Thread Raúl Cumplido
+1 (binding) I have tested both SOURCES and BINARIES successfully on Ubuntu 22.04: TEST_DEFAULT=0 TEST_SOURCE=1 dev/release/verify-release-candidate.sh 16.1.0 1 TEST_DEFAULT=0 TEST_BINARIES=1 dev/release 16.1.0 1 with: * Python 3.10.12 * gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0 * NVIDIA CUD

Flight Python EC2 Server for parquet on S3

2024-05-10 Thread Christian Casazza
Hello everyone, This is my first time emailing this mailing list, so I hope I am explaining things correctly below. I am attempting to get started with Arrow Flight. I am storing parquet files and Iceberg tables on S3. I would like to use arrow flight as the interface data consumers use to access

Re: [VOTE] Release Apache Arrow 16.1.0 - RC1

2024-05-10 Thread wish maple
Ah, only PMC can vote binding Please regard me as non-binding Best, Xuwei Fu wish maple 于2024年5月10日周五 10:39写道: > +1 (binding) > > TEST_DEFAULT=0 TEST_CPP=1 ./verify-release-candidate.sh 16.1.0 1 > Release candidate 16.1.0 works well on my M1 MacOS > > Best, > Xuwei Fu > > David Li 于2024年5月10日