[RESULT][VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 6.0.0 RC0

2021-11-16 Thread QP Hou
The vote has passed with 3 +1 binding votes and 2 non-binding votes. Thank you to all who helped with the release verification. We will proceed to finalize the release by following https://github.com/apache/arrow-datafusion/tree/master/dev/release#finalize-the-release. On Tue, Nov 16, 2021 at

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 6.0.0 RC0

2021-11-16 Thread Sutou Kouhei
+1 I ran the followings on Debian GNU/Linux sid: dev/release/verify-release-candidate.sh 6.0.0 0 Thanks, -- kou In "Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 6.0.0 RC0" on Tue, 16 Nov 2021 18:20:37 -0800, QP Hou wrote: > It looks like we just need one more

Re: [VOTE][RUST][Datafusion] Release Apache Arrow Datafusion 6.0.0 RC0

2021-11-16 Thread QP Hou
It looks like we just need one more binding vote to get this release out, would appreciate a +1 from another PMC member :) On Sun, Nov 14, 2021 at 4:54 PM Andy Grove wrote: > > +1 (binding) > > I ran the release verification script on Ubuntu 20.04 and also ran the > Ballista integration tests. >

Re: [VOTE] Release Apache Arrow 6.0.1 - RC1

2021-11-16 Thread Sutou Kouhei
Hi, Are there any people who want to verify and vote this? I'll close this vote tomorrow if nobody stops it. Thanks, -- kou In <2026.093449.2083493459766152633@clear-code.com> "Re: [VOTE] Release Apache Arrow 6.0.1 - RC1" on Tue, 16 Nov 2021 09:34:49 +0900 (JST), Sutou Kouhei

Re: output_schema for ExecNode

2021-11-16 Thread Weston Pace
I think the ExecPlan itself would probably need some changes. Right now each node has an output schema. Most of the node implementations depend on this in some way or another. For example, a filter node binds the expression to the schema once at plan construction time. If the schema is

Re: [VOTE] Release Apache Arrow 6.0.1 - RC1

2021-11-16 Thread Sutou Kouhei
+1 I ran the followings on Debian GNU/Linux sid: * LANG=C \ TZ=UTC \ ARROW_CMAKE_OPTIONS="-DBoost_NO_BOOST_CMAKE=ON" \ CUDA_TOOLKIT_ROOT=/usr \ dev/release/verify-release-candidate.sh source 6.0.1 1 * LANG=C \ PYTHON=python3 \

Re: Synergies with Apache Avro?

2021-11-16 Thread Jorge Cardoso Leitão
> > I haven't looked at it for a while but my recollection, at least in java, > is streaming process for each step outlined rather than a batch process > (i.e. decompress some bytes, then decode them lazily a "Next Row" is > called). Sorry for the late reply, It took me a bit to go through the

Re: [C++] Adopting a library for (distributed) tracing

2021-11-16 Thread David Li
Following up here: I'm hoping we can enable this in 7.0.0 and am still working on getting all the builds passing (currently RPM packages fail to build with it enabled). OpenTelemetry released their v1.0.0 recently so that should not be a problem anymore. Some changes in approach: * For now,

Re: [Parquet][C++][Python] Maximum Row Group Length Default

2021-11-16 Thread Sarah Gilmore
Hi Micah, Thanks for the clarifying! I just created this Jira issue to track the issue with Pyarrow. Thanks again! Sarah From: Micah Kornfield Sent: Monday, November 15, 2021 3:34 PM To: dev Subject: Re: