Re: Any standard way for min/max values per record-batch?

2021-07-19 Thread Kohei KaiGai
Hello, Let me share our trial to support the min/max statistics per record batch. https://github.com/heterodb/pg-strom/wiki/806:-Apache-Arrow-Min-Max-Statistics-Hint The latest pg2arrow supports --stat option that can specify the columns to include min/max statistics for each record batch. The

[Rust] Proposed 5.0.0 release blog

2021-07-19 Thread Andrew Lamb
Here is a PR [1] with a proposed 5.0.0 Rust release blog announcement. Help / Comments / Contributions more than welcome Andrew [1] https://github.com/apache/arrow-site/pull/128

Re: [Discuss] [Rust] Arrow2/parquet2 going foward

2021-07-19 Thread Andrew Lamb
> If we do indeed have the expectation of stability over its whole public surface, I certainly do not have this expectation between major releases. Who does? I believe it is a disservice to the overall community to release two API incompatible Rust implementations of Arrow to crates.io. It will

Re: [Discuss] [Rust] Arrow2/parquet2 going foward

2021-07-19 Thread Jorge Cardoso Leitão
Hi, Whatever its technical faults may be, projects that rely on arrow (such as > anything based on DataFusion, like my own) need to be supported as they > have made the bet on Rust Arrow. > 1.X versioning in Apache Arrow was never meant to represent stability of their individual libraries, but