Re: [compute] limit push-down and nested types

2022-09-20 Thread Lei Xu
Happy to provide some implementation details.

On 2022/09/20 18:58:18 Weston Pace wrote:
> > However, I'm wondering if there's a better path to integrating that more
> > "natively" into Arrow. Happy to make contributions if that's an option.
>
> I'm not quite sure I understand where Arrow

Re: [compute] limit push-down and nested types

2022-09-20 Thread Weston Pace
> However, I'm wondering if there's a better path to integrating that more
> "natively" into Arrow. Happy to make contributions if that's an option.

I'm not quite sure I understand where Arrow integration comes into play here. Would that scanner use Arrow internally? Or would you only convert

[compute] limit push-down and nested types

2022-09-20 Thread Chang She
Hi there,

We're creating a new columnar data format for computer vision with Arrow integration as a first-class citizen (github.com/eto-ai/lance). It significantly outperforms Parquet in a variety of computer vision workloads.

*Question 1:* Because vision data tends to be large-ish blobs, we