Re: [DISCUSS][C++] Help needed to refactor Skyhook

2024-03-18 Thread Benjamin Kietzman
I'll put recommendations for the design on the issue. Thanks!

On Fri, Mar 15, 2024 at 2:03 PM Aldrin wrote:
> I created a new issue [1] to track the refactoring. Could you clarify the
> request (here or in the issue)?
>
> My understanding is that the Skyhook file format code [2] should be
> refa

Re: [DISCUSS][C++] Help needed to refactor Skyhook

2024-03-15 Thread Aldrin
I created a new issue [1] to track the refactoring. Could you clarify the request (here or in the issue)?

My understanding is that the Skyhook file format code [2] should be refactored to use a higher-level interface rather than using dataset::FileFormat and dataset::FragmentScanOptions directl

Re: [DISCUSS][C++] Help needed to refactor Skyhook

2024-03-14 Thread Jayjeet Chakraborty
Hi Ben, I am willing to help out with the refactor too!

On Wed, Mar 13, 2024 at 9:25 PM Aldrin wrote:
> I am interested in helping to refactor!
>
> -Aldrin
>
> On Wed, Mar 13, 2024 at 08:54, Benjamin Kietzman wrote:
> > Skyhook [1] enables efficient predicate and projection pushdown from
>

Re: [DISCUSS][C++] Help needed to refactor Skyhook

2024-03-13 Thread Aldrin
I am interested in helping to refactor!

-Aldrin

On Wed, Mar 13, 2024 at 08:54, Benjamin Kietzman wrote:
> Skyhook [1] enables efficient predicate and projection pushdown from Arrow Dataset to a Ceph storage cluster. This is very cool functionality, but it's tightly coupled t

[DISCUSS][C++] Help needed to refactor Skyhook

2024-03-13 Thread Benjamin Kietzman
Skyhook [1] enables efficient predicate and projection pushdown from Arrow Dataset to a Ceph storage cluster. This is very cool functionality, but it's tightly coupled to the Arrow C++ Dataset implementation in a way which blocks refactoring. In the Arrow C++ codebase today, Acero is designed speci