I'll put recommendations for the design on the issue. Thanks!
On Fri, Mar 15, 2024 at 2:03 PM Aldrin wrote:
> I created a new issue [1] to track the refactoring. Could you clarify the
> request (here or in the issue)?
>
> My understanding is that the Skyhook file format code [2] should be
> refa
I created a new issue [1] to track the refactoring. Could you clarify the
request (here or in the issue)?
My understanding is that the Skyhook file format code [2] should be refactored
to use a higher-level interface rather than using dataset::FileFormat and
dataset::FragmentScanOptions directl
Hi Ben, I am willing to help out with the refactor too !
On Wed, Mar 13, 2024 at 9:25 PM Aldrin wrote:
> I am interested in helping to refactor!
>
> -Aldrin
>
>
> On Wed, Mar 13, 2024 at 08:54, Benjamin Kietzman > wrote:
>
> Skyhook [1] enables efficient predicate and projection pushdown from
>
I am interested in helping to refactor!
-Aldrin
On Wed, Mar 13, 2024 at 08:54, Benjamin Kietzman
wrote: Skyhook [1] enables efficient predicate and projection pushdown from
Arrow Dataset to a Ceph storage cluster. This is very cool
functionality, but it's tightly coupled t
Skyhook [1] enables efficient predicate and projection pushdown from
Arrow Dataset to a Ceph storage cluster. This is very cool
functionality, but it's tightly coupled to the Arrow C++ Dataset
implementation in a way which blocks refactoring. In the Arrow C++
codebase today, Acero is designed speci