Re: Arrow Flight usage with graph databases

2022-07-19 Thread Valentyn Kahamlyk
Hi David, We are planning to use Flight for the prototype. We are also planning to use Flight SQL as a reference, however we wanted to explore ideas whether Arrow Flight Graph can be implemented on top of Arrow Flight (similar to Arrow Flight SQL). Graph databases generally do not expose or enfor

Arrow sync call July 20 at 12:00 US/Eastern, 16:00 UTC

2022-07-19 Thread Ian Cook
Hi all, Our biweekly sync call is tomorrow at 12:00 noon Eastern time. The Zoom meeting URL for this and other biweekly Arrow sync calls is: https://zoom.us/j/87649033008?pwd=SitsRHluQStlREM0TjJVYkRibVZsUT09 Alternatively, enter this information into the Zoom website or app to join the call: Mee

Re: ExecutionContext, batch ordering clarification

2022-07-19 Thread Yaron Gvili
Hi, I also have a related question: could you recommend a way to get the batches in order when using a source node? If necessary, a way that involves changing or wrapping the source node's code is acceptable. Yaron. From: Li Jin Sent: Tuesday, July 19, 2022 10

Re: ExecutionContext, batch ordering clarification

2022-07-19 Thread Li Jin
Thanks Weston, two follow up questions: (1) What is the threading model when passing "exector=nullptr" to "ExecContext" ? (Does it only uses one thread?) (2) For the file reader, if we want to ensure batches coming out of the reader are ordered but also have parallelism, I'd imagine doing sth like

Re: ExecutionContext, batch ordering clarification

2022-07-19 Thread Weston Pace
If you are using a source node (which it appears you are) then it will be creating new thread tasks for each batch. So, in theory, these could get out of order. My guess is that the file reader is slow enough that by the time you load batch N from disk and decode it, you have a pretty good chance

ExecutionContext, batch ordering clarification

2022-07-19 Thread Ivan Chau
Hi all, I am doing some investigations of the AsOfJoinNode, and consequently have come across some strange behavior when experimenting with the ExecutionContext and in-memory / file streaming source nodes. Our AsOfJoin algorithm requires that the input be in chronological order with respect to on

Re: [C++] Adding Run-Length Encoding to Arrow

2022-07-19 Thread Antoine Pitrou
Le 08/07/2022 à 15:19, Wes McKinney a écrit : * I believe that having a Type::RLE is the right approach in C++ and it makes dynamic dispatch everywhere in the library pretty straightforward. +1 on this, as it will raise a nice NotImplemented error for existing code rather than crash or corr

Re: [VOTE] Accept donation of Rust Object Store donation

2022-07-19 Thread Daniël Heres
+1 (binding) On Tue, Jul 19, 2022, 09:03 Martin Grigorov wrote: > +1 (non-binding) > > On Mon, Jul 18, 2022 at 6:55 PM Andrew Lamb wrote: > > > Hello, > > > > This vote is to determine if the Arrow PMC is in favor of accepting the > > donation of the Rust Object Store crate to arrow-rs. > > > >

Re: [VOTE] Accept donation of Rust Object Store donation

2022-07-19 Thread Martin Grigorov
+1 (non-binding) On Mon, Jul 18, 2022 at 6:55 PM Andrew Lamb wrote: > Hello, > > This vote is to determine if the Arrow PMC is in favor of accepting the > donation of the Rust Object Store crate to arrow-rs. > > We have previously discussed this topic [1] and [2]. > > The proposed donation is at