[C#] active development / activity, especially on Flight

2022-09-13 Thread Kirk Broadhurst
Hello! I'm trying to get some C# developers into using Arrow / Flight and am finding a number of missing capabilities / features. Most I can work around; the most pressing and ugliest workaround will be the lack of do_exchange in the C# FlightClient. I'm wondering if there's anyone out there with

Re: Integration between Flight and Acero

2022-09-13 Thread Weston Pace
Yes. If you need the source node to read in parallel OR if you have multiple fragments (especially if those fragments don't have identical schemas) then you want a dataset and not just a plain source node. On Tue, Sep 13, 2022 at 1:55 PM David Li wrote: > > Yeah, I concur with Weston. > > > To

Re: Integration between Flight and Acero

2022-09-13 Thread David Li
Yeah, I concur with Weston. > To start with I think a custom factory function will be sufficient > (e.g. look at MakeScanNode in scanner.cc for an example). So the > options would somehow describe the coordinates of the flight endpoint. These 'coordinates' would be a FlightDescriptor. >

Re: Integration between Flight and Acero

2022-09-13 Thread Weston Pace
> The alternative path of subclassing SourceNode and having ExecNode::Init or > ExecNode::StartProducing seems quite a bit of change (also I don't think > SourceNode is exposed via public header). But let me know if you think I am > missing something. Agreed that we don't want to go this route.

Re: Integration between Flight and Acero

2022-09-13 Thread Li Jin
Thanks Yaron for the pointer to that PR. On Tue, Sep 13, 2022 at 4:43 PM Yaron Gvili wrote: > If you can wrap the flight reader as a RecordBatchReader, then another > possibility is using an upcoming PR ( > https://github.com/apache/arrow/pull/14041) that enables SourceNode to > accept it. You

Re: [DISC] Remove Kartothek integration tests from nightlies

2022-09-13 Thread Sutou Kouhei
+1 In "[DISC] Remove Kartothek integration tests from nightlies" on Tue, 13 Sep 2022 10:47:17 +0200, Raul Cumplido Dominguez wrote: > Hi, > > Currently Kartothek [1] nightly builds are flaky [2]. > > The Kartothek project does not seem to be active anymore. The last merged > PR was in

Re: Integration between Flight and Acero

2022-09-13 Thread Yaron Gvili
If you can wrap the flight reader as a RecordBatchReader, then another possibility is using an upcoming PR (https://github.com/apache/arrow/pull/14041) that enables SourceNode to accept it. You would need to know the schema when configuring the SourceNode, but you won't need to derived from

Arrow sync call September 14 at 12:00 US/Eastern, 16:00 UTC

2022-09-13 Thread Ian Cook
Hi all, Our biweekly sync call is tomorrow at 12:00 noon Eastern time. The Zoom meeting URL for this and other biweekly Arrow sync calls is: https://zoom.us/j/87649033008?pwd=SitsRHluQStlREM0TjJVYkRibVZsUT09 Alternatively, enter this information into the Zoom website or app to join the call:

Re: Integration between Flight and Acero

2022-09-13 Thread Li Jin
Update: I am going to try what David Li suggested here: https://lists.apache.org/thread/8yfvvyyc79m11z9wql0gzdr25x4b3g7v This seems to be the least amount of code. This does require calling "DoGet" at Acero plan/node creation time rather than execution time but I don't think it's a big deal for

Re: [DISC] Improving Arrow's database support

2022-09-13 Thread David Li
Ah, thanks for the clarification Neal! Jacob/Matt: I put up https://github.com/apache/arrow-adbc/pull/124 to describe the convention but I wonder if we should partition components more granularly than we have so far. On Mon, Sep 12, 2022, at 12:57, Neal Richardson wrote: > On Mon, Sep 12, 2022

Re: Subject: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 12.0.0 RC1

2022-09-13 Thread Ian Joiner
+1 (Non-binding) Verified on my macOS 12.2.1 / Apple M1 Chip On Mon, Sep 12, 2022 at 2:55 PM Andy Grove wrote: > Hi, > > I would like to propose a release of Apache Arrow DataFusion > Implementation, > version 12.0.0. > > This release candidate is based on commit: >

Re: PyArrow build problem

2022-09-13 Thread Jeszy
Hey, Sorry for the confusion, VESA is an internal system. We're working with Yaron to resolve this over a tighter loop, and will circle back with the outcome. Balazs On Tue, Sep 13, 2022 at 8:48 AM Yaron Gvili wrote: > > OK, I posted in VESA-62. > > > Yaron. >

Re: [DISC] Remove Kartothek integration tests from nightlies

2022-09-13 Thread Antoine Pitrou
Hello, +1 from me. We should not have integration builds without a dedicated maintainer to look after them. Regards Antoine. Le 13/09/2022 à 10:47, Raul Cumplido Dominguez a écrit : Hi, Currently Kartothek [1] nightly builds are flaky [2]. The Kartothek project does not seem to be

Re: [DISC] Remove Kartothek integration tests from nightlies

2022-09-13 Thread Jacob Wujciak
+1 Raul Cumplido Dominguez schrieb am Di., 13. Sept. 2022, 10:47: > Hi, > > Currently Kartothek [1] nightly builds are flaky [2]. > > The Kartothek project does not seem to be active anymore. The last merged > PR was in December 2021. > This is not the first time that builds have failed for

[DISC] Remove Kartothek integration tests from nightlies

2022-09-13 Thread Raul Cumplido Dominguez
Hi, Currently Kartothek [1] nightly builds are flaky [2]. The Kartothek project does not seem to be active anymore. The last merged PR was in December 2021. This is not the first time that builds have failed for Kartothek, see [3] from April this year. I would like to remove the Nightly

Re: Subject: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 12.0.0 RC1

2022-09-13 Thread Francis Du
+1 (non-binding) Verified on Intel Mac On Tue, 13 Sept 2022 at 07:30, Remzi Yang <1371656737...@gmail.com> wrote: > +1 (non-binding). Verified on M1 Mac. > Thank you, Andy. > > On Tue, 13 Sept 2022 at 04:59, Willy Kuo wrote: > > > +1 (non-binding) > > > > verified on Intel Mac > > > > Thanks

Re: PyArrow build problem

2022-09-13 Thread Yaron Gvili
OK, I posted in VESA-62. Yaron. From: Sutou Kouhei Sent: Monday, September 12, 2022 8:26 PM To: dev@arrow.apache.org Subject: Re: PyArrow build problem Hi, Ah, this mailing list doesn't accept attachments. Could you upload them to somewhere such as gist and