Re: [RUST] Proposal for more frequent Rust Arrow release process

2021-05-03 Thread Neal Richardson
In case it's not clear where `comdev@` is (it wasn't at all to me), here's the thread Julian referenced: https://lists.apache.org/thread.html/rf12538ef0f60f7257e63391e5d4962a6156564020c99d3dfb193f4d7%40%3Cdev.community.apache.org%3E Neal On Mon, May 3, 2021 at 1:25 PM Julian Hyde wrote: > If

Re: [RUST] Proposal for more frequent Rust Arrow release process

2021-05-03 Thread Julian Hyde
If you think that a release every two weeks, following the standard voting-on-signed-tarball process, is not too onerous, then we're good. If you read my email thread on comdev@ you will see that SkyWalking has been following a similar process. On Mon, May 3, 2021 at 11:41 AM Andrew Lamb wrote:

Re: [Format][RFC] Introduce COMPLEX type for IntervalUnit

2021-05-03 Thread Jacek Pliszka
Good idea, I've created JIRA issue: https://issues.apache.org/jira/browse/ARROW-12637 And named it range to avoid confusion with intervals... Though confusion will stay as it is called interval in Pandas and in logic (Allen's interval algebra) BR, Jacek pon., 3 maj 2021 o 18:05 Micah

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Krisztián Szűcs
On Mon, May 3, 2021 at 8:45 PM Andy Grove wrote: > > To answer Julian's question, the original proposal was for the site to be > published at https://arrow.github.io/arrow-datafusion. This is using the > ASF-provided support for GitHub pages so I had assumed that this was a > standard way of

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Andy Grove
To answer Julian's question, the original proposal was for the site to be published at https://arrow.github.io/arrow-datafusion. This is using the ASF-provided support for GitHub pages so I had assumed that this was a standard way of doing things. One benefit of this approach is that is controlled

Re: [RUST] Proposal for more frequent Rust Arrow release process

2021-05-03 Thread Andrew Lamb
There have been some great discussions on the document. I would say the major unresolved questions are 1. What branching strategy would best balance frequent releases, maintainers' time, and contributors' time. 2. (related) How do we handle breaking API changes between major releases that

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Julian Hyde
I would regard arrow.apache.org/datafusion as “served from apache”. A non-apache domain would be something like arrow-datafusion.github.io or https://datafusion.io. Julian > On May 3, 2021, at 10:05 AM, Wes McKinney

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Wes McKinney
What would be the advantages of this versus publishing a website to arrow.apache.org/datafusion? If the project is actually part of Apache Arrow, I would be worried about having different base URLs altogether for different subprojects On Mon, May 3, 2021 at 11:58 AM Julian Hyde wrote: > > Would

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Julian Hyde
Would this web site be served from an apache.org domain? > On May 3, 2021, at 7:34 AM, Andy Grove wrote: > > Based on a quick reading of ASF documentation, I don't think we need to > vote on creating a website, but I do think that the user guide should be > published from

Re: [Rust] remove Rust from apache/arrow

2021-05-03 Thread Andrew Lamb
Thanks all -- this was epic On Mon, May 3, 2021 at 8:22 AM Krisztián Szűcs wrote: > Hi, > > I've just merged it. Hopefully we didn't forget anything, but even if > we did we can address them in follow-up PRs. > > Thanks, Krisztian > > On Mon, May 3, 2021 at 1:30 PM Jorge Cardoso Leitão >

Re: [Format][RFC] Introduce COMPLEX type for IntervalUnit

2021-05-03 Thread Micah Kornfield
Hi Jacek, This seems like reasonable functionality. I think the probably comes in two parts: 1. This might be a good candidate for a "Well Known"/Officially supported Extension type. I can think of a few different representations but I would guess something like Struct[start: T, struct: end]]

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Andy Grove
Based on a quick reading of ASF documentation, I don't think we need to vote on creating a website, but I do think that the user guide should be published from the next release, and the user guide should be part of the release tarball that we vote on. For now, the website can simply point users

Re: [Format][RFC] Introduce COMPLEX type for IntervalUnit

2021-05-03 Thread Jacek Pliszka
Sorry, my mistake. You are right - I meant anchored intervals as in pandas - ones with defined start and end - and I think many future users will make the same mistake. I would love to be able to do fast overlap joins on arrow level. Best Regards, Jacek niedz., 2 maj 2021 o 23:06 Wes

Re: Pyarrow RecordBatchStreamWriter and dictionaries

2021-05-03 Thread Alessandro Molina
Hi Radu, I was trying to reproduce the issue you described, but I was unable to reproduce the problem. Could you provide an example of how you built the Table? I tried reproducing it with a table with following schema pa.schema([ pa.field('nums', pa.list_(pa.int32())), pa.field('chars',

Re: [Rust] remove Rust from apache/arrow

2021-05-03 Thread Krisztián Szűcs
Hi, I've just merged it. Hopefully we didn't forget anything, but even if we did we can address them in follow-up PRs. Thanks, Krisztian On Mon, May 3, 2021 at 1:30 PM Jorge Cardoso Leitão wrote: > > Hi, > > The PR proposing the removal of the Rust implementation from apache/arrow > is ready:

[Rust] remove Rust from apache/arrow

2021-05-03 Thread Jorge Cardoso Leitão
Hi, The PR proposing the removal of the Rust implementation from apache/arrow is ready: https://github.com/apache/arrow/pull/10096 It is a -200k LOC change, so I wanted to give it some visibility here. The integration tests continue to run as is; they will pull from the latest apache/arrow-rs.

Re: [DISCUSS] Host DataFusion website on GitHub pages

2021-05-03 Thread Andrew Lamb
I think this is a great idea. Thank you for proposing it. Andrew On Sun, May 2, 2021 at 1:17 PM Andy Grove wrote: > The Arrow Rust community has been discussing the idea of hosting a > DataFusion website using GitHub pages. For further details please see the > GibHub issue [1] and Google doc

[NIGHTLY] Arrow Build Report for Job nightly-2021-05-03-0

2021-05-03 Thread Crossbow
Arrow Build Report for Job nightly-2021-05-03-0 All tasks: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-05-03-0 Failed Tasks: - conda-linux-gcc-py36-arm64: URL: