Re: Writing null structs to parquet

2020-07-30 Thread Micah Kornfield
Hi Radu, We are slowly working on the read support but no concrete dates. I think some people from ursalabs might get involved in the effort which might improve the velocity that this gets delivered. There are still some potential bugs on the write path for nested data. If you want to help

Re: Likelyhood of 1.0.1 patch release

2020-07-30 Thread Neal Richardson
Makes sense to me. On Thu, Jul 30, 2020 at 1:50 PM Wes McKinney wrote: > hi folks, > > Based on some of the JIRAs I have seen I would guess that a patch > release sometime in August might be a good idea, should we go ahead > and create a JIRA milestone (for tagging candidate patches) for that >

Likelyhood of 1.0.1 patch release

2020-07-30 Thread Wes McKinney
hi folks, Based on some of the JIRAs I have seen I would guess that a patch release sometime in August might be a good idea, should we go ahead and create a JIRA milestone (for tagging candidate patches) for that so that we do the bookkeeping as we go? Thanks Wes

Re: execute sql on top of arrow and data fusion

2020-07-30 Thread kekronbekron
Also, I think you can check out BlazingSQL. - KB ‐‐‐ Original Message ‐‐‐ On Thursday, July 30, 2020 9:05 PM, Andy Grove wrote: > Hi Kirill, > > Yes, DataFusion is part of Arrow and provides native query execution on > Arrow data. However, DataFusion is implemented in Rust and uses the

Re: Writing null structs to parquet

2020-07-30 Thread Radu Teodorescu
You’re a rock-star - your PR works for my reallife usecase as well - unfortunately this squashes my hopes of making my first arrow contribution today :) Now it breaks in supporting a combination of struct and list at read time, but that is clearly documented as not yet supported - it there any

Re: execute sql on top of arrow and data fusion

2020-07-30 Thread Andy Grove
Hi Kirill, Yes, DataFusion is part of Arrow and provides native query execution on Arrow data. However, DataFusion is implemented in Rust and uses the Rust implementation of Arrow. It does not delegate to the C++ implementation like Gandiva does. Thanks, Andy. On Thu, Jul 30, 2020 at 7:09 AM

Re: Writing null structs to parquet

2020-07-30 Thread Radu Teodorescu
Thank you Micah! I spent a bit of time trying to get to the bottom of it (I know parquet pretty well, but not that familiar with arrow parquet inner workings) so if manage to track down the issue I’ll circle back (I give myself a 30% chance of success given the allotted time and expertise

execute sql on top of arrow and data fusion

2020-07-30 Thread Kirill Lykov
Hi guys, I would like to confirm that knowledge about the status of execution SQL on Arrow data is up to date. I remember there was a Gandiva module which provides AST primitives.But there were no sqlparser + gandiva integration donated by Dremio. Today I've found DataFusion project which seems

Re: [NIGHTLY] Arrow Build Report for Job nightly-2020-07-30-0

2020-07-30 Thread Krisztián Szűcs
On Thu, Jul 30, 2020 at 12:15 PM Crossbow wrote: > > > Arrow Build Report for Job nightly-2020-07-30-0 > > All tasks: > https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-07-30-0 > Most of the failing tasks are false negatives. Appveyor picked up certain branches/builds even

[NIGHTLY] Arrow Build Report for Job nightly-2020-07-30-0

2020-07-30 Thread Crossbow
Arrow Build Report for Job nightly-2020-07-30-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-07-30-0 Failed Tasks: - centos-8-amd64: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-07-30-0-github-centos-8-amd64 -