FYI: flattening of embedded structures is not supported in Parquet either.

Regards,
 -Stefan

On Wed, Mar 23, 2016 at 8:51 PM, Johannes Schulte <
johannes.schu...@gmail.com> wrote:

> Hi Stefan,
>
> thanks for your response and the link to your udf repository, it's a good
> reference. I tried drill 1.6, the data is an array of complex objects
> though. I will try to setup a drill dev environment and see if i can modify
> the tests to fail.
>
> Johannes
>
> On Wed, Mar 23, 2016 at 8:13 PM, Stefán Baxter <ste...@activitystream.com>
> wrote:
>
> > FYI. this seems to be working in 1.6, at least on the Avro data that we
> > have.
> >
> > On Wed, Mar 23, 2016 at 6:59 PM, Stefán Baxter <
> ste...@activitystream.com>
> > wrote:
> >
> > > Hi again,
> > >
> > > What version of Drill are you using?
> > >
> > > Regards,
> > > - Stefán
> > >
> > > On Wed, Mar 23, 2016 at 4:49 PM, Stefán Baxter <
> > ste...@activitystream.com>
> > > wrote:
> > >
> > >> Hi Johannes,
> > >>
> > >> As great as Drill is the Avro plugin has been a source of frustration
> > for
> > >> us @activitystream.
> > >>
> > >> We have a small UDF library [1] (apache licensed) which contains a
> > >> function can return an array (List<String>) from Avro as a CSV list.
> > >>
> > >> You could use that to roll your own or provide me with a small sample
> > and
> > >> I can create a custom flatten function for you.
> > >>
> > >> The best would be to wait for a fix but this can potentially get you
> out
> > >> of a rough spot.
> > >>
> > >> [1] https://github.com/activitystream/asdrill
> > >>
> > >> Regards,
> > >>  -Stefán
> > >>
> > >> On Wed, Mar 23, 2016 at 9:05 AM, Johannes Schulte <
> > >> johannes.schu...@gmail.com> wrote:
> > >>
> > >>> Hi,
> > >>>
> > >>> when trying to read simple avro arrays with select flatten(array)
> from
> > >>> dfs... i get the exception
> > >>>
> > >>> SQL Query Error: SYSTEM ERROR: ClassCastException: Cannot cast
> > >>> org.apache.drill.exec.vector.complex.MapVector to
> > >>> org.apache.drill.exec.vector.complex.RepeatedValueVector
> > >>> ^
> > >>>
> > >>> The type of the array is said to be <UnknownType (2,002)>
> > >>>
> > >>> Is this the expected behaviour? The documentation mostly talsk about
> > json
> > >>> and parquet complex types and i wonder if the avro storage plugin
> > behaves
> > >>> differently.
> > >>>
> > >>> Thanks,
> > >>>
> > >>> Johannes
> > >>>
> > >>
> > >>
> > >
> >
>

Reply via email to