Hi Stefan,

thanks for this information - so it seems that there is currently no way of
accessing nested rich objects with drill; I somehow got that wrong from the
documentation...

Cheers,
Johannes

On Thu, Mar 24, 2016 at 2:14 PM, Stefán Baxter <ste...@activitystream.com>
wrote:

> FYI: flattening of embedded structures is not supported in Parquet either.
>
> Regards,
>  -Stefan
>
> On Wed, Mar 23, 2016 at 8:51 PM, Johannes Schulte <
> johannes.schu...@gmail.com> wrote:
>
> > Hi Stefan,
> >
> > thanks for your response and the link to your udf repository, it's a good
> > reference. I tried drill 1.6, the data is an array of complex objects
> > though. I will try to setup a drill dev environment and see if i can
> modify
> > the tests to fail.
> >
> > Johannes
> >
> > On Wed, Mar 23, 2016 at 8:13 PM, Stefán Baxter <
> ste...@activitystream.com>
> > wrote:
> >
> > > FYI. this seems to be working in 1.6, at least on the Avro data that we
> > > have.
> > >
> > > On Wed, Mar 23, 2016 at 6:59 PM, Stefán Baxter <
> > ste...@activitystream.com>
> > > wrote:
> > >
> > > > Hi again,
> > > >
> > > > What version of Drill are you using?
> > > >
> > > > Regards,
> > > > - Stefán
> > > >
> > > > On Wed, Mar 23, 2016 at 4:49 PM, Stefán Baxter <
> > > ste...@activitystream.com>
> > > > wrote:
> > > >
> > > >> Hi Johannes,
> > > >>
> > > >> As great as Drill is the Avro plugin has been a source of
> frustration
> > > for
> > > >> us @activitystream.
> > > >>
> > > >> We have a small UDF library [1] (apache licensed) which contains a
> > > >> function can return an array (List<String>) from Avro as a CSV list.
> > > >>
> > > >> You could use that to roll your own or provide me with a small
> sample
> > > and
> > > >> I can create a custom flatten function for you.
> > > >>
> > > >> The best would be to wait for a fix but this can potentially get you
> > out
> > > >> of a rough spot.
> > > >>
> > > >> [1] https://github.com/activitystream/asdrill
> > > >>
> > > >> Regards,
> > > >>  -Stefán
> > > >>
> > > >> On Wed, Mar 23, 2016 at 9:05 AM, Johannes Schulte <
> > > >> johannes.schu...@gmail.com> wrote:
> > > >>
> > > >>> Hi,
> > > >>>
> > > >>> when trying to read simple avro arrays with select flatten(array)
> > from
> > > >>> dfs... i get the exception
> > > >>>
> > > >>> SQL Query Error: SYSTEM ERROR: ClassCastException: Cannot cast
> > > >>> org.apache.drill.exec.vector.complex.MapVector to
> > > >>> org.apache.drill.exec.vector.complex.RepeatedValueVector
> > > >>> ^
> > > >>>
> > > >>> The type of the array is said to be <UnknownType (2,002)>
> > > >>>
> > > >>> Is this the expected behaviour? The documentation mostly talsk
> about
> > > json
> > > >>> and parquet complex types and i wonder if the avro storage plugin
> > > behaves
> > > >>> differently.
> > > >>>
> > > >>> Thanks,
> > > >>>
> > > >>> Johannes
> > > >>>
> > > >>
> > > >>
> > > >
> > >
> >
>

Reply via email to