I'll just add that a PR in in progress (thanks Joris!) for adding this
adapter: https://github.com/apache/arrow/pull/10991

On Tue, Sep 7, 2021 at 12:05 PM Wes McKinney <wesmck...@gmail.com> wrote:
>
> I'm missing context but if you're talking about C++/Python, we are
> currently missing a wrapper interface to the ORC reader in the Arrow
> datasets library
>
> https://github.com/apache/arrow/tree/master/cpp/src/arrow/dataset
>
> We have CSV, Arrow (IPC), and Parquet interfaces.
>
> But we have an HDFS filesystem implementation and an ORC reader
> implementation, so mechanically all of the pieces are there but need
> to be connected together.
>
> Thanks,
> Wes
>
> On Tue, Sep 7, 2021 at 8:22 AM Manoj Kumar <man...@zettabolt.com> wrote:
> >
> > Hi Dev-Community,
> >
> > Anyone can help me to guide how to read ORC directly from HDFS to an
> > arrow dataset.
> >
> > Thanks
> > Manoj

Reply via email to