I'll just add that a PR is in progress (thanks Joris!) for adding this adapter: https://github.com/apache/arrow/pull/10991
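
Until a dataset-level adapter is available, the existing pieces mentioned in the thread (the HDFS filesystem and the ORC reader) can be connected by hand. The following is only a minimal sketch under stated assumptions: the helper names `read_orc_as_dataset` and `example_hdfs_usage`, the host `namenode.example.com`, port 8020 and the file path are all hypothetical, and it assumes a pyarrow build with ORC and libhdfs support. It reads the whole file into memory, so it does not provide the lazy scanning a real dataset adapter would.

```python
def read_orc_as_dataset(filesystem, path, columns=None):
    """Read one ORC file through an Arrow filesystem into an in-memory dataset."""
    # Imported here so the sketch only needs pyarrow when it is actually called.
    import pyarrow.dataset as ds
    from pyarrow import orc

    # open_input_file returns a random-access file, which the ORC reader needs.
    with filesystem.open_input_file(path) as f:
        table = orc.ORCFile(f).read(columns=columns)

    # Wrap the already materialized table so it can be used via the dataset API.
    return ds.dataset(table)


def example_hdfs_usage():
    """Hypothetical usage; needs libhdfs and a reachable HDFS cluster."""
    from pyarrow import fs

    # Assumed host, port and path, shown for illustration only.
    hdfs = fs.HadoopFileSystem("namenode.example.com", port=8020)
    return read_orc_as_dataset(hdfs, "/data/events/part-0.orc")
```

Because the helper takes any Arrow filesystem object, the same function can be tried against `pyarrow.fs.LocalFileSystem()` before pointing it at a cluster.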

On Tue, Sep 7, 2021 at 12:05 PM Wes McKinney <wesmck...@gmail.com> wrote:
>
> I'm missing context but if you're talking about C++/Python, we are
> currently missing a wrapper interface to the ORC reader in the Arrow
> datasets library
>
> https://github.com/apache/arrow/tree/master/cpp/src/arrow/dataset
>
> We have CSV, Arrow (IPC), and Parquet interfaces.
>
> But we have an HDFS filesystem implementation and an ORC reader
> implementation, so mechanically all of the pieces are there but need
> to be connected together.
>
> Thanks,
> Wes
>
> On Tue, Sep 7, 2021 at 8:22 AM Manoj Kumar <man...@zettabolt.com> wrote:
> >
> > Hi Dev-Community,
> >
> > Anyone can help me to guide how to read ORC directly from HDFS to an
> > arrow dataset.
> >
> > Thanks
> > Manoj