I'm missing context but if you're talking about C++/Python, we are
currently missing a wrapper interface to the ORC reader in the Arrow
datasets library

https://github.com/apache/arrow/tree/master/cpp/src/arrow/dataset

We have CSV, Arrow (IPC), and Parquet interfaces.

But we have an HDFS filesystem implementation and an ORC reader
implementation, so mechanically all of the pieces are there but need
to be connected together.

Thanks,
Wes

On Tue, Sep 7, 2021 at 8:22 AM Manoj Kumar <man...@zettabolt.com> wrote:
>
> Hi Dev-Community,
>
> Anyone can help me to guide how to read ORC directly from HDFS to an
> arrow dataset.
>
> Thanks
> Manoj

Reply via email to