+1

From my previous work experience, ORC performs better than Parquet in certain cases and really deserves to be supported.

On Sat, Apr 1, 2017 at 5:58 PM, Ted Yu <yuzhih...@gmail.com> wrote:
> +1
>
>> On Apr 1, 2017, at 8:31 AM, Tibor Kiss <tibor.k...@gmail.com> wrote:
>>
>> Hello,
>>
>> Recently the Optimized Row Columnar (ORC) file format was spun off from Hive
>> and became a top-level Apache project: https://orc.apache.org/
>>
>> It is similar to Parquet in the sense that it uses a column-major format, but
>> ORC has a more elaborate type system and stores basic statistics about each row.
>>
>> I'd be interested in extending Beam with ORC support if others find it helpful
>> too.
>>
>> What do you think?
>>
>> - Tibor