Howdy all,
I recently saw that the OrcInputFormat/OutputFormat's have been exposed
to be usable outside of hive (
https://issues.apache.org/jira/browse/HIVE-5728). Does anyone know how
one could use this with saveAsNewAPIHadoopFile to write records in orc
format?
In particular, I would like to use a spark streaming process to read
avro records off of kafka, and write then write them directly to hdfs in
orc format where they could be used with shark.Thanks, Brock
