Just wondering advantages and disadvantages to convert data into ORC or Parquet.
In the documentation of Spark there are numerous examples of Parquet format. Any strong reasons to chose Parquet over ORC file format ? Also : current data compression is bzip2 http://stackoverflow.com/questions/32373460/parquet-vs-orc-vs-orc-with-snappy This seems like biased.