GitHub user omalley commented on the issue:
https://github.com/apache/orc/pull/189
The sizes of the different data sets under each of the compression options:
https://drive.google.com/file/d/11bzDaqJiL7CQDg7TVo1PvQqXauj_DgIQ/view?usp=sharing
