GitHub user dain commented on the issue:

https://github.com/apache/orc/pull/169

@xndai if the min or max happens to be a multi-megabyte value, reading it can be really expensive for the reader. Additionally, for filtering, the first few bytes are the most valuable, since they establish the range.
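
To illustrate the point about the first few bytes, here is a minimal sketch of how min/max values could be shortened to a fixed byte budget while remaining valid bounds for filtering. It is not ORC's implementation: the names `truncate_min`, `truncate_max`, `might_match`, and the `limit` parameter are hypothetical, and the sketch assumes values are compared as raw unsigned bytes (a real string column would also need to avoid cutting a UTF-8 character in half).

```python
from typing import Optional


def truncate_min(value: bytes, limit: int) -> bytes:
    """Return a lower bound of at most `limit` bytes.

    A prefix of a byte string always sorts <= the full string,
    so plain truncation keeps the bound valid.
    """
    return value[:limit]


def truncate_max(value: bytes, limit: int) -> Optional[bytes]:
    """Return an upper bound of at most `limit` bytes, or None if none exists.

    A plain prefix would sort *below* the real maximum, so the last
    byte of the prefix is incremented to push the bound above it.
    """
    if len(value) <= limit:
        return value
    prefix = bytearray(value[:limit])
    # Trailing 0xFF bytes cannot be incremented; drop them first.
    while prefix and prefix[-1] == 0xFF:
        prefix.pop()
    if not prefix:
        return None  # every byte was 0xFF: treat the bound as unbounded
    prefix[-1] += 1
    return bytes(prefix)


def might_match(lower: bytes, upper: Optional[bytes], target: bytes) -> bool:
    """Hypothetical filter check: can `target` fall inside [lower, upper]?"""
    return lower <= target and (upper is None or target <= upper)


# A multi-megabyte maximum shrinks to a few bytes but still bounds the data.
big = b"m" * (4 * 1024 * 1024)
lower = truncate_min(big, 8)
upper = truncate_max(big, 8)
assert lower <= big <= upper
```

The truncated bounds are looser than the exact values, so a reader may occasionally scan data it could have skipped, but it never skips data that matches.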
---