Here's the proposal from a while ago: https://github.com/Parquet/parquet-format/commit/cca062f72b9e27caa51f1a07381065707b13660c
On Sun, Jan 11, 2015 at 9:21 PM, Paul Wais <[email protected]> wrote:

> Thanks for the answers, Nong!
>
> > > * I'm interested in performant range queries on Parquet files, i.e. a
> > > query against a large file that selects all records with column value
> > > in some interval or set.
> >
> > We've thought about how to make these kind of queries work well but a
> > good amount of it still needs to be implemented. The file formats allows
> > for sorting columns.
> > Using this in predicate push down should not be too hard to integrate
> > and what help these queries a lot. Index pages would make this perform
> > even better but that is further out.
>
> I tried looking through Jira and the code but I can't seem to determine the
> state of Index Pages. Is there any documentation available yet?
