[ https://issues.apache.org/jira/browse/IMPALA-5842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16480162#comment-16480162 ]
ASF subversion and git services commented on IMPALA-5842: --------------------------------------------------------- Commit 5f9641043aed8590cad37f003921c462cda934af in impala's branch refs/heads/2.x from [~boroknagyz] [ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=5f96410 ] IMPALA-5842: Write page index in Parquet files This commit builds on the previous work of Pooja Nilangekar: https://gerrit.cloudera.org/#/c/7464/ The commit implements the write path of PARQUET-922: "Add column indexes to parquet.thrift". As specified in the parquet-format, Impala writes the page indexes just before the footer. This allows much more efficient page filtering than using the same information from the 'statistics' field of DataPageHeader. I updated Pooja's python tests as well. Change-Id: Icbacf7fe3b7672e3ce719261ecef445b16f8dec9 Reviewed-on: http://gerrit.cloudera.org:8080/9693 Reviewed-by: Zoltan Borok-Nagy <borokna...@cloudera.com> Tested-by: Impala Public Jenkins <impala-public-jenk...@cloudera.com> > Write page index in Parquet files > --------------------------------- > > Key: IMPALA-5842 > URL: https://issues.apache.org/jira/browse/IMPALA-5842 > Project: IMPALA > Issue Type: New Feature > Components: Backend > Affects Versions: Impala 2.10.0 > Reporter: Lars Volker > Assignee: Zoltán Borók-Nagy > Priority: Critical > Labels: parquet > > Once PARQUET-922 has been resolved, we should start writing page indices to > Parquet files. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org