Viktor Szathmary created PARQUET-98:
---------------------------------------

             Summary: filter2 API performance regression
                 Key: PARQUET-98
                 URL: https://issues.apache.org/jira/browse/PARQUET-98
             Project: Parquet
          Issue Type: Bug
            Reporter: Viktor Szathmary


The new filter API (filter2) seems to be much slower than the old one, or perhaps I'm using it wrong :)

Code using an UnboundRecordFilter:

{code:java}
ColumnRecordFilter.column(column,
    ColumnPredicates.applyFunctionToBinary(
        input -> Binary.fromString(value).equals(input)));
{code}

vs. code using FilterPredicate:

{code:java}
eq(binaryColumn(column), Binary.fromString(value));
{code}

The latter is twice as slow on the same Parquet file (built using 
1.6.0rc2).
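
For anyone who wants to reproduce the comparison, a minimal timing sketch might look like the one below. It is not the harness used for the numbers above. The names FilterTiming, medianNanos and slowdown are hypothetical and not part of Parquet, and the two Runnable bodies are empty placeholders that would need to be replaced with a full read of the same file through each filter API.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Parquet: times two read paths and
// reports how much slower the second one is.
final class FilterTiming {

    // Runs the task `warmup` times untimed so the JIT can settle, then
    // `runs` timed iterations. Returns the median elapsed nanoseconds
    // (the upper middle sample when `runs` is even).
    static long medianNanos(Runnable task, int warmup, int runs) {
        if (runs < 1) {
            throw new IllegalArgumentException("runs must be >= 1");
        }
        for (int i = 0; i < warmup; i++) {
            task.run();
        }
        long[] samples = new long[runs];
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.run();
            samples[i] = System.nanoTime() - start;
        }
        Arrays.sort(samples);
        return samples[runs / 2];
    }

    // Ratio of candidate time to baseline time; 2.0 means twice as slow.
    static double slowdown(long baselineNanos, long candidateNanos) {
        return (double) candidateNanos / baselineNanos;
    }

    public static void main(String[] args) {
        // Placeholders (assumption): replace each body with a complete read
        // of the same file using the UnboundRecordFilter path and the
        // FilterPredicate path respectively.
        Runnable oldFilterRead = () -> { };
        Runnable newFilterRead = () -> { };

        long oldNanos = medianNanos(oldFilterRead, 3, 10);
        long newNanos = medianNanos(newFilterRead, 3, 10);
        System.out.println("old median ns: " + oldNanos);
        System.out.println("new median ns: " + newNanos);
        if (oldNanos > 0) {
            System.out.println("slowdown: " + slowdown(oldNanos, newNanos));
        }
    }
}
```

Taking the median over several runs after a warm-up reduces the influence of JIT compilation and disk caching on the result; a proper benchmark framework would be a better choice for anything beyond a quick check.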




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
