Yu Xu created PIG-2749:
--------------------------
Summary: Keep predicates in PIG after they have been pushed down
to give data sources more flexibility and enable more optimization
Key: PIG-2749
URL: https://issues.apache.org/jira/browse/PIG-2749
Project: Pig
Issue Type: Improvement
Reporter: Yu Xu
Priority: Minor
Currently PIG removes any predicate it has pushed down to data sources (via
LoadMetadata).
This ticket requests that the PIG optimizer keep all predicates it has pushed
down, so that:
1) the underlying data sources have more flexibility and more opportunities to
do projections and indexed lookups. The only requirement is that what a data
source returns must be a superset of the correct results. With this
flexibility, a data source can return more rows than the exact pushed-down
predicates would select (for example, by using approximate predicate
filtering) while still being more efficient than a full table/file scan.
2) program correctness is easier to verify, since PIG would still apply all
predicates.
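
The superset contract in 1) can be illustrated with a small sketch. This is
not Pig code: Block, ApproximateSource, load and query are hypothetical names
made up for the example, and the block-level min/max check merely stands in
for any approximate predicate filtering a data source might do. The source
returns whole blocks that could contain matches, and the engine re-applies the
exact predicate afterwards.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    """A chunk of rows (hypothetical storage layout for this sketch)."""
    rows: List[int]

    @property
    def lo(self) -> int:
        return min(self.rows)

    @property
    def hi(self) -> int:
        return max(self.rows)


class ApproximateSource:
    """Hypothetical data source that filters only at block granularity."""

    def __init__(self, blocks: List[Block]):
        self.blocks = blocks
        self.rows_returned = 0

    def load(self, lo: int, hi: int) -> List[int]:
        # Return every row of each block whose [min, max] range overlaps
        # [lo, hi]. This is a superset of the exact answer.
        out: List[int] = []
        for block in self.blocks:
            if block.hi >= lo and block.lo <= hi:
                out.extend(block.rows)
        self.rows_returned = len(out)
        return out


def query(source: ApproximateSource, lo: int, hi: int) -> List[int]:
    # The predicate is pushed down AND kept: the engine filters again,
    # so extra rows from the source never reach the final result.
    candidates = source.load(lo, hi)
    return [r for r in candidates if lo <= r <= hi]


blocks = [Block([1, 4, 7]), Block([10, 15, 19]), Block([20, 26, 30])]
source = ApproximateSource(blocks)
result = query(source, 12, 22)

# Reference answer: full scan with the exact predicate.
full_scan = [r for b in blocks for r in b.rows if 12 <= r <= 22]
```

Here the source skips the first block and hands back 6 of the 9 rows, three
of which do not match; the retained predicate discards them, so the result is
the same as a full scan with the exact filter.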