Yu Xu created PIG-2749:
--------------------------

             Summary: Keep predicates in PIG after they have been pushed down 
to give data sources more flexibility and enable more optimization
                 Key: PIG-2749
                 URL: https://issues.apache.org/jira/browse/PIG-2749
             Project: Pig
          Issue Type: Improvement
            Reporter: Yu Xu
            Priority: Minor


Currently PIG removes any predicate it has pushed down to data sources (via 
LoadMetadata).

This ticket requests that the PIG optimizer keep all predicates it has pushed 
down, so that:

1) the underlying data sources have more flexibility/opportunities to do 
projections/indexed lookups. The only requirement for the underlying data 
sources is that what they return must be a superset of the correct results.

As a result of this flexibility, the underlying data sources can return more 
results than the exact pushed-down predicates would (for example, by using 
approximate predicate filtering), while still being more efficient than a full 
table/file scan.

2) it is easier to verify program correctness, since PIG would still apply all 
predicates itself.
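
The two points above can be illustrated with a small sketch. It is not PIG 
code: the block layout, the `age` column, and the names `Block`, 
`approximate_scan` and `exact_filter` are all hypothetical, chosen only to 
show a source that answers a pushed-down predicate approximately (here by 
skipping blocks using a per-block maximum) while the engine re-applies the 
exact predicate afterwards.

```python
from dataclasses import dataclass
from typing import Dict, List

Row = Dict[str, int]


@dataclass
class Block:
    """A chunk of rows, standing in for a file split or index page (assumed layout)."""
    rows: List[Row]

    @property
    def max_age(self) -> int:
        # Coarse statistic a data source might keep per block.
        return max(r["age"] for r in self.rows)


def full_scan(blocks: List[Block]) -> List[Row]:
    """Read every row: what happens with no pushdown at all."""
    return [r for b in blocks for r in b.rows]


def approximate_scan(blocks: List[Block], low: int) -> List[Row]:
    """Source-side handling of the pushed-down predicate `age > low`.

    Whole blocks are skipped when their maximum rules them out; surviving
    blocks are returned unfiltered, so the output is a superset of the
    exact answer.
    """
    return [r for b in blocks if b.max_age > low for r in b.rows]


def exact_filter(rows: List[Row], low: int) -> List[Row]:
    """Engine-side predicate, kept in the plan after pushdown."""
    return [r for r in rows if r["age"] > low]


blocks = [
    Block([{"id": 1, "age": 10}, {"id": 2, "age": 15}]),
    Block([{"id": 3, "age": 20}, {"id": 4, "age": 40}]),
    Block([{"id": 5, "age": 35}, {"id": 6, "age": 50}]),
]
LOW = 30

candidates = approximate_scan(blocks, LOW)          # superset from the source
result = exact_filter(candidates, LOW)              # predicate re-applied
reference = exact_filter(full_scan(blocks), LOW)    # answer without pushdown

print(len(full_scan(blocks)), len(candidates), len(result))
print(result == reference)
```

In this sketch the source hands back fewer rows than a full scan but more than 
the exact answer, and re-applying the predicate yields the same result as 
filtering a full scan. The final comparison is also the kind of check point 2 
refers to: a source that pushes down imprecisely cannot change the result as 
long as it returns a superset.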


        
