Cheolsoo Park created PIG-4329: ---------------------------------- Summary: Fetch optimization should be disabled when limit is not pushed up Key: PIG-4329 URL: https://issues.apache.org/jira/browse/PIG-4329 Project: Pig Issue Type: Bug Reporter: Cheolsoo Park Assignee: Cheolsoo Park Fix For: 0.15.0
Although PIG-4135 disable fetch optimization when there is no limit in the plan, that doesn't solve the problem completely. In fact, fetch optimization should be still disabled if limit is not pushed up. Consider the following query- {code} random_lists = load 'prodhive.schakraborty.search_server_denorm_impressions' using DseStorage(); random_lists = filter random_lists by entity_section=='random'); random_lists = limit random_lists 10; dump random_lists; {code} Because the {{filter by}} blocks limit from being pushed up, POLoad actually scans the full table. In this case, fetch optimization makes the job extremely slow. -- This message was sent by Atlassian JIRA (v6.3.4#6332)