Thejas M Nair created PIG-3007:
----------------------------------

             Summary: support group-by collected for load funcs that don't 
implement CollectableLoadFunc
                 Key: PIG-3007
                 URL: https://issues.apache.org/jira/browse/PIG-3007
             Project: Pig
          Issue Type: New Feature
            Reporter: Thejas M Nair


group-by collected should be supported for all input that are sorted on 
group-by keys.
To ensure that a map task gets all records for a group-key, indexing can be 
done to determine which key at which it should start processing , and if it 
should read from next split as well to get remaining records for the last 
group-by column in its original split.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to