Re: POCollectedGroup and LoadFunc indicator interface

2012-09-12 Thread Alan Gates
You are correct, this would be better named OrderedCollectableLoadFunc. I suspect the way this happened is that this is usually used on the output of MapReduce jobs. In that case (at least in MR1) the keys are sorted as well as guaranteed to be in a particular part file.
Alan.
On Sep 7, 20

POCollectedGroup and LoadFunc indicator interface

2012-09-07 Thread Vasco Visser
Hi, I am new to the list. I've been working on the Pig code base, adding my own blocking map-side POs (e.g., map-side join, map-side grouping) for when assertions can be made with regard to fragmentation of input relations. Partly inspired by the new block placement policy possibilities in hado