Hi all,

Is there a better way to sort the values for a given key before they reach the reducers?
I know this can be done with setOutputValueGroupingComparator and setOutputKeyComparatorClass, but that approach duplicates data: the value field that needs sorting has to be copied into the key. I also wonder what the benefit is of sorting values before they reach the reducers, since the same sorting could be done in the reduce phase anyway.

Thanks,
James
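
P.S. To make the approach I am describing concrete, here is a rough sketch in plain Java with no Hadoop dependency. It only imitates what the two comparators do during the shuffle. The names CompositeKey, SORT_COMPARATOR, GROUPING_COMPARATOR and shuffle are made up for illustration and are not part of the Hadoop API. In a real job the value would also be emitted as the normal map output value, which is the duplication I mean.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SecondarySortSketch {

    // Hypothetical composite key: the natural key plus a copy of the value to sort on.
    public static final class CompositeKey {
        final String naturalKey;
        final int sortValue;

        public CompositeKey(String naturalKey, int sortValue) {
            this.naturalKey = naturalKey;
            this.sortValue = sortValue;
        }
    }

    // Stands in for the class given to setOutputKeyComparatorClass:
    // order by natural key first, then by the copied value.
    static final Comparator<CompositeKey> SORT_COMPARATOR =
            Comparator.comparing((CompositeKey k) -> k.naturalKey)
                      .thenComparingInt(k -> k.sortValue);

    // Stands in for the class given to setOutputValueGroupingComparator:
    // compare the natural key only, so one reduce call sees all of its values.
    static final Comparator<CompositeKey> GROUPING_COMPARATOR =
            Comparator.comparing((CompositeKey k) -> k.naturalKey);

    // Imitates the shuffle: sort the composite keys, then start a new group
    // whenever the grouping comparator reports a different natural key.
    public static Map<String, List<Integer>> shuffle(List<CompositeKey> mapOutput) {
        List<CompositeKey> sorted = new ArrayList<>(mapOutput);
        sorted.sort(SORT_COMPARATOR);

        Map<String, List<Integer>> groups = new LinkedHashMap<>();
        CompositeKey previous = null;
        for (CompositeKey key : sorted) {
            if (previous == null || GROUPING_COMPARATOR.compare(previous, key) != 0) {
                groups.put(key.naturalKey, new ArrayList<>());
            }
            groups.get(key.naturalKey).add(key.sortValue);
            previous = key;
        }
        return groups;
    }

    public static void main(String[] args) {
        List<CompositeKey> mapOutput = Arrays.asList(
                new CompositeKey("a", 3),
                new CompositeKey("b", 5),
                new CompositeKey("a", 1),
                new CompositeKey("b", 4),
                new CompositeKey("a", 2));
        // prints {a=[1, 2, 3], b=[4, 5]}
        System.out.println(shuffle(mapOutput));
    }
}
```

Each group reaches the "reducer" with its values already in order, but only because the value was copied into the key.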