On Mon, Dec 12, 2016 at 4:48 PM, Dain Sundstrom <d...@iq80.com> wrote:
> I meant that the sorting of the dictionary seems to be UTF-16 BE. Is that > not correct? I believe the sorting of the dictionary is UTF-8, because the red-black tree takes the binary representation from Hadoop's Text instead of Java's String. .. Owen