If I have the following simple set of data: NAME John NAME Jake NAME John NAME Mary
I want to end up with the following: NAME 3 I'm thinking that perhaps a HyperLogLog approach should work. See http://en.wikipedia.org/wiki/HyperLogLog for more information. Has anyone done this before in Accumulo?