Hi Christoph,

The only unit test which exhibits this problem is the one which runs the full MR job (testCanopyEuclideanMRJob()). This is darn hard to debug and is doubly baffling since all the vectors should be read from Writable format into new, distinct instances. If you have a small dataset which exhibits the problem while running the reference implementation it would be very nice if you could share it.

Jeff

Jeff Eastman wrote:
I added some test code to detect duplicate boundPoint entries and can duplicate the issue in a unit test. I will see what is happening and let you know.
Jeff

Reply via email to