This time it doesn’t seem to be related to registering class serializers. The Scala collections seem to work as well as the Java ones. It would still be nice to know when we have to add to that list in MahoutKryoRegistrator. When a job fails to serialize, the error message is not very helpful.
On Jul 29, 2014, at 9:10 AM, Pat Ferrel <p...@occamsmachete.com> wrote:
I need to sort each vector inside an rdd.map. The last time I added a collection class, Guava’s HashBiMap, I had to add it to the MahoutKryoRegistrator. This time, at first it wouldn't serialize when I used a Scala List[Vector.Element], and I can’t seem to add the Scala List to the MahoutKryoRegistrator because it doesn’t understand the class name. So I had to fall back to Java’s ArrayList, which doesn’t require registering for some reason. What are the rules for when, why, and what we need to register with the MahoutKryoRegistrator? Is there a problem with just registering the Scala collection library?
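
One possible reason the registrator does not recognize the class name for a Scala List: List is abstract, and the objects that actually get serialized are instances of its two concrete subclasses, whose JVM names are name-mangled. The sketch below only prints those runtime names using the Scala standard library. The names ListRuntimeClasses and runtimeClassName are made up for illustration and are not part of Mahout, Spark, or Kryo.

```scala
object ListRuntimeClasses {
  // Returns the JVM class name of a value's runtime class, which is the
  // class a class-based serializer would look up. (Hypothetical helper.)
  def runtimeClassName(value: AnyRef): String = value.getClass.getName

  def main(args: Array[String]): Unit = {
    val nonEmpty: List[Int] = List(1, 2, 3)
    val empty: List[Int] = Nil

    // A non-empty List is an instance of the cons cell class `::`.
    println(runtimeClassName(nonEmpty)) // scala.collection.immutable.$colon$colon

    // The empty List is the singleton object Nil.
    println(runtimeClassName(empty))    // scala.collection.immutable.Nil$

    // The static type List is a different, abstract class.
    println(classOf[List[_]].getName)   // scala.collection.immutable.List
  }
}
```

If this is the cause, registering the concrete classes, for example kryo.register(classOf[::[_]]) and kryo.register(Nil.getClass), may be what is needed rather than registering List itself. That is an assumption and has not been tried against MahoutKryoRegistrator.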