I need to do a sort each vector inside an rdd.map. The last time I added a collection class, Guava’s HashBiMap, I had to add it to the MahoutKryoRegisrator.
This time at first it wouldn't serialize when I used a Scala List[Vector.Element], but the problem is I can’t seem to add the Scala List to the MahoutKryoRegisrator because it doesn’t understand the classname. So I had to fall back to using Java’s ArrayList, which doesn’t require registering for some reason. What are the rules for when, why, and what we need to register with the MahoutKryoRegisrator? Is there a problem with just registering the Scala collection library?