This time it doesn’t seem to be related to registering class serializers. Seems 
like the Scala collections work as well as the Java ones. It would still be 
nice to know when we have to add to that list in MahoutKryoRegistrator. When a 
job fails to serialize, the message is not very helpful.


On Jul 29, 2014, at 9:10 AM, Pat Ferrel <p...@occamsmachete.com> wrote:

I need to sort each vector inside an rdd.map. The last time I added a 
collection class, Guava’s HashBiMap, I had to add it to the 
MahoutKryoRegistrator. 

At first it wouldn’t serialize when I used a Scala List[Vector.Element]. 
The problem is that I can’t seem to add the Scala List to the 
MahoutKryoRegistrator because it doesn’t understand the classname. So I had 
to fall back to using Java’s ArrayList, which doesn’t require registering for 
some reason.
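
One possible reason for the classname trouble, offered as a guess and not as a confirmed diagnosis: a Scala List value is never an instance of a class literally named List. A non-empty list is an instance of the cons cell class `::` and the empty list is the singleton `Nil`, and both have encoded names on the JVM. The sketch below uses only the Scala standard library to show those runtime classes. The object name ListRuntimeClasses is hypothetical and not part of Mahout.

```scala
// Hypothetical helper, not part of Mahout: inspects the concrete
// runtime classes that stand behind a Scala List.
object ListRuntimeClasses {
  // A non-empty List is an instance of the cons cell class `::`,
  // whose JVM name is encoded as $colon$colon.
  val consClass: Class[_] = List(1).getClass

  // The empty List is the singleton object Nil, whose JVM class is Nil$.
  val nilClass: Class[_] = Nil.getClass

  def main(args: Array[String]): Unit = {
    println(consClass.getName) // scala.collection.immutable.$colon$colon
    println(nilClass.getName)  // scala.collection.immutable.Nil$
  }
}
```

If this is indeed the cause, these Class objects are what would be passed to Kryo's register(Class) call inside the registrator, in place of a name looked up as a string. Whether that is enough for the Mahout job in question is an assumption that has not been tested here.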

What are the rules for when, why, and what we need to register with the 
MahoutKryoRegistrator? Is there a problem with just registering the Scala 
collection library?
