Hi Daniel, I also faced the same issue when using Naive Bayes classifier in MLLib. I was able to solve it by making all fields in the calling object either transient of serializable.
Spark will print which class's object it was not able to serialize, in the error message. that can give you a hint. Thanks, Jaideep On Mon, Jun 30, 2014 at 7:33 PM, Daniel Micol <dmi...@gmail.com> wrote: > Hello, > > I’m trying to use KMeans with MLLib but am getting a TaskNotSerializable > error. I’m using Spark 0.9.1 and invoking the KMeans.run method with k = 2 > and numPartitions = 200. Has anyone seen this error before and know what > could be the reason for this? > > Thanks, > > Daniel > -- _____________________________________________________________ The information contained in this communication is intended solely for the use of the individual or entity to whom it is addressed and others authorized to receive it. It may contain confidential or legally privileged information. If you are not the intended recipient you are hereby notified that any disclosure, copying, distribution or taking any action in reliance on the contents of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify us immediately by responding to this email and then delete it from your system. The firm is neither liable for the proper and complete transmission of the information contained in this communication nor for any delay in its receipt.