On Fri, Jun 5, 2015 at 2:05 PM Will Briggs <wrbri...@gmail.com> wrote:

> Your lambda expressions on the RDDs in the SecondRollup class are closing
> around the context, and Spark has special logic to ensure that all
> variables in a closure used on an RDD are Serializable - I hate linking to
> Quora, but there's a good explanation here:
> http://www.quora.com/What-does-Closure-cleaner-func-mean-in-Spark
>

Ah, I see!  So if I broke out the lambda expressions into a method on an
object it would prevent this issue.  Essentially, "don't use lambda
expressions when using threads".

Thanks again, I appreciate the help.

Reply via email to