Re: Broadcasting a set in PySpark

2014-07-18 Thread Josh Rosen
You have to use `myBroadcastVariable.value` to access the broadcasted value; see https://spark.apache.org/docs/latest/programming-guide.html#broadcast-variables On Fri, Jul 18, 2014 at 2:56 PM, Vedant Dhandhania ved...@retentionscience.com wrote: Hi All, I am trying to broadcast a set in a

Re: Broadcasting a set in PySpark

2014-07-18 Thread Vedant Dhandhania
Hi Josh, I did make that change, however I get this error now: 568.492: [GC [PSYoungGen: 1412948K-207017K(1465088K)] 4494287K-3471149K(4960384K), 0.1280200 secs] [Times: user=0.23 sys=0.63, real=0.13 secs] 568.642: [Full GCTraceback (most recent call last): File stdin, line 1, in module