One way I can see is to -
1. get rdd from your df
2. call rdd.zipWithIndex to get a new rdd
3. turn your new rdd to a new df
On Fri, May 29, 2015 at 5:43 AM, Cesar Flores ces...@gmail.com wrote:
Assuming that I have the next data frame:
flag | price
--
1
The reason it didn't work for you is that the function you registered with
someRdd.map will be running on the worker/executor side, not in your
driver's program. Then you need to be careful to not accidentally close
over some objects instantiated from your driver's program, like the log
object in