Re: PySpark Connection reset by peer: socket write error

2015-12-16 Thread Surendran Duraisamy
I came across this issue while running the program on my laptop with a small data set (around 3.5 MB). The code is straightforward:

    data = sc.textFile("inputfile.txt")
    mappedRdd = data.map(mapFunction).cache()
    model = ALS.train(mappedRdd, 10, 15)
    ...

mapFunction - is a simple map
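For anyone trying to reproduce this, here is a minimal sketch of the pipeline described above. The original mapFunction and the input format are not shown in the thread, so `parse_rating`, `train_als`, and the "user,product,rating" line layout are assumptions made only for illustration. The Spark import is deferred so the parser can be checked on its own without a Spark installation.

```python
def parse_rating(line):
    """Hypothetical stand-in for mapFunction.

    Assumes each input line looks like "user,product,rating";
    the real format is not given in the thread.
    """
    user, product, rating = line.strip().split(",")
    return int(user), int(product), float(rating)


def train_als(sc, path, rank=10, iterations=15):
    """Train an MLlib ALS model from a text file, given a SparkContext."""
    # Imported here so parse_rating can be used without pyspark installed.
    from pyspark.mllib.recommendation import ALS

    data = sc.textFile(path)
    # ALS.train accepts an RDD of (user, product, rating) tuples.
    ratings = data.map(parse_rating).cache()
    # Force evaluation first, so a parsing failure shows up before training.
    ratings.count()
    return ALS.train(ratings, rank, iterations)
```

Calling `train_als(sc, "inputfile.txt")` mirrors the `ALS.train(mappedRdd, 10, 15)` call in the message, with 10 as the rank and 15 as the number of iterations.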

Re: PySpark Connection reset by peer: socket write error

2015-12-16 Thread Vijay Gharge
Can you elaborate on your problem further? From the error, it looks like you are running on a cluster. Please also share the relevant code for better understanding.

On Wednesday 16 December 2015, Surendran Duraisamy wrote:
> Hi,
>
> I am running ALS to train a data set of around

PySpark Connection reset by peer: socket write error

2015-12-16 Thread Surendran Duraisamy
Hi, I am running ALS to train a data set of around 15 lines on my local machine. When I call train, I get the following exception.

    print mappedRDDs.count()  # this prints the correct RDD count
    model = ALS.train(mappedRDDs, 10, 15)

    15/12/16 18:43:18 ERROR PythonRDD: Python worker exited