java.lang.OutOfMemoryError: Java heap space during reduce operation

2014-10-20 Thread ayandas84
Hi, in a reduce operation I am trying to accumulate a list of SparseVectors. The code is given below: val WNode = trainingData.reduce{(node1:Node,node2:Node) => val wNode = new Node(num1,num2) wNode.WhatList ++= (node1.WList)
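
For readers following along, here is a minimal sketch of the "merge two nodes inside reduce" pattern, written against a plain Scala collection rather than an RDD. The `Node` class, its `wList` field, and the `merge` helper are hypothetical stand-ins, since the poster's own class is not shown. The point it illustrates is that the merged node holds every vector from both inputs, so the final result grows with the whole dataset. With Spark's `RDD.reduce` that final value is returned to the driver, which is one plausible place for heap pressure to show up.

```scala
import scala.collection.mutable.ListBuffer

// Hypothetical stand-in for the poster's Node class; only the list field matters here.
final class Node {
  val wList: ListBuffer[Vector[Double]] = ListBuffer.empty
}

object ReduceSketch {
  // Builds a fresh node that contains the vectors of both inputs.
  def merge(node1: Node, node2: Node): Node = {
    val wNode = new Node
    wNode.wList ++= node1.wList
    wNode.wList ++= node2.wList
    wNode
  }

  def main(args: Array[String]): Unit = {
    // Three single-vector nodes, reduced pairwise into one node.
    val nodes = Seq(1.0, 2.0, 3.0).map { x =>
      val n = new Node
      n.wList += Vector(x)
      n
    }
    val combined = nodes.reduce((a, b) => merge(a, b))
    println(combined.wList.size)
  }
}
```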

Re: Kyro deserialisation error

2014-09-12 Thread ayandas84
Hi, I am also facing the same problem. Has anyone found a solution yet? It just returns an unintelligible set of characters. Please help. Exception in thread "main" org.apache.spark.SparkException: Job aborted due to stage failure: Exception while deserializing and fetching task:

Partitioning a libsvm format file

2014-08-10 Thread ayandas84
Hi, I am using Spark with Scala to train a distributed SVM. For training the SVM I am using files in LIBSVM format. I want to partition a file into a fixed number of partitions, with each partition having an equal number of data points (assume that the number of data points in the file is exactly
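
A rough sketch of the equal-sized split, in plain Scala on an in-memory sequence of lines. It assumes, as a simplification of this sketch only, that the number of data points divides evenly by the number of partitions. `EqualPartitions`, `partitionOf`, and `split` are hypothetical names introduced for illustration. In Spark, one possible way to apply the same index-to-partition mapping would be `RDD.zipWithIndex` followed by `partitionBy` with a custom `Partitioner`, but that is not shown here.

```scala
object EqualPartitions {
  // Maps the 0-based index of a data point to a partition id.
  // Assumption: total is an exact multiple of numPartitions.
  def partitionOf(index: Long, total: Long, numPartitions: Int): Int = {
    require(numPartitions > 0, "numPartitions must be positive")
    require(total > 0 && total % numPartitions == 0, "total must be a positive multiple of numPartitions")
    require(index >= 0 && index < total, "index out of range")
    val perPartition = total / numPartitions
    (index / perPartition).toInt
  }

  // Splits the lines into numPartitions consecutive chunks of equal size.
  def split(lines: Seq[String], numPartitions: Int): Seq[Seq[String]] = {
    require(numPartitions > 0, "numPartitions must be positive")
    require(lines.size % numPartitions == 0, "line count must be a multiple of numPartitions")
    if (lines.isEmpty) Seq.fill(numPartitions)(Seq.empty[String])
    else lines.grouped(lines.size / numPartitions).toSeq
  }
}
```

For example, `EqualPartitions.split(lines, 4)` on a 100-line LIBSVM file would give four chunks of 25 lines each, in file order.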