Re: error on training with logistic regression sgd

2015-03-10 Thread Peng Xia
Hi, can anyone give me an idea about this? I did some Google searching, and it seems related to the 2 GB limit on block size: https://issues.apache.org/jira/browse/SPARK-1476. The whole process is: 1. load the data 2. convert each line of data into labeled points using some feature hashing
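
For readers unfamiliar with step 2, here is a minimal sketch of feature hashing in plain Python. It is not the poster's code: the input format (label first, then whitespace-separated tokens), the function names `hash_features` and `parse_line`, and the choice of CRC32 as the hash are all assumptions made for illustration.

```python
import zlib


def hash_features(tokens, num_features=2**10):
    """Map string tokens to a sparse {index: count} dict of fixed size.

    CRC32 is an assumed hash choice; any stable hash would do.
    """
    vec = {}
    for tok in tokens:
        # Reduce the hash to a bucket in [0, num_features).
        idx = zlib.crc32(tok.encode("utf-8")) % num_features
        # Colliding tokens simply add up in the same bucket.
        vec[idx] = vec.get(idx, 0.0) + 1.0
    return vec


def parse_line(line, num_features=2**10):
    """Hypothetical line format: '<label> <token> <token> ...'."""
    parts = line.split()
    label = float(parts[0])
    return label, hash_features(parts[1:], num_features)
```

In a Spark job, the returned pair would typically be wrapped in a labeled point with a sparse vector of size `num_features`. Raising `num_features` from 2**10 to 2**14 reduces collisions but widens every vector and the model's weight vector accordingly.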

error on training with logistic regression sgd

2015-03-09 Thread Peng Xia
Hi, I launched a Spark cluster with 4 worker nodes, each with 8 cores and 56 GB of RAM, and I was testing my logistic regression problem. The training set is around 1.2 million records. When I used 2**10 (1024) features, the whole program worked fine, but when I use 2**14