Re: Re: how to use DoubleRDDFunctions on mllib Vector?

2015-07-09 Thread 诺铁
OK, got it, thanks. On Thu, Jul 9, 2015 at 12:02 PM, prosp4300 prosp4...@163.com wrote: It seems that what Feynman mentioned is the source code rather than the documentation; vectorMean is private, see

Re: how to use DoubleRDDFunctions on mllib Vector?

2015-07-08 Thread Feynman Liang
An RDD[Double] is an abstraction for a large collection of doubles, possibly distributed across multiple nodes. The DoubleRDDFunctions are there for computing the mean and variance across this distributed dataset. In contrast, a Vector is not distributed and fits on your local machine.
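
Because a Vector is local, one way to get its mean and variance is to work on the plain Array[Double] returned by its toArray method. The following is only a minimal sketch under that assumption: the object name LocalVectorStats and its two helpers are hypothetical, not part of Spark. The variance here divides by n (population variance), which is what DoubleRDDFunctions.variance computes.

```scala
// Hypothetical helper, not part of Spark: local statistics over the values
// of a Vector, assuming they were extracted with v.toArray.
object LocalVectorStats {

  // Arithmetic mean of the values; undefined for an empty array.
  def mean(xs: Array[Double]): Double = {
    require(xs.nonEmpty, "mean of an empty array is undefined")
    xs.sum / xs.length
  }

  // Population variance (divides by n, not n - 1).
  def variance(xs: Array[Double]): Double = {
    val m = mean(xs)
    xs.map(x => (x - m) * (x - m)).sum / xs.length
  }
}

// Usage with Spark on the classpath (left as comments so the block
// compiles without Spark; sc is assumed to be an existing SparkContext):
//   import org.apache.spark.mllib.linalg.Vectors
//   val values = Vectors.dense(1.0, 2.0, 3.0, 4.0).toArray
//   LocalVectorStats.mean(values)        // local computation
//   sc.parallelize(values).mean()        // RDD[Double] route via DoubleRDDFunctions
```

Going through sc.parallelize does give access to DoubleRDDFunctions, but it schedules a distributed job for data that already fits in memory, so the local computation is usually the simpler choice for a single Vector.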

how to use DoubleRDDFunctions on mllib Vector?

2015-07-07 Thread 诺铁
Hi, there are some useful functions in DoubleRDDFunctions, e.g. mean and variance, which I can use if I have an RDD[Double]. Vector doesn't have such methods. How can I convert a Vector to an RDD[Double]? Or, better still, can I call mean directly on a Vector?