Re: Java Code for PCA

2013-04-16 Thread Chirag Lakhani
Thanks, the CLI code seems to help a great deal. I am still confused about the distributed row format. When I used the command line in Mahout, I had a sequence file of dense vectors and that seemed to be fine. Is it possible to use that as an input or do I need to take that file and make

Re: Java Code for PCA

2013-04-16 Thread Dmitriy Lyubimov
What difference do you see between a sequence file of dense vectors and a distributed row matrix? A DRM (at least for the purposes of SSVD/PCA) is a set of sequence files whose keys are any Writable and whose values are o.a.m...VectorWritable. There is no difference whether you use the embedded solver or the CLI; this definition
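To make the layout described above concrete, here is a minimal plain-Java model of it. It deliberately uses no Hadoop or Mahout classes: `DrmSketch`, `Row` and `toMatrix` are hypothetical names invented for this illustration, the generic key stands in for "any Writable", and the `double[]` stands in for the vector payload. It only shows that several part files of (key, dense vector) records together form one row matrix; it says nothing about how the solver itself treats the keys.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class DrmSketch {

    /** One record of a part file: an arbitrary key plus a dense row vector. */
    public static final class Row<K> {
        final K key;            // stands in for "any Writable" (assumption of this sketch)
        final double[] values;  // stands in for the vector value of the record

        public Row(K key, double[] values) {
            this.key = key;
            this.values = values;
        }
    }

    /**
     * Stacks the rows of all part files into one dense matrix, in file order.
     * This sketch does not interpret the keys; it only collects the vectors.
     */
    public static <K> double[][] toMatrix(List<List<Row<K>>> partFiles) {
        List<double[]> rows = new ArrayList<>();
        for (List<Row<K>> part : partFiles) {
            for (Row<K> row : part) {
                rows.add(row.values.clone());
            }
        }
        return rows.toArray(new double[0][]);
    }

    public static void main(String[] args) {
        // Two "part files" holding three rows of a 3 x 2 matrix between them.
        List<Row<String>> part0 = Arrays.asList(
                new Row<>("row-a", new double[] {1.0, 2.0}),
                new Row<>("row-b", new double[] {3.0, 4.0}));
        List<Row<String>> part1 = Arrays.asList(
                new Row<>("row-c", new double[] {5.0, 6.0}));

        double[][] matrix = toMatrix(Arrays.asList(part0, part1));
        System.out.println(matrix.length + " x " + matrix[0].length); // prints "3 x 2"
    }
}
```

In this model, a single sequence file of dense vectors is just the case with one part file, which is why it already has the same shape as the input described above.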

Java Code for PCA

2013-04-12 Thread Chirag Lakhani
I am having trouble understanding whether the following code is sufficient for running PCA. I have a sequence file of dense vectors that I am calling, and then I am trying to run the following code: SSVDSolver pcaFactory = new SSVDSolver(conf, new Path(vectorsFolder), new

Re: Java Code for PCA

2013-04-12 Thread Dmitriy Lyubimov
No, this is not right. I will explain later when I have a moment. On Apr 12, 2013 8:08 AM, Chirag Lakhani clakh...@zaloni.com wrote: I am having trouble understanding whether the following code is sufficient for running PCA. I have a sequence file of dense vectors that I am calling, and then I

Re: Java Code for PCA

2013-04-12 Thread Dmitriy Lyubimov
On Fri, Apr 12, 2013 at 8:42 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: No, this is not right. I will explain later when I have a moment. On Apr 12, 2013 8:08 AM, Chirag Lakhani clakh...@zaloni.com wrote: I am having trouble understanding whether the following code is sufficient for