Thanks, the CLI code seems to help a great deal. I am still confused
about the distributed row matrix format. When I used the command line in
Mahout, I had a sequence file of dense vectors, and that seemed to work fine.
Is it possible to use that as an input or do I need to take that file and
make
What difference do you see between a sequence file of dense vectors and a
distributed row matrix?
A DRM (at least for the purposes of SSVD/PCA) is a set of sequence files whose
keys are any Writable and whose values are o.a.m...VectorWritable. It makes no
difference whether you use the embedded solver or the CLI; this definition
I am having trouble understanding whether the following code is sufficient
for running PCA.
I have a sequence file of dense vectors that I am calling, and then I am
trying to run the following code:
SSVDSolver pcaFactory = new SSVDSolver(conf, new Path(vectorsFolder), new
No, this is not right.
I will explain later when I have a moment.
On Apr 12, 2013 8:08 AM, Chirag Lakhani clakh...@zaloni.com wrote:
I am having trouble understanding whether the following code is sufficient
for running PCA
I have a sequence file of dense vectors that I am calling and then I
On Fri, Apr 12, 2013 at 8:42 AM, Dmitriy Lyubimov dlie...@gmail.com wrote:
No, this is not right.
I will explain later when I have a moment.