Record for computing an SVD large sparse matrix

2012-10-28 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
Hi all, Could anyone tell me which is the record of computing an SVD of a large sparse matrix and where was achieved? Thanks and regards. Pedro.

RE: ArrayIndexOutOfBoundsException SparseMatrix

2012-09-10 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
a single thread? This last is probably the fastest of all of the options. On Sun, Sep 9, 2012 at 11:29 AM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Hi all, I'm trying to set all values of a SparseMatrix structure using multiple threads but I'm getting an error

ArrayIndexOutOfBoundsException SparseMatrix

2012-09-09 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
Hi all, I'm trying to set all values of a SparseMatrix structure using multiple threads but I'm getting an error of ArrayIndexOutBoundsException even when access indexes are correct. In fact, when I subtitude SparseMatrix structure for a double array I didn't get any error. Does any one have

RE: DistributedRowMatrix - FileNotFoundException

2012-03-08 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
. No, it doesn't accept some text-based format. On Wed, Mar 7, 2012 at 8:41 PM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Sorry but I can't understand how to do it. I have single separated-space text file with my input matrix. To run DistributedRowMatrix with that file I

RE: DistributedRowMatrix - FileNotFoundException

2012-03-07 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
an IntWritable, and that means it expects input as a sequence file, via SequenceFileInputFormat. On Tue, Mar 6, 2012 at 7:21 PM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Thanks for reply. I was doing something wrong. I have to convert my input file to a seqFile

RE: DistributedRowMatrix - FileNotFoundException

2012-03-06 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
with the DRM code or HDFS management. Try running it without HDFS or a Hadoop cluster, with local files and in pseudo-distributed mode. This way you can narrow the problem to one of the above. On Sat, Mar 3, 2012 at 10:13 AM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Hi

DistributedRowMatrix - FileNotFoundException

2012-03-03 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
Hi everyone! I'm trying to use DistributedRowMatrix in my class code but I'm getting the same error all the time: FileNotFoundException I have put a file in my hdfs directory under /user/hduser/diffuse. And I run the progam with diffuse as input and output directory. The code looks like:

RE: Lanczos Algorithm

2010-11-24 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
, 2010 at 11:55 AM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Well, this is in the worst case but it could be possible. I'm not going to make any tests with this amount of data because for me is impossible but this project is part of a bigger one and they would have

RE: Lanczos Algorithm

2010-11-23 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
. Are there many totally zero rows? Can you estimate how many non-zero elements you have in all? On Mon, Nov 22, 2010 at 1:07 PM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: Hi Ted, I can't give you an exact amount but more or less it could be around 10^5 non-zero elements per

RE: Lanczos Algorithm

2010-11-23 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
eigenvalues. One other thing you need to remember about the LanczosSolver: if your input matrix is symmetric, pass in the boolean isSymmetric=true to the solve() method, or else you'll get wrong values. -jake On Mon, Nov 22, 2010 at 1:07 PM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1

RE: Lanczos Algorithm

2010-11-22 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
: ted.dunn...@gmail.com Date: Fri, 19 Nov 2010 13:34:19 -0800 Subject: Re: Lanczos Algorithm To: user@mahout.apache.org How many non-zero elements? On Fri, Nov 19, 2010 at 12:34 PM, PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com wrote: I was talking about 10^9 rows and 10^9

Lanczos Algorithm

2010-11-19 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
Dear Mahout developers, I'm a Computer Science student from the National University of Distance Education in Spain. I'm currently developing my final year project which is about Diffusion Maps. This method is used for dimensionality reduction and it uses the Lanczos algorithm during its

RE: Lanczos Algorithm

2010-11-19 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
the results as you have, so i'm willing to read the answers to these questions :) Best. Fernando. 2010/11/19 PEDRO MANUEL JIMENEZ RODRIGUEZ pmjimenez1...@hotmail.com Dear Mahout developers, I'm a Computer Science student from the National University of Distance Education

RE: Lanczos Algorithm

2010-11-19 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
 I was talking about 10^9 rows and 10^9 columns From: ted.dunn...@gmail.com Date: Fri, 19 Nov 2010 12:07:16 -0800 Subject: Re: Lanczos Algorithm To: user@mahout.apache.org On Fri, Nov 19, 2010 at 11:17 AM, PEDRO MANUEL JIMENEZ RODRIGUEZ