Re: matrix-vector multiply in hadoop

2011-05-17 Thread Ted Dunning
Try using the Apache Mahout code that solves exactly this problem. Mahout has a distributed row-wise matrix that is read one row at a time. Dot products with the vector are computed and the results are collected. This capability is used extensively in the large scale SVD's in Mahout. On Tue, Ma

matrix-vector multiply in hadoop

2011-05-17 Thread Alexandra Anghelescu
Hi all, I was wondering how to go about doing a matrix-vector multiplication using hadoop. I have my matrix in one file and my vector in another. All the map tasks will need the vector file... basically they need to share it. Basically I want my map function to output key-value pairs (i,m[i,j]*v(