Basically, you can use Hama BSP for your PCA, and I'm sure that it'll be more efficient than map/reduce.
See http://incubator.apache.org/hama/docs/r0.2.0/index.html

However, we don't provide a complete PCA tool either, so you will have to implement your own PCA job. I can help you if you want.

Thanks.

On Sun, Apr 10, 2011 at 6:53 PM, Vincent Xue <[email protected]> wrote:
> Hello,
>
> Sorry for the new thread but my previous emails are being rejected by the
> server. I have looked at the Mahout project but I did not realize that PCA
> was implemented.
>
> I am looking at a matrix about 5000x5000 but my current implementation takes
> several hours. Also, it is very likely that my matrix will grow in size.
>
> Regards
> Vincent
>

--
Best Regards, Edward J. Yoon
http://blog.udanax.org
http://twitter.com/eddieyoon
