On Thu, Mar 24, 2011 at 11:03 PM, Dmitriy Lyubimov <dlie...@gmail.com>wrote:
> you can certainly try to write it out into a DRM (distributed row > matrix) and run stochastic SVD on hadoop (off the trunk now). see > MAHOUT-593. This is suitable if you have a good decay of singular > values (but if you don't it probably just means you have so much noise > that it masks the problem you are trying to solve in your data). > You don't need to run it as stochastic, either. The regular LanczosSolver will work on this data, if it lives as a DRM. -jake