Hi,

To take large matrices as input and to persist large models (such as factor models), I created an HBase-backed implementation of the Mahout Matrix.

It supports random access to cells and rows, assignment, and iteration over rows. viewRow returns a view that lazily loads the underlying data only when a get is actually invoked. I also plan to add a VectorInputFormat on top of it.

The parts of the code that our own algorithms depend on are tested, but other parts are not yet.

I am going to speak about this at HBaseCon, and I wanted to let you know that it can be contributed after some refactoring. Is there any interest?

-- Gokhan