2 options: 1- row index as the row key, column index as column identifier, and value as value 2- row index and column index combined as the row key, and value in a column called "value"
Row indices are kept in a member variable in memory, to make iteration fast. On Wed, May 8, 2013 at 12:11 AM, Ted Dunning <ted.dunn...@gmail.com> wrote: > How did you store the matrix in HBase? > > > On Tue, May 7, 2013 at 1:08 PM, Gokhan Capan <gkhn...@gmail.com> wrote: > > > Hi, > > > > For taking large matrices as input and persisting large models (like > factor > > models), I created an HBase-backed version of Mahout matrix. > > > > It allows random access to cells and rows as well as assignment, and > > iteration over rows. viewRow returns a view, and lazy loads actual data > if > > a get is actually invoked. > > > > I plan to add a VectorInputFormat on top of it, too. > > > > The code that we need to have for our algorithms is tested, but there are > > still parts of it that are not. > > > > I am going to speak about this at HBaseCon, and I wanted to let you know > > that it can be contributed after some refactoring. > > > > Is there any interest? > > > > -- > > Gokhan > > > -- Gokhan