[ 
https://issues.apache.org/jira/browse/MAHOUT-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13640268#comment-13640268
 ] 

Gokhan Capan commented on MAHOUT-1193:
--------------------------------------

Ok, here are the updates:

I modified the code a little (made it run and modified as I had commented 
previosly), and did some tests within the real application that I mentioned int 
the user list.

Performance of get and sets (bigger is better):
DenseMatrix > SparseMatrix (with dense rows) > BlockSparseMatrix > SparseMatrix 
(with sparse rows) > SparseColumnMatrix


Performance difference between SparseMatrix with dense rows and 
BlockSparseMatrix is small.

One drawback of SparseMatrix might be that you need to specify the rowSize in 
advance (which means you need to set a boundary for your row indices). This 
wasn't a problem for me, but it's worth mentioning. With this version of 
BlockSparseMatrix, there might also be a memory overhead depending on 
blockSize. 

I decided to go for SparseMatrix with dense rows for now, but I also work on 
BlockSparseMatrix code (thanks to the flexible schema).
                
> We may want a BlockSparseMatrix
> -------------------------------
>
>                 Key: MAHOUT-1193
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1193
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Ted Dunning
>         Attachments: MAHOUT-1193.patch
>
>
> Here is an implementation.
> Is it good enough to commit?
> Is it useful?
> Is it redundant?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to