[ 
https://issues.apache.org/jira/browse/MAHOUT-124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12744155#action_12744155
 ] 

Robin Anil commented on MAHOUT-124:
-----------------------------------

Ran FindBugs on the code.  Everything looks fine.

The in-memory Datastore reads the whole model from HDFS into memory. Had the
model been read directly from HDFS, we could have called it an HDFS Datastore.
Maybe a two-level (memory + HDFS) storage could be called an HDFS Datastore in
the future. Does that sound sane?
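
To make the two-level idea concrete, here is a rough Java sketch. It is not 
code from the patch: WeightStore, MapBackedStore and TwoLevelStore are 
hypothetical names, and the map-backed class only stands in for model data 
that would really be read from HDFS. Lookups go to a bounded in-memory LRU 
cache first and fall through to the backing store on a miss.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical minimal interface; not Mahout's actual Datastore API. */
interface WeightStore {
    /** Returns the weight for a (feature, label) pair, or 0.0 if absent. */
    double getWeight(String feature, String label);
}

/** Stand-in for a slow backing store such as model files on HDFS. */
class MapBackedStore implements WeightStore {
    private final Map<String, Double> weights = new HashMap<>();
    int reads = 0; // number of lookups that reached the backing store

    void put(String feature, String label, double weight) {
        weights.put(feature + "\t" + label, weight);
    }

    @Override
    public double getWeight(String feature, String label) {
        reads++;
        return weights.getOrDefault(feature + "\t" + label, 0.0);
    }
}

/** Two-level store: a bounded LRU memory cache in front of a backing store. */
class TwoLevelStore implements WeightStore {
    private final WeightStore backing;
    private final Map<String, Double> cache;

    TwoLevelStore(WeightStore backing, final int capacity) {
        this.backing = backing;
        // Access-ordered LinkedHashMap evicts the least recently used entry
        // once the cache grows past the requested capacity.
        this.cache = new LinkedHashMap<String, Double>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, Double> eldest) {
                return size() > capacity;
            }
        };
    }

    @Override
    public double getWeight(String feature, String label) {
        String key = feature + "\t" + label;
        Double cached = cache.get(key);
        if (cached != null) {
            return cached; // level 1: served from memory
        }
        double weight = backing.getWeight(feature, label); // level 2: backing store
        cache.put(key, weight);
        return weight;
    }
}
```

With a capacity large enough to hold the whole model this behaves like the 
current in-memory Datastore after warm-up; with a small capacity it trades 
memory for extra reads from the backing store.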

Could you try this new patch?  Also try it with HBase 0.20 RC1:
http://people.apache.org/~stack/hbase-0.20.0-candidate-1/

Scaling tests need to be done on Amazon EC2. 

Well, GSoC ends today, but Mahout-ing continues.


> Online Classification using HBase
> ---------------------------------
>
>                 Key: MAHOUT-124
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-124
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.2
>            Reporter: Robin Anil
>            Assignee: Isabel Drost
>             Fix For: 0.2
>
>         Attachments: MAHOUT-124-August-2.patch, MAHOUT-124-August17.patch, 
> MAHOUT-124-July-13.patch, MAHOUT-124-July-23.patch, MAHOUT-124-July-6.patch, 
> MAHOUT-124-June-23.patch
>
>
> #       Batch classification of flat file documents and flat file model:
> #       Storing the model in HBase at the end of the Model Building Map/Reduce 
> stages
> #       Using the model stored in HBase, create an interface (both command 
> line and web service) to classify a given document
> #       Using the model stored in HBase, batch classify documents stored on 
> the HDFS

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
