[ 
https://issues.apache.org/jira/browse/MAHOUT-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13608941#comment-13608941
 ] 

Benson Margulies commented on MAHOUT-1168:
------------------------------------------

I've revised the design of this a bit.

I've factored out an AbstractLuceneIterator, with an abstract method 
org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator#getVectorName, 
called at each iteration.

The plain LuceneIterator just fetches the id field and returns its value 
from this method.

Other subclasses can fetch more fields and provide accessors.
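
The shape of this design can be sketched roughly as follows. This is a 
minimal illustration, not the code in the attached patch: a document is 
modelled as a plain Map of field name to stored value instead of a Lucene 
Document, the iterator yields names rather than vectors, and the class 
names (AbstractDocIterator, PlainDocIterator, TitledDocIterator), the 
getVectorName signature, and the getCurrentTitle accessor are all 
hypothetical.

```java
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.NoSuchElementException;

// Hypothetical stand-in for AbstractLuceneIterator. Each "document" is a
// Map of field name -> stored value; the real class reads from a Lucene index.
abstract class AbstractDocIterator implements Iterator<String> {
  private final List<Map<String, String>> docs;
  private int position = 0;

  protected AbstractDocIterator(List<Map<String, String>> docs) {
    this.docs = docs;
  }

  // Called once per iteration; the subclass decides which fields to fetch.
  protected abstract String getVectorName(Map<String, String> doc);

  @Override
  public boolean hasNext() {
    return position < docs.size();
  }

  @Override
  public String next() {
    if (!hasNext()) {
      throw new NoSuchElementException();
    }
    return getVectorName(docs.get(position++));
  }
}

// Plain variant: fetches only the id field and returns its value.
class PlainDocIterator extends AbstractDocIterator {
  private final String idField;

  PlainDocIterator(List<Map<String, String>> docs, String idField) {
    super(docs);
    this.idField = idField;
  }

  @Override
  protected String getVectorName(Map<String, String> doc) {
    return doc.get(idField);
  }
}

// Richer variant: fetches an extra field and exposes it through an accessor.
class TitledDocIterator extends AbstractDocIterator {
  private final String idField;
  private final String titleField;
  private String currentTitle;

  TitledDocIterator(List<Map<String, String>> docs, String idField, String titleField) {
    super(docs);
    this.idField = idField;
    this.titleField = titleField;
  }

  @Override
  protected String getVectorName(Map<String, String> doc) {
    currentTitle = doc.get(titleField); // remembered for the accessor below
    return doc.get(idField);
  }

  // Title of the document most recently returned by next(), or null before that.
  public String getCurrentTitle() {
    return currentTitle;
  }
}
```

A caller that only needs names uses the plain variant; a vectorizer that 
wants more per-document information calls next() and then reads the 
accessor on the richer subclass.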

                
> Allow access to current Lucene document from LuceneIterator
> -----------------------------------------------------------
>
>                 Key: MAHOUT-1168
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1168
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Integration
>    Affects Versions: 0.8
>            Reporter: Benson Margulies
>            Assignee: Benson Margulies
>         Attachments: MAHOUT-1168.patch
>
>
> I propose to add a getCurrentDocument() API to the LuceneIterator to allow a 
> vectorizer to produce more information.
