[
https://issues.apache.org/jira/browse/MAHOUT-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13608941#comment-13608941
]
Benson Margulies commented on MAHOUT-1168:
------------------------------------------
I've revised the design of this a bit.
I've factored out an AbstractLuceneIterator, with an abstract method
org.apache.mahout.utils.vectors.lucene.AbstractLuceneIterator#getVectorName,
called at each iteration.
The plain LuceneIterator just arranges to fetch the id field, and return the
value from this method.
Other subclasses can fetch more fields and provide accessors.
> Allow access to current Lucene document from LuceneIterator
> -----------------------------------------------------------
>
> Key: MAHOUT-1168
> URL: https://issues.apache.org/jira/browse/MAHOUT-1168
> Project: Mahout
> Issue Type: Improvement
> Components: Integration
> Affects Versions: 0.8
> Reporter: Benson Margulies
> Assignee: Benson Margulies
> Attachments: MAHOUT-1168.patch
>
>
> I propose to add a getCurrentDocument() API to the LuceneIterator to allow a
> vectorizer to produce more information.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira