[
https://issues.apache.org/jira/browse/LUCENENET-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039433#comment-13039433
]
Robert Jordan commented on LUCENENET-417:
-----------------------------------------
BTW, Java Lucene does not have the ability to tokenize AND store from a reader:
http://lucene.apache.org/java/3_0_2/api/core/org/apache/lucene/document/Field.html
IMO, this doesn't belong in Lucene.Net. The Lucene core simply cannot tokenize
and store in one pass, so it's up to the application to deal with this issue.
> implement streams as field values
> ---------------------------------
>
> Key: LUCENENET-417
> URL: https://issues.apache.org/jira/browse/LUCENENET-417
> Project: Lucene.Net
> Issue Type: New Feature
> Components: Lucene.Net Core
> Reporter: Christopher Currens
> Attachments: BinaryStream.patch
>
>
> Adding binary values to a field is an expensive operation, as the whole
> binary data must be loaded into memory and then written to the index. Adding
> the ability to use a stream instead of a byte array could not only speed up
> the indexing process, but reducing the memory footprint as well.
> Java lucene has the ability to use a TextReader the both analyze and store
> text in the index. .NET lacks the ability to store the data in the index,
> due to the fact that .net TextReaders cannot seek or reset the position of
> the stream. This should be a feature added into Lucene.NET as well. My
> thoughts are to add another Field constructor, that is Field(string name,
> System.IO.Stream stream, System.Text.Encoding encoding), that will allow the
> text to be analyzed and stored into the index.
> Comments about this approach are greatly appreciated.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira