[jira] [Commented] (LUCENE-3638) IndexReader.document always return a doc with all the stored fields loaded. And this can be slow for the indexed document contain huge fields

peter chang (Commented) (JIRA) Sun, 11 Dec 2011 08:11:09 -0800

    [ 
https://issues.apache.org/jira/browse/LUCENE-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167126#comment-13167126
 ]


peter chang commented on LUCENE-3638:
-------------------------------------

1. i agree with robert, fieldsToAdd, fieldsToFilter something like this can be 
added for IR and IS.doc
2. yes, the offset info is specified topic related. it can be process in app 
level when process multi-bytes encoded languages such as Zh_CN. in this 
situation, the offset is just an estimation. 
                
> IndexReader.document always return a doc with all the stored fields loaded. 
> And this can be slow for the indexed document contain huge fields
> ---------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-3638
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3638
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: core/index, core/search
>    Affects Versions: 4.0
>         Environment: 64bit linux java 1.6
>            Reporter: peter chang
>            Priority: Minor
>              Labels: patch
>             Fix For: 4.0
>
>         Attachments: doc.fields.patch
>
>
> when generating digest for some documents with huge fields, it should be 
> unnecessary to load the field but just interesting part of the field with the 
> offset information. but indexreader always return the whole field content. 
> afterward, the customized storedfieldsreader will got a repeated loading

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (LUCENE-3638) IndexReader.document always return a doc with all the stored fields loaded. And this can be slow for the indexed document contain huge fields

Reply via email to