[ 
https://issues.apache.org/jira/browse/LUCENE-4258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13423258#comment-13423258
 ] 

Robert Muir commented on LUCENE-4258:
-------------------------------------

{quote}
Codecs didn't handle all segment files first... stored fields and such were 
added later. I do agree though that we should keep in mind the full range of 
scenarios.
{quote}

I don't think thats really comparable at all, for two reasons:
1. Codecs can be considered a "rote" refactoring of the XXXWriter in 3.x. I'm 
not trying to diminish the value but its just an introduced abstraction layer. 
Something like this is different in that its algorithmic.
2. The fact that Codecs only handled postings at first wasn't easy to fix after 
they were introduced as postings-only. Once they handled postings initially, 
this was a significant refactoring.

I'm not trying to pick on your proposal, I'm just saying there are things I 
don't like about the design.
* I think that updating individual terms is a fringe use-case, and not the 
major use case for incremental updates, which is to update the contents of one 
field, without reindexing the entire document. This was also noted by someone 
else on the discussion thread. This issue seems to be solely about supporting 
the 'tagging' use case, which is just one of many.
* I think requiring no positions, no frequencies, and no norms makes it even 
more fringe. This means its not really useful for any search purposes. And we 
are a search engine library.
* I think that negatives won't compress well, as in general compression 
algorithms for IR in the last years focus on positive integers.
* I think merging the postings will be slow: I don't like the tradeoff of 
slowing down searching so much for what I'm not even sure will be a significant 
speedup to indexing.

                
> Incremental Field Updates through Stacked Segments
> --------------------------------------------------
>
>                 Key: LUCENE-4258
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4258
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/index
>            Reporter: Sivan Yogev
>   Original Estimate: 2,520h
>  Remaining Estimate: 2,520h
>
> Shai and I would like to start working on the proposal to Incremental Field 
> Updates outlined here (http://markmail.org/message/zhrdxxpfk6qvdaex).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to