[ https://issues.apache.org/jira/browse/LUCENE-4203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13411640#comment-13411640 ]
Michael McCandless commented on LUCENE-4203: -------------------------------------------- OK I'll use AtomicReader, though it really must be a SegmentReader (we can cast it) since we need the SegmentInfoPerCommit instance from it. Hmm I suppose it could also be a CompositeReader (nrt DirectoryReader), and then the method would have to resolve to the AtomicReader (SegmentReader) / sub-docID. > Add IndexWriter.tryDeleteDocument, to delete by document id when possible > ------------------------------------------------------------------------- > > Key: LUCENE-4203 > URL: https://issues.apache.org/jira/browse/LUCENE-4203 > Project: Lucene - Java > Issue Type: Improvement > Components: core/index > Reporter: Michael McCandless > Attachments: LUCENE-4203.patch > > > Spinoff from LUCENE-4069. > In that use case, where the app needs to first lookup a document, then > call updateDocument, it's wasteful today because the relatively costly > lookup (by a primary key field, eg "id") is done twice. > But, since you already resolved the PK to docID on the first lookup, > it would be nice to then delete by that docID and then you can call > addDocument instead. > So I worked out a rough start at this, by adding > IndexWriter.tryDeleteDocument. It'd be a very expert API: it takes a > SegmentInfo (referencing the segment that contains the docID), and as > long as that segment hasn't yet been merged away, it will mark the > document for deletion and return true (success). If it has been > merged away it returns false and the app must then delete-by-term. It > only works if the writer is in NRT mode (ie you've opened an NRT > reader). > In LUCENE-4069 using tryDeleteDocument gave a ~20% net speedup. > I think tryDeleteDocument would also be useful when Solr "updates" a > document by loading all stored fields, changing them, and calling > updateDocument. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org