Re: Help with mass delete from large index

2006-02-15 Thread Michael D. Curtin
Chandramohan wrote: perform such a cull again, you might make several distinct indexes (one per day, per week, per whatever) during that reindexing so the next time will be much easier. How would you search and consolidate the results across multiple indexes? Hits from each index will have

Re: Help with mass delete from large index

2006-02-15 Thread Chandramohan
> perform such a cull again, you might make several > distinct indexes (one per > day, per week, per whatever) during that reindexing > so the next time will be > much easier. How would you search and consolidate the results across multiple indexes? Hits from each index will have independent sc

Re: Help with mass delete from large index

2006-02-14 Thread Greg Gershman
; From: Greg Gershman <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Mon 13 Feb 2006 09:47:04 AM EST > Subject: Help with mass delete from large index > > I'm trying to delete a large number of documents > (~15million) from a a large index (30+ million > do

Re: Help with mass delete from large index

2006-02-13 Thread Otis Gospodnetic
From: Greg Gershman <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Mon 13 Feb 2006 09:47:04 AM EST Subject: Help with mass delete from large index I'm trying to delete a large number of documents (~15million) from a a large index (30+ million documents). I've started wi

Re: Help with mass delete from large index

2006-02-13 Thread Chris Hostetter
: I can create a test case; should I include an index : along with it (it could be rather large)? the ideal test case creates the index in it's constructor or setUp method. since the index is going to be totally artificial, the data doesn't matter, just theterm you want to delete on (and they can

Re: Help with mass delete from large index

2006-02-13 Thread Greg Gershman
I can create a test case; should I include an index along with it (it could be rather large)? I'm running the deletion process again with the latest nightly build. So far I haven't seen any of the previous problems, so perhaps there is already a fix in place. Thanks! Greg --- Daniel Naber <[EM

Re: Help with mass delete from large index

2006-02-13 Thread Daniel Naber
On Montag 13 Februar 2006 19:42, Greg Gershman wrote: > I'm still wondering if anyone has any thoughts on the > NullPointerException and/or the delete/optimize > problems I'm having.  They seem to be very real > issues. I haven't seen this before (and don't remember anyone on the list mentioning

Re: Help with mass delete from large index

2006-02-13 Thread Greg Gershman
Thanks, that is the way things will be done in the future. I'm still wondering if anyone has any thoughts on the NullPointerException and/or the delete/optimize problems I'm having. They seem to be very real issues. Greg --- "Michael D. Curtin" <[EMAIL PROTECTED]> wrote: > Greg Gershman wrote:

Re: Help with mass delete from large index

2006-02-13 Thread Michael D. Curtin
Greg Gershman wrote: No problem; this is not meant to be a regular operation, rather it's a (hopefully) one-time thing till the index can be restructured. The data is chronological in nature, deleting everything before a specific point in time. The index is optimized, so is it possible to remo

Re: Help with mass delete from large index

2006-02-13 Thread Greg Gershman
No problem; this is not meant to be a regular operation, rather it's a (hopefully) one-time thing till the index can be restructured. The data is chronological in nature, deleting everything before a specific point in time. The index is optimized, so is it possible to remove specific files? I'm

Re: Help with mass delete from large index

2006-02-13 Thread Michael D. Curtin
Greg Gershman wrote: I'm trying to delete a large number of documents (~15million) from a a large index (30+ million documents). I've started with an optimized index, and a list of docIds (our own unique identifier for a document, not a Lucene doc number) to pass to the IndexReader.delete(Term

Help with mass delete from large index

2006-02-13 Thread Greg Gershman
I'm trying to delete a large number of documents (~15million) from a a large index (30+ million documents). I've started with an optimized index, and a list of docIds (our own unique identifier for a document, not a Lucene doc number) to pass to the IndexReader.delete(Term t) method. I've had a f