Lucene id generation

2006-12-11 Thread Waheed Mohammed
Hello, Is there a way to influence lucene's generation of ids while indexing. my requirement is. I want to have different indexes where no index should have ids that have been assigned to an index earlier. for instance IDX1 : {0.100} IDX2: {101...200} IDX3: {201...300} but not

Re: Lucene id generation

2006-12-11 Thread karl wettin
11 dec 2006 kl. 16.15 skrev Waheed Mohammed: Is there a way to influence lucene's generation of ids while indexing. If you speak of the Lucene "document number", then no. And are you aware of the fact that document numbers are eligable for change at any time (optimization) without giving

Re: Lucene id generation

2006-12-11 Thread Erick Erickson
I don't believe that this is possible. Or desirable. Lucene IDs are mutable, even within an index. That is, if you index docs that get, say, IDs 1, 2, 3, 4, 5 and delete doc 2 and optimize, Docs 4 and 5 get reassigned IDs 3 and 4 (or something similar). You're far better off controlling this your

Re: Lucene id generation

2006-12-11 Thread Find Me
On 12/11/06, Waheed Mohammed <[EMAIL PROTECTED]> wrote: Hello, Is there a way to influence lucene's generation of ids while indexing. my requirement is. I want to have different indexes where no index should have ids that have been assigned to an index earlier. for instance IDX1 : {0.1

RE: Lucene id generation

2006-12-11 Thread Ramana Jelda
ind Me [mailto:[EMAIL PROTECTED] > Sent: Monday, December 11, 2006 4:34 PM > To: java-user@lucene.apache.org > Subject: Re: Lucene id generation > > On 12/11/06, Waheed Mohammed <[EMAIL PROTECTED]> wrote: > > > > Hello, > > > > Is there a way to inf

RE: Lucene id generation

2006-12-11 Thread Chris Hostetter
t; -Original Message- : > From: Find Me [mailto:[EMAIL PROTECTED] : > Sent: Monday, December 11, 2006 4:34 PM : > To: java-user@lucene.apache.org : > Subject: Re: Lucene id generation : > : > On 12/11/06, Waheed Mohammed <[EMAIL PROTECTED]> wrote: : > > : >

Re: Lucene id generation

2006-12-12 Thread Waheed Mohammed
ustom document ids? > > > -Original Message- > > From: Find Me [mailto:[EMAIL PROTECTED] > > Sent: Monday, December 11, 2006 4:34 PM > > To: java-user@lucene.apache.org > > Subject: Re: Lucene id generation > > > > On 12/11/06, Waheed Mohammed <[EMAIL PROTE

Re: Lucene id generation

2006-12-14 Thread karl wettin
11 dec 2006 kl. 20.04 skrev Chris Hostetter: if you are trying to think of Lucene's docid as a meaningful number, you are doing something wrong. There is this one place where I use it. The index is add only, and the only data that interests me is the stored field MyID, also kept track i

Re: Lucene id generation

2006-12-14 Thread Chris Hostetter
Karl: it sounds like you are just refering to using the lucene docid as an array index for the FieldCache of your "MyID" field ... that's a perfectly valid use of the docid, the key being that you aren't expecting the id to contain any meaningful data itself -- it's just a refrence number. : > if

RE: Lucene id generation

2006-12-15 Thread Ramana Jelda
lto:[EMAIL PROTECTED] > Sent: Friday, December 15, 2006 6:35 AM > To: java-user@lucene.apache.org > Subject: Re: Lucene id generation > > > Karl: it sounds like you are just refering to using the > lucene docid as an array index for the FieldCache of your > "MyID&quo

RE: Lucene id generation

2006-12-16 Thread Chris Hostetter
Sent: Friday, December 15, 2006 6:35 AM : > To: java-user@lucene.apache.org : > Subject: Re: Lucene id generation : > : > : > Karl: it sounds like you are just refering to using the : > lucene docid as an array index for the FieldCache of your : > "MyID" field ... that&

RE: Lucene id generation

2006-12-19 Thread Antonio Bruno
To use but directly the docId would render efficient and fastest the searches much. Thoughts to the possibility of being able to apply a first CachingWrapperFilter F1 on an index and a second CachingWrapperFilter F2 on an other index and after to make (F1 AND F2) and to even extract the info of

Re: Lucene id generation

2006-12-19 Thread Erick Erickson
But you can do something very similar and very quickly using a unique ID (not the Lucene ID) that's shared across the indexes (assuming I'm reading your issue correctly). Then use TermDocs/TermEnum and create your filters that way. I predict endless problems with user (programmer) errors if Lucen

Re: Lucene id generation

2006-12-19 Thread Steven Rowe
Antonio Bruno wrote: > To use but directly the docId would render efficient and fastest the > searches much. Thoughts to the possibility of being able to apply a > first CachingWrapperFilter F1 on an index and a second > CachingWrapperFilter F2 on an other index and after to make (F1 AND > F2) and

I: Lucene id generation

2006-12-19 Thread Antonio Bruno
The problem in reality consists on the fact to have an only dictionary of the terms for all the fields. If the dictionary of the terms is the many large performances of a search they diminish, even if the search is made on a single term. Then it would be wanted to be able to index the fields of

Re: I: Lucene id generation

2006-12-19 Thread Erick Erickson
I see your point, but I have to ask whether this is a practical or a theoretical problem? If it's a practical one, perhaps you'd be willing to talk about the issue you're actually trying to solve and maybe we can come up with a solution within the current framework. I know others on the list have