Re: Getting a proper ID value into every document

2015-06-05 Thread Chris Hostetter
: If you cannot do this for whatever reason, I vaguely remember someone : posting a link to a program they'd put together to do this for a : docValues field, you'd have to search the archives to find it. It was Toke - he generated DocValues for an existing index by writing an IndexReader Filter

Re: Getting a proper ID value into every document

2015-06-05 Thread Erick Erickson
My first recommendation, of course, would be to re-index the corpus with a new field. If possible, frankly, that would probably be less effort than trying to hack in an ID after the fact as well as not as error-prone. If you cannot do this for whatever reason, I vaguely remember someone posting a

Getting a proper ID value into every document

2015-06-04 Thread Trejkaz
Hi all. We had been going for the longest time abusing Lucene's doc IDs as our own IDs and of course all our filters still work like this. But at the moment, we're looking at ways to break our dependencies on this. One of the motivators for this is the outright removal of FieldCache in Lucene 5.