RE: MultiFields#getTerms docs clarification

2016-08-30 Thread Uwe Schindler
Hi, if you have an untokenized StringField and index the "empty token" it will appear in the index. If you are reindexing by hand (parsing the stored fields of your 3.x index), I'd suggest to add some length==0 check before adding the field. With IndexUpgrader you cannot easily get rid of the

RE: New type of proximity/fuzzy search

2016-08-30 Thread Uwe Schindler
Hi, I think you can use SpanQueries to do this! See: https://lucene.apache.org/core/6_2_0/core/org/apache/lucene/search/spans/package-summary.html I have no closely looked into your problem, but you should be able to combine those queries. Alternatively subclass one of the classes to do additio

Re: MultiFields#getTerms docs clarification

2016-08-30 Thread Trejkaz
On Mon, Aug 29, 2016 at 8:23 PM, Michael McCandless wrote: > Seems like you need to scrutinize exactly what documents were indexed in step > 3? > > How exactly did you copy documents out of the old index? Note that > when Lucene's IndexReader returns a Document, it's not the same > Document that

New type of proximity/fuzzy search

2016-08-30 Thread Saar Carmi
Hi I will appreciate some guidance for implementing the following type of query. Given a set of search terms (t1, t2, t3, ti), return all documents where in a sequence of x=10 tokens at least c=3 of the search terms appear within the sequence So for example the following document matches the sear

Re: parent-child relationship in lucene - to avoid reindexing if parent information changes

2016-08-30 Thread Kumaran Ramasubramanian
Hi Ralph Thank you for the response.. yes, It is one of the work-around.. While searching, what you have suggested is costly and also it takes more time if number of groups is more (we can use query time join?? ).. Also, my second problem remains same.( adding a member to a group ). Becau

ApacheCon Seville CFP closes September 9th

2016-08-30 Thread Rich Bowen
It's traditional. We wait for the last minute to get our talk proposals in for conferences. Well, the last minute has arrived. The CFP for ApacheCon Seville closes on September 9th, which is less than 2 weeks away. It's time to get your talks in, so that we can make this the best ApacheCon yet. I

Re: parent-child relationship in lucene - to avoid reindexing if parent information changes

2016-08-30 Thread Ralph Soika
Hi, I think this is more a problem of the data model. You should not link a message to a group by the group name. Instead use a GroupID (which is unique) to refer to the group. The GroupID is a 'non-analyzed' and 'not-stored' field in your lucene document. Then, when you want to search for al