Re: offsets issues with multiword synonyms since LUCENE_33

2012-08-15 Thread Konrad Lötzsch
I don't know wether this was discussed previously, but if you tell the synonmyfilter to not break your synonyms (which might be the default). In this case, the parts of the synonyms get new word positions. So you could use a Keywordtokenizer to avoid that behaviour: with regards, kon

Re: offsets issues with multiword synonyms since LUCENE_33

2012-08-14 Thread Michael McCandless
See also SOLR-3390. Some cases have been addressed. Eg, if you match domain name system -> dns, then dns will have correct offsets spanning the full phrase "domain name system" in the input. (However: QueryParser won't work because a query for "domain name system" is pre-split on whitespace so t

Re: offsets issues with multiword synonyms since LUCENE_33

2012-08-14 Thread Marc Sturlese
Well an example would be: synonyms.txt: huge,big size The I have the docs: 1- The huge fox attacks first 2- The big size fox attacks first Then if I query for huge, the highlights for each document are: 1- The huge fox attacks first 2- The big size fox attacks first The analyzer looks like this

Re: offsets issues with multiword synonyms since LUCENE_33

2012-08-14 Thread Jack Krupansky
solr-user@lucene.apache.org Subject: offsets issues with multiword synonyms since LUCENE_33 Has someone noticed this problem and solved it somehow? (without using LUCENE_33 in the solrconfig.xml) https://issues.apache.org/jira/browse/LUCENE-3668 Thanks in advance -- View this message in cont

offsets issues with multiword synonyms since LUCENE_33

2012-08-14 Thread Marc Sturlese
Has someone noticed this problem and solved it somehow? (without using LUCENE_33 in the solrconfig.xml) https://issues.apache.org/jira/browse/LUCENE-3668 Thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/offsets-issues-with-multiword-synonyms-since-LUCENE-33