yep seems that’s the answer. The highlighting is done separately by the rails app, so I’ll look into proper solr highlighting.
thanks a lot for the use of your ears, much improved understanding! cheers, Alistair -- mov eax,1 mov ebx,0 int 80h On 16/06/2015 16:33, "Erick Erickson" <erickerick...@gmail.com> wrote: >Hmmm. First, highlighting should work here. If you have it configured >to work on the dc.description field. > >As to whether the phrase "management changes" is near enough, I >pretty much guarantee it is. This is where the admin/analysis page can >answer this type of question authoritatively since it's based exactly >on your particular analysis chain. > >Best, >Erick > >On Tue, Jun 16, 2015 at 8:25 AM, Alistair Young ><alistair.yo...@uhi.ac.uk> wrote: >> yes prolly not a bug. The highlighting is on but nothing is highlighted. >> Perhaps this text is triggering it? >> >> 'consider the impacts of land management changes’ >> >> that would seem reasonable. It’s not a direct match so no highlighting >> (the highlighting does work on a direct match) but 'management changes’ >> must be near enough ‘manage change’ to trigger a result. >> >> Alistair >> >> -- >> mov eax,1 >> mov ebx,0 >> int 80h >> >> >> >> >> On 16/06/2015 16:18, "Erick Erickson" <erickerick...@gmail.com> wrote: >> >>>I agree with Allesandro the behavior you're describing >>>is _not_ correct at all given your description. So either >>> >>>1> There's something "interesting" about your configuration >>> that doesn't seem important that you haven't told us, >>> although what it could be is a mystery to me too ;) >>> >>>2> it's matching on something else. Note that the >>> phrase has been stemmed, so something in there >>> besides management might stem to manag and/or >>> something other than changes might stem to chang >>> and the two of _them_ happen to be next to each >>> other. "are managers changing?" for instance. Or >>> even something less likely. Perhaps turn on >>> highlighting and see if it pops out? >>> >>> >>>3> you've uncovered a bug. Although I suspect others >>> would have reported it and the unit tests would have >>> barfed all over the place. >>> >>>One other thing you can do. Go to the admin/analysis >>>page and turn on the "verbose" check box. Put >>>management is undergoing many changes >>>in both the query and index boxes. The result (it's >>>kind of hard to read I'll admit) will include the position >>>of each token after all the analysis is done. Phrase >>>queries (without slop) should only be matching adjacent >>>positions. So the question is whether the position info >>>"looks correct".... >>> >>>Best, >>>Erick >>> >>>On Tue, Jun 16, 2015 at 4:40 AM, Alessandro Benedetti >>><benedetti.ale...@gmail.com> wrote: >>>> According to your debug you are using a default Lucene Query Parser. >>>> This surprise me as i would expect with that query a match with >>>>distance 0 >>>> between the 2 terms . >>>> >>>> Are you sure nothing else is that field that matches the phrase query >>>>? >>>> >>>> From the documentation >>>> >>>> "Lucene supports finding words are a within a specific distance away. >>>>To do >>>> a proximity search use the tilde, "~", symbol at the end of a Phrase. >>>>For >>>> example to search for a "apache" and "jakarta" within 10 words of each >>>> other in a document use the search: >>>> >>>> "jakarta apache"~10 " >>>> >>>> >>>> Cheers >>>> >>>> >>>> 2015-06-16 11:33 GMT+01:00 Alistair Young <alistair.yo...@uhi.ac.uk>: >>>> >>>>> it¹s a useful behaviour. I¹d just like to understand where it¹s >>>>>deciding >>>>> the document is relevant. debug output is: >>>>> >>>>> <lst name="debug"> >>>>> <str name="rawquerystring">dc.description:"manage change"</str> >>>>> <str name="querystring">dc.description:"manage change"</str> >>>>> <str name="parsedquery">PhraseQuery(dc.description:"manag >>>>>chang")</str> >>>>> <str name="parsedquery_toString">dc.description:"manag chang"</str> >>>>> <lst name="explain"> >>>>> <str name="tst:test"> >>>>> 1.2008798 = (MATCH) weight(dc.description:"manag chang" in 221) >>>>> [DefaultSimilarity], result of: >>>>> 1.2008798 = fieldWeight in 221, product of: >>>>> 1.0 = tf(freq=1.0), with freq of: >>>>> 1.0 = phraseFreq=1.0 >>>>> 9.6070385 = idf(), sum of: >>>>> 4.0365543 = idf(docFreq=101, maxDocs=2125) >>>>> 5.5704846 = idf(docFreq=21, maxDocs=2125) >>>>> 0.125 = fieldNorm(doc=221) >>>>> </str> >>>>> </lst> >>>>> <str name="QParser">LuceneQParser</str> >>>>> <lst name="timing"> >>>>> <double name="time">41.0</double> >>>>> <lst name="prepare"> >>>>> <double name="time">3.0</double> >>>>> <lst name="query"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="facet"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="mlt"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="highlight"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="stats"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="debug"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> </lst> >>>>> <lst name="process"> >>>>> <double name="time">35.0</double> >>>>> <lst name="query"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="facet"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="mlt"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="highlight"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="stats"> >>>>> <double name="time">0.0</double> >>>>> </lst> >>>>> <lst name="debug"> >>>>> <double name="time">35.0</double> >>>>> </lst> >>>>> </lst> >>>>> </lst> >>>>> </lst> >>>>> >>>>> >>>>> thanks, >>>>> >>>>> Alistair >>>>> >>>>> -- >>>>> mov eax,1 >>>>> mov ebx,0 >>>>> int 80h >>>>> >>>>> >>>>> >>>>> >>>>> On 16/06/2015 11:26, "Alessandro Benedetti" >>>>><benedetti.ale...@gmail.com> >>>>> wrote: >>>>> >>>>> >Can you show us how the query is parsed ? >>>>> >You didn't tell us nothing about the query parser you are using. >>>>> >Enable the debugQuery=true will show you how the query is parsed and >>>>>this >>>>> >will be quite useful for us. >>>>> > >>>>> > >>>>> >Cheers >>>>> > >>>>> >2015-06-16 11:22 GMT+01:00 Alistair Young >>>>><alistair.yo...@uhi.ac.uk>: >>>>> > >>>>> >> Hiya, >>>>> >> >>>>> >> I've been looking for documentation that would point to where I >>>>>could >>>>> >> modify or explain why 'near neighbours' are returned from a phrase >>>>> >>search. >>>>> >> If I search for: >>>>> >> >>>>> >> "manage change" >>>>> >> >>>>> >> I get back a document that contains "this will help in your >>>>>management >>>>> >>of >>>>> >> <lots more words...> changes". It's relevant but I'd like to >>>>>understand >>>>> >>why >>>>> >> solr is returning it. Is it a combination of fuzzy/slop? The >>>>>distance >>>>> >> between the two variations of the two words in the document is >>>>>quite >>>>> >>large. >>>>> >> >>>>> >> thanks, >>>>> >> >>>>> >> Alistair >>>>> >> >>>>> >> -- >>>>> >> mov eax,1 >>>>> >> mov ebx,0 >>>>> >> int 80h >>>>> >> >>>>> > >>>>> > >>>>> > >>>>> >-- >>>>> >-------------------------- >>>>> > >>>>> >Benedetti Alessandro >>>>> >Visiting card : http://about.me/alessandro_benedetti >>>>> > >>>>> >"Tyger, tyger burning bright >>>>> >In the forests of the night, >>>>> >What immortal hand or eye >>>>> >Could frame thy fearful symmetry?" >>>>> > >>>>> >William Blake - Songs of Experience -1794 England >>>>> >>>>> >>>> >>>> >>>> -- >>>> -------------------------- >>>> >>>> Benedetti Alessandro >>>> Visiting card : http://about.me/alessandro_benedetti >>>> >>>> "Tyger, tyger burning bright >>>> In the forests of the night, >>>> What immortal hand or eye >>>> Could frame thy fearful symmetry?" >>>> >>>> William Blake - Songs of Experience -1794 England >>