Glad to see you're using http://splainer.io! I recognize those explains!
(Let me know if you have any ideas, thoughts, questions, or criticisms; I
created the thing.)

Some thoughts:
- You might consider using ps2 or ps3 to add slop to the two-word and
three-word phrase searches. Slop loosens the positional requirement of a
phrase match. This would help get RER paired with Saint in your other
document, and effectively eliminate the coord, though at a lower score
(1 / position difference, IIRC).
- Have you tried sending "disableCoord" to Solr? I usually leave coord on,
as I consider it useful to bias towards more matches. But that option
exists.
- Using pf2 and pf3 together means that three-word phrase matches will get
counted twice: once as a three-word phrase match, and again as multiple
two-word phrase matches. I usually just stick with pf2.
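
To make the numbers concrete, here is a rough Python sketch that reproduces
the coord arithmetic from the explains quoted below and builds a query string
with ps2 set. It is only an illustration under some assumptions: the total of
3 pf2 clauses is my reading of coord(2/3) and coord(1/3), the slop value of 2
is arbitrary, the pf2 field boosts are copied from the explain output (with
the field name taken from the message text), and apply_coord and
build_params are hypothetical helper names, not Solr APIs. qf and the other
handler settings are assumed to be configured elsewhere.

```python
from urllib.parse import urlencode


def apply_coord(clause_scores, total_clauses):
    """Scale the sum of matching clause scores by matched/total (coord)."""
    matched = len(clause_scores)
    return sum(clause_scores) * matched / total_clauses


# Document 1 matches two of the three assumed pf2 clauses:
# "bzzzz saint" and "saint denis".
doc1_score = apply_coord([0.39459237, 0.39459237], 3)

# Document 2 matches only "saint denis".
doc2_score = apply_coord([0.39459237], 3)


def build_params(user_query, phrase_slop=2):
    """Build edismax parameters with slop on the two-word phrase boost.

    The slop value and the pf2 field list are assumptions for illustration.
    """
    return {
        "q": user_query,
        "defType": "edismax",
        "pf2": "name_sw^0.21 catchall^0.2",
        "ps2": str(phrase_slop),
        "debugQuery": "true",  # ask Solr for the explain output
    }


query_string = urlencode(build_params("bzzzz saint denis rer"))
print(doc1_score, doc2_score)
print(query_string)
```

The same "saint denis" clause score (0.39459237) ends up contributing twice
as much in the first document only because of the coord multiplier, which is
the difference Elisabeth is describing.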

Best!
-Doug

On Thu, Apr 28, 2016 at 11:32 AM elisabeth benoit <elisaelisael...@gmail.com>
wrote:

> Hello all,
>
> I am using Solr 4.10.1. I use edismax, with pf2 to boost documents that
> start with the query terms. I use a "starts with" token (bzzzz) that is
> automatically added at index time and added to the request at query time.
>
> I have a problem at this point.
>
> request is *q=bzzzz saint denis rer*
>
> the start with field is name_sw
>
> first document *name_sw: Saint-Denis-Université*
> second document *name_sw: RER Saint-Denis*
>
> So one will have the pf2 starts with boost and not the other. The problem
> is that it has an effect on the scoring of pf2 for all other words.
>
> In other words, my problem is that the proximity between "saint" and
> "denis" is not given the same score for those two documents.
>
> From what I get this is because of the coord scoring factor used for pf2.
>
> In explain output, for first document
>
> 0.52612317 Matches Punished by 0.6666667 (not all query terms matched)
>    0.78918475 Sum of the following:
>      0.39459237 names_sw:"bzzzz saint"^0.21
>
>      0.39459237 Dismax (take winner of below)
>        0.39459237 names_sw:"saint denis"^0.21
>
>        0.37580228 catchall:"saint den"^0.2
>
>
> *So here, matches punished by 0.66*, which corresponds to coord(2/3)
>
> and final score pf2 for proximity between saint and denis
>
> 0.263061593153079 names_sw:"saint denis"^0.21
>
>
> In explain output, for second document
>
>
>  0.13153079 Matches Punished by 0.33333334 (not all query terms matched)
>    0.39459237 Dismax (take winner of below)
>      0.39459237 names_sw:"saint denis"^0.21
>
>      0.37580228 catchall:"saint den"^0.2
>
>
> *So here matches punished by 0.33*, which corresponds to coord(1/3)
>
> and final score pf2 for proximity between saint and denis
>
> 0.1315307926306158 names_sw:"saint denis"^0.21
>
>
> I would like to deactivate coord for pf2 pf3. Does anyone know how I
> could do this?
>
>
> Best regards,
>
> Elisabeth
>
