Hi, I am new to edismax, and trying to migrate from dismax to edismax. For
the queries with
the explicitly quoted sub phrase query, it seems edismax will ignore the
quoted one, compared with 
dismax during the whole query phrase matching process (pf). Here is one
example:

For the same query: 2012 "japan airlines" flight status, dismax

http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=dismax
outputs:

<str name="parsedquery">+((DisjunctionMaxQuery((TTL:2012)~0.1)
DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
DisjunctionMaxQuery((TTL:flight)~0.1)
DisjunctionMaxQuery((TTL:status)~0.1))~3) DisjunctionMaxQuery((TTL:"2012
japan airlin flight status"~3)~0.1)</str>

The parsedquery has DisjunctionMaxQuery((TTL:"2012 japan airlin flight
status"~3)~0.1).

While edismax

http://localhost:8983/solr/select?q=2012+"japan+airlines"+flight+status&qf=TTL&pf=TTL&debugQuery=true&defType=edismax
outputs:

<str name="parsedquery">+((DisjunctionMaxQuery((TTL:2012)~0.1)
DisjunctionMaxQuery((TTL:"japan airlin"~3)~0.1)
DisjunctionMaxQuery((TTL:flight)~0.1)
DisjunctionMaxQuery((TTL:status)~0.1))~3) DisjunctionMaxQuery((TTL:"2012
flight status"~3)~0.1)</str>

The parsedquery has DisjunctionMaxQuery((TTL:"2012 flight status"~3)~0.1).
So it seems edismax
ignores "japan airlines" for the pf matching. This could cause some issues,
in that a document with 
exactly phrase "2012 japan airlines flight status" will have the same
relevancy score with another one with
two phrases "japan airlines", "2012 flight status" far away. 

Is this the intended behavior of edismax, or am I missing anything ?

Thanks a lot for the help !

David

--
View this message in context: 
http://lucene.472066.n3.nabble.com/edismax-ignores-the-quoted-sub-phrase-query-tp3601838p3601838.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to