Hi,

  I recently upgraded to 4.7, with the aim of replacing group queries with
CollapsingQParserPlugin. As I'm comparing results between the two APIs,
CollapsingQParserPlugin seems to be way off, in terms of relevancy and
result count. Here's an example :

*Group query*
http://test-dev.mydomain.com/solr/adskhelpportal?fq=language:(english)&wt=xml&rows=40&start=0&fq=(ContentGroup-local:"Learn
& Explore" OR ContentGroup-local:"Getting Started" OR
ContentGroup-local:"Troubleshooting")&fq=Product:"PRD"&q=line&sort=score
desc&group=true&group.field=ADSKDedup&group.ngroups=true&fl=title,ADSKDedup,score&debugQuery=true

/Top 4 results/
<lst name="grouped">
      <lst name="ADSKDedup">
         <int name="matches">14593</int>
         <int name="ngroups">*13648*</int>
         <arr name="groups">
            <lst>
               <str name="groupValue">fbfef4647e68c2300eba99028f2598a9</str>
               <result name="doclist" numFound="1" start="0">
                  <doc>
                     <str
name="ADSKDedup">fbfef4647e68c2300eba99028f2598a9</str>
                     <arr name="title">
                        <str>LINE</str>
                     </arr>
                     <float name="score">8.517085</float>
                  </doc>
               </result>
            </lst>
            <lst>
               <str
name="groupValue">GUID-E8C1190C-A26C-484C-ADDD-DDF81666F69F</str>
               <result name="doclist" numFound="3" start="0">
                  <doc>
                     <arr name="title">
                        <str>LINE (Command)</str>
                     </arr>
                     <str
name="ADSKDedup">GUID-E8C1190C-A26C-484C-ADDD-DDF81666F69F</str>
                  </doc>
               </result>
            </lst>
            <lst>
               <str
name="groupValue">GUID-695722CD-A131-48DB-9AB8-162F0832FE04</str>
               <result name="doclist" numFound="4" start="0">
                  <doc>
                     <str
name="ADSKDedup">GUID-695722CD-A131-48DB-9AB8-162F0832FE04</str>
                     <arr name="title">
                        <str>About Controlling Extension Lines</str>
                     </arr>
                     <float name="score">5.1433907</float>
                  </doc>
               </result>
            </lst>
            <lst>
               <str
name="groupValue">GUID-9084DAC2-D5B7-4727-A443-205007A79440</str>
               <result name="doclist" numFound="4" start="0">
                  <doc>
                     <arr name="title">
                        <str>About Controlling Dimension Lines</str>
                     </arr>
                     <str
name="ADSKDedup">GUID-9084DAC2-D5B7-4727-A443-205007A79440</str>
                     <float name="score">5.1361656</float>
                  </doc>
               </result>
            </lst>


*CollapsingQParserPlugin query*

http://test-dev.mydomain.com/solr/adskhelpportal?fq=language:(english)&wt=xml&rows=15&start=0&fq=(ContentGroup-local:"Learn
& Explore" OR ContentGroup-local:"Getting Started" OR
ContentGroup-local:"Troubleshooting")&fq=ProductLine:"PRD"&q=line&sort=score
desc&fq={!collapse field=ADSKDedup}&fl=title,ADSKDedup,score&debugQuery=true

/Top 4 results/
 <result name="response" numFound="27142" start="0" maxScore="8.517085">
      <doc>
         <str name="ADSKDedup">fbfef4647e68c2300eba99028f2598a9</str>
         <arr name="title">
            <str>LINE</str>
         </arr>
         <float name="score">8.517085</float>
      </doc>
      <doc>
         <str
name="ADSKDedup">GUID-57CDDB6C-B12B-46CE-B9C5-22EFC17258FF</str>
         <arr name="title">
            <str>To Draw Lines</str>
         </arr>
         <float name="score">6.276938</float>
      </doc>
      <doc>
         <arr name="title">
            <str>Draw Lines</str>
         </arr>
         <str name="ADSKDedup">98b4a0e39400f0a216ff51a89922ce82</str>
         <float name="score">6.224089</float>
      </doc>
      <doc>
         <str name="ADSKDedup">4e51abdc0e8d30e77069505d93c1d4d4</str>
         <arr name="title">
            <str>Lines Tab</str>
         </arr>
         <float name="score">6.210026</float>
      </doc>

As you can see, the results are completely off, except for the first one.
Moreover, the number of results returned are different as well. Group query
has 13648 results which CollapsingQParserPlugin returns 27142, almost twice
the size.

I'm little baffled why the two APIs are returning different results for the
same query. Are they fundamentally different ?

Any pointers will be appreciated.

-Thanks,
Shamik



--
View this message in context: 
http://lucene.472066.n3.nabble.com/CollapsingQParserPlugin-returning-different-result-set-tp4123716.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to