We used that syntax in 1.4.1 when Surround was not part of SOLR and has to
register it. Didn't know that it is now part of SOLR. Any ways this is a
red herring since I have totally removed Surround and the issue remains
there.

Below is the debug info when I give a simple phrase query having common
words with default Query Parser. What I don't understand is that why is it
including single tokens as well? I have also included the relevant config
part below.

"rawquerystring": "Contents:\"only be\"",
"querystring": "Contents:\"only be\"",
"parsedquery": "MultiPhraseQuery(Contents:\"(only only_be) be\")",
"parsedquery_toString": "Contents:\"(only only_be) be\"",

"QParser": "LuceneQParser",

=====

<fieldtype name="text" class="solr.TextField">
<analyzer>
 <tokenizer class="solr.StandardTokenizerFactory"/>
 <filter class="solr.StandardFilterFactory"/>
 <filter class="solr.LowerCaseFilterFactory"/>
 <filter class="solr.CommonGramsFilterFactory" words="commonwords.txt"
ignoreCase="true"/>
</analyzer>
</fieldtype>



On Mon, Dec 9, 2013 at 7:46 PM, Erik Hatcher <erik.hatc...@gmail.com> wrote:

> But again, as Ahmet mentioned… it doesn't look like the surround query
> parser is actually being used.   The debug output also mentioned the query
> parser used, but that part wasn't provided below.  One thing to note here,
> the surround query parser is not available in 1.4.1.   It also looks like
> you're surrounding your query with angle brackets, as it says query string
> is {!surround}<Contents:"only be">, which is not correct syntax.  And one
> of the most important things to note here is that the surround query parser
> does NOT use the analysis chain of the field, see <
> http://wiki.apache.org/solr/SurroundQueryParser#Limitations>.  In short,
> you're going to have to do some work to get common grams factored into a
> surround query (such as maybe calling to the analysis request hander to
> "parse" the query before sending it to the surround query parser).
>
>         Erik
>
>
> On Dec 9, 2013, at 9:36 AM, Salman Akram <
> salman.ak...@northbaysolutions.net> wrote:
>
> > Yup on debugging I found that its coming in Analyzer. We are using
> Standard
> > Analyzer. It seems to be a SOLR 4 issue with Common Grams. Not sure if
> its
> > a bug or I am missing some config.
> >
> >
> > On Mon, Dec 9, 2013 at 2:03 PM, Ahmet Arslan <iori...@yahoo.com> wrote:
> >
> >> Hi Salman,
> >> I am confused because with surround no analysis is applied at query
> time.
> >> I suspect that surround query parser is not kicking in. You should see
> >> SrndQuery or something like at parser query section.
> >>
> >>
> >>
> >> On Monday, December 9, 2013 6:24 AM, Salman Akram <
> >> salman.ak...@northbaysolutions.net> wrote:
> >>
> >> All,
> >>
> >> I posted this sub-issue with another issue few days back but maybe it
> was
> >> not obvious so posting it on a separate thread.
> >>
> >> We recently migrated to SOLR 4.6. We use Common Grams but queries with
> >> words in the CG list have slowed down. On debugging we found that for CG
> >> words the parser is adding individual tokens of those words in the query
> >> too which ends up slowing it. Below is an example:
> >>
> >> Query = "only be"
> >>
> >> Here is what debug shows. I have highlighted the red part which is
> >> different in both versions i.e. SOLR 4.6 is making it a multiphrasequery
> >> and adding individual tokens too. Can someone help?
> >>
> >> SOLR 4.6 (takes 20 secs)
> >> <str name="rawquerystring">{!surround}<Contents:"only be"></str>
> >> <str name="querystring">{!surround}<Contents:"only be"></str>
> >> <str name="parsedquery">MultiPhraseQuery(Contents:"(only only_be)
> >> be")</str>
> >> <str name="parsedquery_toString">Contents:"(only only_be) be"</str>
> >>
> >> SOLR 1.4.1 (takes 1 sec)
> >> <str name="rawquerystring">{!surround}<Contents:"only be"></str>
> >> <str name="querystring">{!surround}<Contents:"only be"></str>
> >> <str name="parsedquery">Contents:only_be</str>
> >> <str name="parsedquery_toString">Contents:only_be</str>--
> >>
> >>
> >> Regards,
> >>
> >> Salman Akram
> >>
> >
> >
> >
> > --
> > Regards,
> >
> > Salman Akram
>
>


-- 
Regards,

Salman Akram

Reply via email to