Re: Help with StopFilterFactory

Jack Krupansky Mon, 25 Aug 2014 05:59:34 -0700

Interesting. First, an apology for an error in my e-book - it says that theenablePositionIncrements parameter for the stop filter defaults to "false",but it actually defaults to "true". The question mark represents a "positionincrement". In your case you don't want position increments, so add theenablePositionIncrements="false" parameter to the stop filter, and be sureto reindex your data. The position increment leaves a "hole" where each stopword was removed. The question mark represents the hole. All bets are off asto what phrase query does when the phrase starts with a hole. I think thebasic idea is that there must be some term in the index at that positionthat can be "skipped".

This is actually a change in behavior, which occurred as a side effect ofLUCENE-4963 in 4.4. The default for enablePositionIncrements was false, butthat release changed it to true.

I suspect that I wrote that section of my e-book before 4.4 came out.Unfortunately, the change is not well documented - nothing in the Javadoc,and this is another example of where an underlying change in Lucene thatimpacts Solr users is not well highlighted for Solr users. Sorry about that.

In any case, try adding enablePositionIncrements="false", reindex, and seewhat happens.


-- Jack Krupansky

-----Original Message-----From: heaven

Sent: Monday, August 25, 2014 3:37 AM
To: solr-user@lucene.apache.org
Subject: Re: Help with StopFilterFactory

A valid search:
http://pastie.org/pastes/9500661/text?key=rgqj5ivlgsbk1jxsudx9za
An Invalid search:
http://pastie.org/pastes/9500662/text?key=b4zlh2oaxtikd8jvo5xaww

What weird I found is that the valid query has:
"parsedquery_toString": "+(url_words_ngram:\"twitter com zer0sleep\")"
And the invalid one has:
"parsedquery_toString": "+(url_words_ngram:\"? twitter com zer0sleep\")"

So "https" part was replaced with a "?".

--

View this message in context:http://lucene.472066.n3.nabble.com/Help-with-StopFilterFactory-tp4153839p4154957.htmlSent from the Solr - User mailing list archive at Nabble.com.

Re: Help with StopFilterFactory

Reply via email to