Re: struggling with solr.WordDelimiterFilterFactory and periods "." or dots

geeky2 Fri, 10 Feb 2012 11:38:58 -0800

hello,

>>
Or does your field in schema.xml have anything like
autoGeneratePhraseQueries="true" in it?
<<


there is no reference to this in our production schema.

this is extremely confusing.

i am not completely clear on the issue?

reviewing our previous messages - it looks like the data is being tokenized
correctly according to the analysis page and output from Luke.

it also looks like the definition of the field and field type is correct in
the schema.xml

it also looks like there is no errant data (quotes) being introduced in to
the query string submitted to solr:

example:

*http://hfsthssolr1.intra.searshc.com:8180/solrpartscat/core1/select?indent=on&version=2.2&q=itemNo%3ABP21UAA&fq=&start=0&rows=10&fl=*%2Cscore&qt=&wt=&debugQuery=on&explainOther=&hl.fl=*

*so - does the real issue reside in HOW the query is being contructed /
parsed ???

and if so - what drives this query to become a MultiPhraseQuery with
embedded quotes ????
*

<lst name="debug"><str name="rawquerystring">itemNo:BP21UAA
</str><str name="querystring">itemNo:BP21UAA
</str><str name="parsedquery">MultiPhraseQuery(itemNo:"bp 21 (uaa
bp21uaa)")</str><str name="parsedquery_toString">itemNo:"bp 21 (uaa
bp21uaa)"</str>

please note - i also mocked up a simple test on my personal linux box - just
using the solr 3.5 distro (we are using 3.3.0 on our production box under
centOS)

i was able to get a simple test to work and yes - my query does look
different

output from my simple mock up on my personal box:

*http://localhost:8983/solr/select?indent=on&version=2.2&q=manu%3ABP21UAA&fq=&start=0&rows=10&fl=*%2Cscore&qt=&wt=&debugQuery=on&explainOther=&hl.fl=*

<lst name="debug"><str name="rawquerystring">manu:BP21UAA</str><str
name="querystring">manu:BP21UAA</str><str name="parsedquery">manu:bp manu:21
manu:uaa manu:bp21uaa</str><str name="parsedquery_toString">manu:bp manu:21
manu:uaa manu:bp21uaa</str><lst name="explain">

schema.xml

<fieldType name="text_en_splitting" class="solr.TextField"
positionIncrementGap="100"><analyzer type="index"><tokenizer
class="solr.WhitespaceTokenizerFactory"/><filter
class="solr.StopFilterFactory" ignoreCase="true" words="stopwords_en.txt"
enablePositionIncrements="true"/><filter
class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="1" catenateNumbers="1"
catenateAll="1" splitOnCaseChange="1"/><filter
class="solr.LowerCaseFilterFactory"/><filter
class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/><filter
class="solr.PorterStemFilterFactory"/></analyzer><analyzer
type="query"><tokenizer class="solr.WhitespaceTokenizerFactory"/><filter
class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true"
expand="true"/><filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords_en.txt" enablePositionIncrements="true"/><filter
class="solr.WordDelimiterFilterFactory" generateWordParts="1"
generateNumberParts="1" catenateWords="0" catenateNumbers="0"
catenateAll="1" splitOnCaseChange="1"/><filter
class="solr.LowerCaseFilterFactory"/><filter
class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/><filter
class="solr.PorterStemFilterFactory"/></analyzer></fieldType>

<field name="manu" type="text_en_splitting" indexed="true" stored="true"
omitNorms="true"/>

any suggestions would be greatly appreciated.

mark




--
View this message in context: 
http://lucene.472066.n3.nabble.com/struggling-with-solr-WordDelimiterFilterFactory-and-periods-or-dots-tp3724822p3733486.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: struggling with solr.WordDelimiterFilterFactory and periods "." or dots

Reply via email to