Re: WordDelimiterFilterFactory + CamelCase query

Peter Karich Fri, 19 Nov 2010 04:26:29 -0800

 Hi,

the final solution is explained here in context:
http://mail-archives.apache.org/mod_mbox/lucene-dev/201011.mbox/%3caanlktimatgvplph_mgfbsughdoedc8tc2brrwxhid...@mail.gmail.com%3e


"

/If you are using Solr branch_3x or trunk, you can turn this off, by
setting autoGeneratePhraseQueries to false in the fieldType.
<fieldType name="text" class="solr.TextField"
positionIncrementGap="100" autoGeneratePhraseQueries="false">
By enabling this option, phrase queries are only created by the
queryparser when you enclose stuff in double quotes.

If you are using an older version of solr such as 1.4.x, then you can
only hack it, by adding a PositionFilterFactory to the end of your
query analyzer.
The downside to that approach (unfortunately the only approach, for
older versions) is that it completely disables phrasequeries across
the board for that field type./

"
So, it is not a bug of wdf.
Thanks to Robert!

Regards,
Peter.

 Hi,
I am going crazy but which config is necessary to include the missingdoc 2?
I have:
doc1 tw:aBc
doc2 tw:abc
Now a query "aBc" returns only doc 1 although when I try doc2 fromadmin/analysis.jsp
then the term text 'abc' of the index gets highlighted as intended.
I even indexed a simple example (no stopwords, no protwords, nosynonyms) via* andtried this with the normal and dismax handler but I cannot make itworking :-/
What have I misunderstood?

Regards,
Peter.


<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt"
generateWordParts="1" generateNumberParts="1"catenateAll="0" preserveOriginal="1"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SnowballPorterFilterFactory" language="English"protected="protwords.txt"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"ignoreCase="true" expand="true"/><filter class="solr.StopFilterFactory" ignoreCase="true"words="stopwords.txt" enablePositionIncrements="true" />
<filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt"
generateWordParts="1" generateNumberParts="1"catenateAll="0" preserveOriginal="1"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.SnowballPorterFilterFactory" language="English"protected="protwords.txt"/>
</analyzer>
</fieldType>
--
<field name="tw" type="text" indexed="true" stored="true"/>

*
books.csv:

id,tw
1,aBc
2,abc
curl http://localhost:8983/solr/update/csv?commit=true --data-binary@books.csv -H 'Content-type:text/plain; charset=utf-8'

Re: WordDelimiterFilterFactory + CamelCase query

Reply via email to