Hello everyone,

I'm quite new in using solr but I managed to create some indexes and I wrote 
some nice working PHP classes for adding, inserting, querying and deleting 
records. But when I perform a search there have been some crazy phenomenons. If 
a word is searched exactly in the right spelling, it was found (okay, I didn't 
except anything else). But when I searched the same correct spelled word in the 
fuzzy way (by adding ~0.5), it wasn't found any more. Hello?!

It costs some hours of today to find out the problem, it's the german snowball 
porter filter (both German and German2). When using the 
EnglishPorterFilterFactory everything is working well (but it isn't very 
commendable if nearly all contents are german I think) and if no porter filter 
is used the result is as expected, too.

Therefore my question to you: Does everyone know about this problem and is 
there a solution? My schema.xml looks as follows:
(...)
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <!-- in this example, we will only use synonyms at query time
        <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" 
ignoreCase="true" expand="false"/>
        -->
        <filter class="solr.StopFilterFactory" ignoreCase="true" 
words="stopwords.txt"/>
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" 
ignoreCase="true" expand="true"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" 
words="stopwords.txt"/>
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
      </analyzer>
    </fieldType>
(...)

I'm pretty happy now when I leave off work soon, but maybe there is an error on 
mine? Thank for any answer and have a nice evening now!

Best regards from Karlsruhe, Germany
Christian

Reply via email to