Did you reindex after you incorporated the ISOLatin... filter?
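
To illustrate why the reindex matters, here is a minimal Python sketch (not Solr code). It assumes a toy inverted index and uses Unicode NFD decomposition as a stand-in for the accent filter; the helper names fold_accents, build_index, and search are made up for this example. Documents indexed before the filter was added still have the accented token stored, so a folded query cannot match them until the index is rebuilt.

```python
import unicodedata


def fold_accents(token: str) -> str:
    """Strip combining marks after NFD decomposition.

    This is a stand-in for accent folding, not the filter's real mapping table.
    """
    decomposed = unicodedata.normalize("NFD", token)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def build_index(docs, analyze):
    """Map each analyzed token to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for token in text.split():
            index.setdefault(analyze(token), set()).add(doc_id)
    return index


def search(index, query, analyze):
    """Analyze the query term the same way and look it up in the index."""
    return index.get(analyze(query), set())


docs = {1: "tèst document", 2: "test document"}

# Index built before the filter was added: tokens are stored with accents intact.
old_index = build_index(docs, lambda token: token)
# Index rebuilt after the filter was added: tokens are stored folded.
new_index = build_index(docs, fold_accents)

# The query side folds accents in both cases; only the stored tokens differ.
stale_hits = search(old_index, "tèst", fold_accents)
fresh_hits = search(new_index, "tèst", fold_accents)

print("without reindex:", sorted(stale_hits))
print("after reindex:", sorted(fresh_hits))
```

In this toy setup the stale index misses the accented document, while the rebuilt index returns both.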

On Tue, Mar 17, 2009 at 8:40 AM, dabboo <ag...@sapient.com> wrote:

>
> This is the entry in schema.xml
>
>    <fieldType name="text" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
>      <analyzer type="index">
>        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        <!--tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory" /-->
>        <!-- in this example, we will only use synonyms at query time
>        <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
>        -->
>        <!-- Case insensitive stop word removal.
>             enablePositionIncrements=true ensures that a 'gap' is left to
>             allow for accurate phrase queries.
>        -->
>        <filter class="solr.StopFilterFactory"
>                ignoreCase="true"
>                words="stopwords.txt"
>                enablePositionIncrements="true"
>                />
>        <filter class="solr.ISOLatin1AccentFilterFactory"/>
>        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/>
>        <filter class="solr.LowerCaseFilterFactory"/>
>        <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/>
>        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
>        <analyzer class="org.apache.lucene.analysis.ru.RussianAnalyzer"/>
>      </analyzer>
>      <analyzer type="query">
>        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>        <filter class="solr.ISOLatin1AccentFilterFactory"/>
>        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
>        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
>        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/>
>        <filter class="solr.LowerCaseFilterFactory"/>
>        <filter class="solr.EnglishPorterFilterFactory" protected="protwords.txt"/>
>        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
>        <!--analyzer class="org.apache.lucene.analysis.ru.RussianAnalyzer"/-->
>        <filter class="solr.ShingleFilterFactory" outputUnigrams="true" outputUnigramIfNoNgram="true" maxShingleSize="99"/>
>      </analyzer>
>    </fieldType>
>
>
>
> dabboo wrote:
> >
> > I have also added this filter factory to my schema.xml, but it is still
> > not working. I am sorry, but I did not understand how to create the field
> > to handle the accents.
> >
> > Please help.
> >
> >
> > Grant Ingersoll-6 wrote:
> >>
> >> You will need to create a field that handles the accents in order to
> >> do this.  Start by looking at the ISOLatin1AccentFilter.
> >>
> >> -Grant
> >>
> >> On Mar 17, 2009, at 7:31 AM, dabboo wrote:
> >>
> >>>
> >>> Hi,
> >>>
> >>> I am searching with query strings that contain special characters such
> >>> as è. For example, if I search for tèst, it should return all the
> >>> results that contain tèst, test, etc. There are other special characters
> >>> as well.
> >>>
> >>> I have updated the server.xml file of my Tomcat server and set UTF-8 as
> >>> the encoding type in the server entry, but it is still not working.
> >>>
> >>> Please suggest.
> >>>
> >>> Thanks,
> >>> Amit Garg
> >>> --
> >>> View this message in context:
> >>> http://www.nabble.com/Special-Characters-search-in-solr-tp22557230p22557230.html
> >>> Sent from the Solr - User mailing list archive at Nabble.com.
> >>>
> >>
> >>
> >>
> >
> >
>
> --
> View this message in context:
> http://www.nabble.com/Special-Characters-search-in-solr-tp22557230p22558353.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>
>
