Did you reindex after you incorporated the ISOLatin... filter? On Tue, Mar 17, 2009 at 8:40 AM, dabboo <ag...@sapient.com> wrote:
> > This is the entry in schema.xml > > <fieldType name="text" class="solr.TextField" positionIncrementGap="100" > omitNorms="true"> > <analyzer type="index"> > <tokenizer class="solr.WhitespaceTokenizerFactory"/> > <!--tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory" /--> > <!-- in this example, we will only use synonyms at query time > <filter class="solr.SynonymFilterFactory" > synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/> > --> > <!-- Case insensitive stop word removal. > enablePositionIncrements=true ensures that a 'gap' is left to > allow for accurate phrase queries. > --> > <filter class="solr.StopFilterFactory" > ignoreCase="true" > words="stopwords.txt" > enablePositionIncrements="true" > /> > <filter class="solr.ISOLatin1AccentFilterFactory"/> > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="1" generateNumberParts="1" catenateWords="1" > catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"/> > <filter class="solr.LowerCaseFilterFactory"/> > <filter class="solr.EnglishPorterFilterFactory" > protected="protwords.txt"/> > <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> > <analyzer class="org.apache.lucene.analysis.ru.RussianAnalyzer"/> > > </analyzer> > <analyzer type="query"> > <tokenizer class="solr.WhitespaceTokenizerFactory"/> > <filter class="solr.ISOLatin1AccentFilterFactory"/> > <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" > ignoreCase="true" expand="true"/> > <filter class="solr.StopFilterFactory" ignoreCase="true" > words="stopwords.txt"/> > <filter class="solr.WordDelimiterFilterFactory" > generateWordParts="1" generateNumberParts="1" catenateWords="0" > catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"/> > <filter class="solr.LowerCaseFilterFactory"/> > <filter class="solr.EnglishPorterFilterFactory" > protected="protwords.txt"/> > <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> > <!--analyzer > class="org.apache.lucene.analysis.ru.RussianAnalyzer"/--> > <filter class="solr.ShingleFilterFactory" outputUnigrams="true" > outputUnigramIfNoNgram="true" maxShingleSize="99"/> > > > </analyzer> > </fieldType> > > > > dabboo wrote: > > > > I have added this filter factory in my schema.xml also but still that is > > not working. I am sorry but I didnt get as how to create the field to > > handle the accents. > > > > Please help. > > > > > > Grant Ingersoll-6 wrote: > >> > >> You will need to create a field that handles the accents in order to > >> do this. Start by looking at the ISOLatin1AccentFilter. > >> > >> -Grant > >> > >> On Mar 17, 2009, at 7:31 AM, dabboo wrote: > >> > >>> > >>> Hi, > >>> > >>> I am searching with any query string, which contains special > >>> characters like > >>> è in it. for e.g. If I search for tèst then it shud return all the > >>> results > >>> which contains tèst and test etc. There are other special characters > >>> also. > >>> > >>> I have updated my server.xml file of tomcat server and included > >>> UTF-8 as > >>> encoding type in the server entry but still it is not working. > >>> > >>> Please suggest. > >>> > >>> Thanks, > >>> Amit Garg > >>> -- > >>> View this message in context: > >>> > http://www.nabble.com/Special-Characters-search-in-solr-tp22557230p22557230.html > >>> Sent from the Solr - User mailing list archive at Nabble.com. > >>> > >> > >> > >> > > > > > > -- > View this message in context: > http://www.nabble.com/Special-Characters-search-in-solr-tp22557230p22558353.html > Sent from the Solr - User mailing list archive at Nabble.com. > >