Re: Folding of accented to non-accented *only* — leaving symbols

2014-10-13 Thread Alexandre Rafalovitch
You are probably looking for ICU Folding which is part of ICU plugin: https://github.com/elasticsearch/elasticsearch-analysis-icu . It's not explained in details on that page, but you can see a long list of normalizations from the Lucene's Javadoc: http://www.solr-start.com/javadoc/solr-lucene/org/

Folding of accented to non-accented *only* — leaving symbols

2014-10-13 Thread Lee Gee
I now the asciifolding filter docs are really very clear on this, but it took me an embarrassingly long time to realise I was losing my currency symbol (£) to the ASCII folding filter. Other than creating my own character map with the char map filter, does there exist something of production qu