In Unicode, uppercasing characters loses information, because there are some upper case characters that represent more than one lower case character.
Lower casing text is safe, so always lower-case. wunder On May 18, 2012, at 10:41 AM, srinir wrote: > I am wondering why solr doesnt have an uppercase filter. I want the analyzed > output to be in upper case to be compatible with legacy data. Will there be > any problem if i create my own uppercase filter and use it ? >