ASCIIFoldingFilter

Julian Motz Thu, 17 Nov 2016 08:52:57 -0800

Hello together,

We're currently discussing<https://github.com/diacritics/database/issues/1> about the usage of theASCIIFoldingFilter<https://lucene.apache.org/core/3_6_2/api/core/org/apache/lucene/analysis/ASCIIFoldingFilter.html>class in our diacritics project. This project will be about mappingdiacritics (e.g. "ü") to their associated ASCII characters (e.g. "u" and"ue" in this example).

The ASCIIFoldingFilter class would help us a lot and therefore I hopeyou can answer the following questions we have:


1. The ASCIIFoldingFilter class only maps diacritics to their ASCII
   base characters, e.g. "ü" => "u", not to their ASCII characters in
   the associated language, e.g. "ue" in this case. Why have you
   excluded these mappings in the class?
2. Does anyone of you know if German and Norwegian are the only
   languages that have such language specific mappings (e.g. "ü" =>
   "ue" instead of "ü" => "u")?
3. Can we use the data in your class (Apache license) in our project
   (MIT license) with naming the copyright inside the file in our
   repository, but not in the end product that users will have when
   using our project? The project would use the data in the build to
   generate a JSON file.

Thanks in advance.

Cheers,
Julian

ASCIIFoldingFilter

Reply via email to