Hi,

I have ported your Stempel stemmer [1] for the Polish language from Java
to Python [2]. I know you also have a Python wrapper for Lucene
(PyLucene), so I was curious whether you would be interested in a native
implementation of a single stemmer.

It has the same accuracy as the original version and only slightly
better performance than the wrapped version (compared with pyjini),
but it uses only one language (no need to switch between languages when
debugging), which was quite important in my NLP project. I understand,
though, that it introduces the need to maintain two code bases.

Regards,
Maciej Gawinecki



[1]: https://github.com/apache/lucene-solr/tree/master/lucene/analysis/stempel/src/java/org
[2]: https://github.com/dzieciou/pystempel/tree/feature/1
