Hi,
in general we are open to include external stemmers into the Lucene code
base (maybe only the adaptor), but the external depenencies should be
kept minimal (not including a ton of utility libraries like Apache
commons-foobar added as transitive dependencies). As far as I
understand, the stemmer classes are not "ready-to use", so to actually
include the filter into your analysis chain you need to add those data
files possible created with the Radixor framework, correct? If we need
to shit with many data files for all those languages, we may need to
think about a way to regenerate them in our code and ship them to have
ready-to-use stemmers for supported languages.
Small comment about the Lucene code: It has a services file for the
factory, but no module-info.java class file yet. It would be great to
add a module descriptor - also for the Radixor framework, too, because
Lucene is module conformant. This will be mandatory when it needs to be
included into Lucene.
Uwe
Am 19.04.2026 um 23:30 schrieb Leo Galambos:
Hello Lucene developers,
I would like to share a new external project that may be relevant to
the Lucene analysis community: Radixor-Lucene.
Radixor-Lucene is an adapter for using the Radixor stemmer in Apache
Lucene 10. Radixor builds on ideas from the historical Egothor
stemming line, with a focused implementation aimed at compact compiled
runtime data, deterministic lookup, and practical extension with
domain-specific vocabulary.
It may be relevant in cases where one would otherwise consider
Stempel-style stemming or Porter-family stemmers. It is a stemming
toolkit, not a full morphological analyzer.
Project links:
*
Radixor: https://leogalambos.github.io/Radixor/
*
Radixor-Lucene: https://github.com/leogalambos/Radixor-Lucene
I would be interested in feedback on whether this is best kept purely
as an external adapter, or whether the Lucene community would see
value in it as part of a broader modernization path for the
Stempel-style stemming space.
Best regards,
Leo Galambos
--
Uwe Schindler
Achterdiek 19, D-28357 Bremen
https://www.thetaphi.de
eMail:[email protected]