Hello Everyone,

 

stemming german words is not very accurate with the stemmers in Lucene. This
is because the german language is complicated and so many special cases cant
be put into a algorithm. So I have decided to add a lookup map to a stemmer,
that contains stems for the most frequent german words, where the stemmer
does not work correct. I have searched the web and could not find a word
list for german stems (or you have to pay for it EUR 10.000 or more). I have
found a nice list with english stems
(http://www.comp.lancs.ac.uk/computing/research/stemming/Links/resources.htm
), but not for german. Does anybody know where I can find a list like this
or do I have to do all the work on my own???

 

Greetings,

Jan Philipp Seng

Reply via email to