Richard Zowalla created OPENNLP-1523:
----------------------------------------

             Summary: Use the snowball-data set to write language-specific 
stemmer eval tests
                 Key: OPENNLP-1523
                 URL: https://issues.apache.org/jira/browse/OPENNLP-1523
             Project: OpenNLP
          Issue Type: Improvement
          Components: Stemmer
    Affects Versions: 2.3.1
            Reporter: Richard Zowalla


Investigate on the possibility to re-use  
https://github.com/snowballstem/snowball-data/tree/master in our eval data to 
run it against our stemmers to see how good they behave for certain languages.

It contains of two files "vocab" (to be stemmed) and "output" (expected)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to