The Apache OpenNLP team is pleased to announce the release of version 2.1.0 of Apache OpenNLP. The Apache OpenNLP library is a machine learning-based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, and parsing.

The OpenNLP 2.1.0 binary and source distributions are available for download from our download page:
https://opennlp.apache.org/download.html

The OpenNLP library is also distributed through Maven Central. See the Maven Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html

Changes in this version:

- Update language codes in documentation
- Enable optional GPU inference in ONNX Runtime configuration
- Allow for unlimited text length in document classification with ONNX Runtime
- Fix alphaNumOpt in tokenizer example
- Fix training of MaxEnt models with large corpora failing with java.io.UTFDataFormatException
- Make parameter names in the params file case-insensitive
- Upgrade JUnit to version 5

For a complete list of fixed bugs and improvements, please see the announcement page:
https://opennlp.apache.org/news/release-210.html

The Apache OpenNLP Team