+1 On Wed, Oct 15, 2025 at 3:35 AM Martin Wiesner <[email protected]> wrote:
> Hi all, > > I’m fine with Richard’s proposal. > In that context, I’ve edited the planned release(s) in JIRA, see: > https://issues.apache.org/jira/projects/OPENNLP/versions/12356312 > > Cheers > Martin > > > Am 15.10.2025 um 08:26 schrieb Richard Zowalla <[email protected]>: > > > > > > Hi, > > I’d like to propose releasing OpenNLP 2.5.6.1 to address a regression in > the Sentence Detector introduced in 2.5.6. > > When an abbreviation appears at the beginning of a sentence, > SentenceDetectorME in OpenNLP 2.5.6 throws a > java.lang.StringIndexOutOfBoundsException. > > I've fixed that issue on main / opennlp-2.x > > A practical case where this occurs is when using ICD-10 codes or other > abbreviations at the start of a sentence (e.g. in a medical text). > > This currently breaks sentence detection for affected users if an > abbreviation dictionary is used. > > I would therefore propose to release a 2.5.6.1 patch version containing > the fix for this issue (already addressed with the overlap handling > improvement). > > The change is small, localized, and it is IMHO important to restore > sentence detection with abbreviation support for users. > > Unless there are objections, I would like to prepare a RC soon so users > can upgrade safely without waiting for 2.5.7. > > WDYT? > > Gruß > > Richard > >
