+1

On Wed, Oct 15, 2025 at 3:35 AM Martin Wiesner <[email protected]> wrote:

> Hi all,
>
> I’m fine with Richard’s proposal.
> In that context, I’ve edited the planned release(s) in JIRA, see:
> https://issues.apache.org/jira/projects/OPENNLP/versions/12356312
>
> Cheers
> Martin
>
> > Am 15.10.2025 um 08:26 schrieb Richard Zowalla <[email protected]>:
> >
> >
> > Hi,
> > I’d like to propose releasing OpenNLP 2.5.6.1 to address a regression in
> the Sentence Detector introduced in 2.5.6.
> > When an abbreviation appears at the beginning of a sentence,
> SentenceDetectorME in OpenNLP 2.5.6 throws a
> java.lang.StringIndexOutOfBoundsException.
> > I've fixed that issue on main / opennlp-2.x
> > A practical case where this occurs is when using ICD-10 codes or other
> abbreviations at the start of a sentence (e.g. in a medical text).
> > This currently breaks sentence detection for affected users if an
> abbreviation dictionary is used.
> > I would therefore propose to release a 2.5.6.1 patch version containing
> the fix for this issue (already addressed with the overlap handling
> improvement).
> > The change is small, localized, and it is IMHO important to restore
> sentence detection with abbreviation support for users.
> > Unless there are objections, I would like to prepare a RC soon so users
> can upgrade safely without waiting for 2.5.7.
> > WDYT?
> > Gruß
> > Richard
>
>

Reply via email to