[jira] [Resolved] (OPENNLP-1745) SentenceDetector - Add documentation and tests for useTokenEnd = false

Martin Wiesner (Jira) Tue, 24 Jun 2025 23:05:09 -0700


     [ 
https://issues.apache.org/jira/browse/OPENNLP-1745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Martin Wiesner resolved OPENNLP-1745.
-------------------------------------
    Resolution: Fixed

> SentenceDetector - Add documentation and tests for useTokenEnd = false
> ----------------------------------------------------------------------
>
>                 Key: OPENNLP-1745
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1745
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Sentence Detector
>    Affects Versions: 2.5.4
>            Reporter: Nishant Shrivastava
>            Assignee: Nishant Shrivastava
>            Priority: Trivial
>             Fix For: 3.0.0
>
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> SentenceDetector can be configured with useTokenEnd = false.
> This allows appropriate identification of the first character of the next 
> sentence for cases where the new sentence doesn't start with a blank space 
> (post the end of the previous sentence).
> e.g. German Text - 
> *Input :*
> "Träume sind eine Verbindung von Gedanken.Verschiedene Gedanken sind während 
> der Traumformation aktiv."
> *Actual Output from SentenceDetectorTool (setting useTokenEnd = false):*
>   Sentence-1 : Träume sind eine Verbindung von Gedanken.
>   Sentence-2 : Verschiedene Gedanken sind während der Traumformation aktiv.
>  
> It will be useful to add tests and documentation for the correct usage of 
> 'useTokenEnd' property.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Resolved] (OPENNLP-1745) SentenceDetector - Add documentation and tests for useTokenEnd = false

Reply via email to