-------- Original Message --------
Subject:        Re: Asian Sentence Detector Models
Date:   Thu, 22 Mar 2012 20:12:09 +0900
From:   wl.gao.tkl@gmail <[email protected]>
To:     Jörn Kottmann <[email protected]>



On 03/22/2012 06:50 PM, Jörn Kottmann wrote:
On 03/22/2012 10:01 AM, wl.gao.tkl@gmail wrote:
You can. Almost every time we use this symbol to signal the end of a sentence. However, sometimes, it can be missing, especially in a dialogue or chatroom.

The OpenNLP Sentence Detector actual does sentence boundary disambiguation, in English or European languages usually these: !, ?,. are used to indicate a sentence boundary, but often they are used for other things as well, e.g. in abbreviations.

If the sentence boundary is missing it will never split there.

Jörn
Sorry, I have to add that if this mark is missing, then it might be obvious there are no need to separate this text into sentences. (only one sentence)
? - are used in Chinese and Japanese as question mark.
! - emphasise/strongly opinionated
. - not used in C&J
, - for segmentation of one sentence. just like English.

Thank you Jörn ,for the reminding.

Gao

Reply via email to