The training data has one sentence per line, so the 7 lines are 7 sentences. That's the format the sentence detector is trained on.
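
For anyone who wants to check that count against their own copy of the training data, here is a rough sketch in Python. It only assumes the one-sentence-per-line format described above; the function name and the sample sentences are made up for illustration and are not part of cTAKES.

```python
def count_apostrophe_endings(lines):
    """Count non-empty lines, and how many of them end with an apostrophe.

    Assumes one training sentence per line. Both the ASCII apostrophe (')
    and the typographic one (U+2019) are treated as an apostrophe.
    """
    total = 0
    ending = 0
    for line in lines:
        sentence = line.strip()  # drop the newline and surrounding whitespace
        if not sentence:
            continue  # skip blank lines
        total += 1
        if sentence.endswith(("'", "\u2019")):
            ending += 1
    return total, ending


# Hypothetical sample lines, standing in for a real training file.
sample = [
    "Patient denies chest pain.",
    "Reviewed the patient's chart.",
    "Symptoms described as 'sharp'",
    "",
]
total, ending = count_apostrophe_endings(sample)
print(f"{ending} of {total} sentences end with an apostrophe")
```

To run it on a real file, pass an open file handle instead of the list, since iterating over a file yields its lines.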

-----Original Message-----
From: [email protected] [mailto:[email protected]] On Behalf Of Tim Miller
Sent: Monday, August 26, 2013 11:12 AM
To: [email protected]
Subject: Re: apostrophe and sentence detector

On 08/26/2013 12:05 PM, Masanz, James J. wrote:
> The recently rebuilt sentence detector (currently in trunk and the 3.1.0
> branch) is sometimes taking the apostrophe as a sentence break where the
> ctakes-3.0.0-incubating model didn't.
>
> The training data used for the recently rebuilt model only contains only 7
> lines that end with an apostrophe (single quote)

Do you mean 7 sentences that end in a single apostrophe or 7 lines? The sentence detector will currently break on newlines no matter what, so the important number is how many sentences end mid-line with an apostrophe, right?

Tim
