Re: [Jprogramming] Separating sentences

2018-04-04 Thread Skip Cave
Raul, Your separatedirty and clean verbs work perfectly on a test set of my data. Now I need to make sure that there are no other weird aberrations when I run the full data set. Thanks for the help! Skip Skip Cave Cave Consulting LLC On Wed, Apr 4, 2018 at 2:20 PM, Raul Miller wrote: > Hmm.

Re: [Jprogramming] Separating sentences

2018-04-04 Thread Raul Miller
Hmm.. The code I had posted at http://jsoftware.com/pipermail/programming/2018-April/050900.html almost did what you wanted: separatedirty ex4 ┌┬┬──┬─┐ │This is Skip's test.│Testing one, two, three.│Count 3, 2, 1.│

Re: [Jprogramming] Separating sentences

2018-04-04 Thread Skip Cave
​​All, Luckily, the text I am working with is somewhat more well-behaved than Finnegans Wake or SOLAR BONES. However, there is one more twist in the text that I need to address - broken sentences: ex4=. ('This is Skip''s test. Testing one, two, three. Count 3, 2, 1. This is a');('broken sentence

Re: [Jprogramming] Separating sentences

2018-04-04 Thread Tom Arneson
SOLAR BONES by Mike McCormack has no periods "." -Original Message- From: Programming On Behalf Of Jose Mario Quintana Sent: Wednesday, April 4, 2018 12:09 To: Programming forum Subject: Re: [Jprogramming] Separating sentences > text of Finnegan's Wake. The actua

Re: [Jprogramming] Separating sentences

2018-04-04 Thread Jose Mario Quintana
> text of Finnegan's Wake. The actual name of the book by James Joyce is "Finnegans Wake" and one could argue that it is not written in English ;) On Wed, Apr 4, 2018 at 12:30 PM, Raul Miller wrote: > There are some unmentioned issues that may trip you up eventually with > this approach, for

Re: [Jprogramming] Separating sentences

2018-04-04 Thread Raul Miller
There are some unmentioned issues that may trip you up eventually with this approach, for example, if you try to apply these routines to the text of Finnegan's Wake. To hint at those issues, here's an approach that takes you directly to the final result: ex1=: <'This is Skip''s test. Testing o

[Jprogramming] Separating sentences

2018-04-04 Thread Skip Cave
I have the following boxed data: ex1=. <'This is Skip''s test. Testing one, two, three. Count 3, 2, 1.' ​ ​ ex1 ┌┐ │This is Skip's test. Testing one, two, three. Count 3, 2, 1.│ └───