New chunks

2014-03-11 Thread Benjamin Beaumont
Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) sentence (breaks on unicode sentence boundaries) paragraph (Same behaviour as curren

Re: New chunks

2014-03-11 Thread Roger Eller
uword or wordup (up = unicode part) ~Roger On Tue, Mar 11, 2014 at 2:15 PM, Benjamin Beaumont wrote: > Hi All, > > We're in the process of adding some new chunk types in LiveCode 7 and we > would appreciate suggestions for a particular chunk name. > > The new chunk types are: > > naturalword (

Re: New chunks

2014-03-11 Thread dunbarx
al Message- From: Roger Eller To: How to use LiveCode Sent: Tue, Mar 11, 2014 2:41 pm Subject: Re: New chunks uword or wordup (up = unicode part) ~Roger On Tue, Mar 11, 2014 at 2:15 PM, Benjamin Beaumont wrote: > Hi All, > > We're in the process of adding some new chunk

Re: New chunks

2014-03-11 Thread Richmond
On 11/03/14 20:15, Benjamin Beaumont wrote: Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) Well; in theory that looks good unti

Re: New chunks

2014-03-11 Thread Peter M. Brigham
On Mar 11, 2014, at 2:15 PM, Benjamin Beaumont wrote: > Hi All, > > We're in the process of adding some new chunk types in LiveCode 7 and we > would appreciate suggestions for a particular chunk name. > > The new chunk types are: > > naturalword (breaks on unicode word boundaries) Will this b

Re: New chunks

2014-03-11 Thread Mark Wieder
Richmond writes: > How are you going to work out what marks a sentence boundary in every > language that one can write > with Unicode? And there are languages where the idea of a 'sentence' is > absent. My initial take on this was the same as yours. But unicode separators seem to have been wel

Re: New chunks

2014-03-11 Thread Mark Wieder
Benjamin Beaumont writes: > 'naturalword' is the clearest internal suggestion at the moment and we'd > love to get the input from community members if there is an even clearer > option. My preferences, in order would be: 1. change the parser so that it accepts modifiers, and specifically so tha

Re: New chunks

2014-03-11 Thread Jim Hurley
> To: LiveCode Developer List , How to > use LiveCode > Subject: New chunks > Message-ID: > > Content-Type: text/plain; charset=ISO-8859-1 > > Hi All, > > We're in the process of adding some new chunk types in LiveCode 7 and we > would appreciate s

Re: New chunks

2014-03-11 Thread Phil Davis
What are the pros and cons of making the 'useUnicode' property persistent and true by default, and let it determine the meaning of 'word'? To my own immature understanding, that seems like a better approach that trying to come up with a new vocabulary item. Phil Davis On 3/11/14, 11:15 AM,

Re: New chunks

2014-03-11 Thread Phil Davis
Or maybe introduce a persistent 'useUnicodeChunks' property that controls which kind of chunking is used. On 3/11/14, 3:42 PM, Phil Davis wrote: What are the pros and cons of making the 'useUnicode' property persistent and true by default, and let it determine the meaning of 'word'? To my own

Re: New chunks

2014-03-11 Thread Phil Davis
Sorry, I didn't read all emails before suggesting this. :-) On 3/11/14, 3:44 PM, Phil Davis wrote: Or maybe introduce a persistent 'useUnicodeChunks' property that controls which kind of chunking is used. On 3/11/14, 3:42 PM, Phil Davis wrote: What are the pros and cons of making the 'useUnico

Re: New chunks

2014-03-11 Thread Mark Wieder
Phil Davis writes: > > What are the pros and cons of making the 'useUnicode' property > persistent and true by default, and let it determine the meaning of > 'word'? To my own immature understanding, that seems like a better > approach that trying to come up with a new vocabulary item. I lik

Re: New chunks

2014-03-11 Thread Mark Schonewille
Hi Ben, I think that the chunk names for unicode should be exactly the same as for english text, but they should act differently if either the text being operated on is unicode text or the useUnicode has been set to true. Note that other development tools don't have this problem at all. Oper

Re: New chunks

2014-03-12 Thread Richmond
I think this might be useful for anyone who wants to detect word breaks in Unicode, although, personally, it makes me feel very queasy indeed: http://www.unicode.org/Public/6.3.0/ucd/auxiliary/WordBreakTest.html This comment; "The material here is informative, not normative." doesn't exactly in

Re: New chunks

2014-03-12 Thread Fraser Gordon
On 11 Mar 2014, at 19:26, Richmond wrote: > > Well; in theory that looks good until you start to think about languages > which are > written (such as Sanskrit) with no obvious word boundaries and both vowel > mutation (Sandhi) > at what would be word boundaries, and consonant fusion. The libr

Fwd: New chunks

2014-03-12 Thread Björnke von Gierke
d for legacyWord in their scripts. > > Kind regards, > > Kevin > > Kevin Miller ~ ke...@runrev.com ~ http://www.livecode.com/ > LiveCode: Everyone can code > > From: Benjamin Beaumont > Reply-To: LiveCode Developer List > Date: Tuesday, 11 March 2014 18:15 > To

Re: New chunks

2014-03-12 Thread Richmond
On 12/03/14 12:12, Fraser Gordon wrote: On 11 Mar 2014, at 19:26, Richmond wrote: Well; in theory that looks good until you start to think about languages which are written (such as Sanskrit) with no obvious word boundaries and both vowel mutation (Sandhi) at what would be word boundaries, an

Re: New chunks

2014-03-12 Thread J. Landman Gay
On 3/12/14, 5:12 AM, Fraser Gordon wrote: The library that we use for low-level Unicode stuff Just curious, do you know how much this will inflate the file size of a standalone? (Not that it matters really.) I'm thinking about the dictionary size(s) as well as the library itself. -- Jacquel

Re: New chunks

2014-03-12 Thread Richmond
On 12/03/14 18:07, J. Landman Gay wrote: On 3/12/14, 5:12 AM, Fraser Gordon wrote: The library that we use for low-level Unicode stuff Just curious, do you know how much this will inflate the file size of a standalone? (Not that it matters really.) I'm thinking about the dictionary size(s) as

Re: New chunks

2014-03-12 Thread Jim Hurley
Let me try this again. In “sentence (breaks on unicode sentence boundaries)” how does RR distinguish between the multiple uses of the period? Jim ___ use-livecode mailing list use-livecode@lists.runrev.com Please visit this url to subscribe, unsubscrib

Re: New chunks

2014-03-12 Thread Fraser Gordon
On 12 Mar 2014, at 17:02, Richmond wrote: > On 12/03/14 18:07, J. Landman Gay wrote: >> On 3/12/14, 5:12 AM, Fraser Gordon wrote: >>> The library that we use for low-level Unicode stuff >> >> Just curious, do you know how much this will inflate the file size of >> a standalone? (Not that it mat

Re: New chunks

2014-03-12 Thread Bob Sneidar
enjamin Beaumont >> To: LiveCode Developer List , How to >> use LiveCode >> Subject: New chunks >> Message-ID: >> >> Content-Type: text/plain; charset=ISO-8859-1 >> >> Hi All, >> >> We're in the process of adding

Re: New chunks

2014-03-12 Thread Vokey, John
As they are unicode “words”, why not: wurd? put the 2nd wurd of “word word” into naturalword. Just a thought. (Now ducking and running because this email is obviously about cheese). -- Please avoid sending me Word or PowerPoint attachments. See

Re: New chunks

2014-03-12 Thread Paul Dupuis
here in the >> same sentence. >> >> My ignorance on unicode is profound. >> Jim >> >> C >>> Message: 15 >>> Date: Tue, 11 Mar 2014 18:15:18 + >>> From: Benjamin Beaumont >>> To: LiveCode Developer List ,How to >

Re: New chunks

2014-03-12 Thread Dick Kriesel
How about "revword" for the pre-7 behavior? + frees "word" to make learning easier post-6 + tribute to heritage + easily recognizable and sensible reference for current and future livecoders + no reference to encoding + no reference to versions + no unexpected "u" + precedent for similar changes i

Re: New chunks

2014-03-12 Thread dunbarx
12, 2014 1:07 pm Subject: Re: New chunks Let me try this again. In “sentence (breaks on unicode sentence boundaries)” how does RR distinguish between the multiple uses of the period? Jim ___ use-livecode mailing list use-livecode@lists.runrev.com

Re: New chunks

2014-03-12 Thread Jerry Jensen
+1 for revword. But I still think wurd is the funniest. .Jerry On Mar 12, 2014, at 7:25 PM, Dick Kriesel wrote: > How about "revword" for the pre-7 behavior? > > + frees "word" to make learning easier post-6 > + tribute to heritage > + easily recognizable and sensible reference for current and

Re: New chunks

2014-03-12 Thread Jim Hurley
; Date: Wed, 12 Mar 2014 23:58:46 +0000 > From: Bob Sneidar > To: How to use LiveCode > Subject: Re: New chunks > Message-ID: > Content-Type: text/plain; charset="Windows-1252" > > Pretty sure Livecode is going to do a simple delimiter on period. You would > have

Re: New chunks

2014-03-12 Thread J. Landman Gay
On 3/12/14, 9:25 PM, Dick Kriesel wrote: How about "revword" for the pre-7 behavior? That's not bad. I wouldn't mind it. As an aside, I'm working late tonight and doing a lot of text parsing. I've got some text like this: 3/13/14 text more text and without even thinking about it, I gra

Re: New chunks

2014-03-13 Thread Fraser Gordon
On 13 Mar 2014, at 04:48, Jim Hurley wrote: > > So I really can't see the purpose of RR's "sentence chunk". I wish they would > explain. > We'd be using ICU's sentence breaking code. They include a whole bunch of language-related knowledge with the library and can use that to tell the diffe

Re: New chunks

2014-03-13 Thread Alan Stenhouse
Wouldn't it be simple to set the itemDel to tab put item 1 of line x of fld y into tDate Granted, slightly more work, but since itemDel changes are limited to only the local handler, it's not too big a deal, is it? Or am I missing something else? Re: the change to "word" - I'd love for it to

Re: New chunks

2014-03-13 Thread Peter Brigham
On Wed, Mar 12, 2014 at 9:32 PM, Paul Dupuis wrote: > Can someone probably construct some sequence of characters that could be > called a sentence that might get mis-parsed? Possibly - I am familiar > with the library RunRev is using only by reputation, so I can't say for > sure. However for most

Re: New chunks

2014-03-13 Thread J. Landman Gay
Sure, that's what I'd need to do, convert each instance to items. I was bemoaning the sheer number of times it would have been necessary in just one night's work. It does point out the necessity for me to have a global switch of some sort. Think of all the swapping you'd have to do within a

Re: New chunks

2014-03-13 Thread Bob Sneidar
Office products stumble at name and standard street abbreviations as well. Floating point numbers however are ignored. Apparently what they are looking for is period-space or period-. Not very comprehensive, but of course nothing could accommodate all abbreviations. The example, "I used to know

Re: New chunks

2014-03-13 Thread Jim Hurley
the paren beyond the period, but not both. I find it quite useful in look ups. I can look find all sentences in a book length ms. in which any number of words occur, regardless of the order. The ICU library will be a welcome addition. Jim > To: How to use LiveCode > Subject: Re: New

Fwd: New chunks

2014-03-13 Thread Benjamin Beaumont
Dear List Members, This discussion has been very interesting and there have been a lot of suggestions made. Ultimately I do not feel that now is the correct time to change the fundamental meaning of any of the syntax of LiveCode. My team and I have been very carefully crafting the 7.0 release so t

Re: New chunks

2014-03-13 Thread Benjamin Beaumont
Dear list member, Thank you for your input on this important matter. Having taken all the posts into account and discussed it internally we are inclined to implement the following in LiveCode 7.0: 1) Add a 'trueWord' chunk type to provide access to the Unicode Standard's notion of word-breaking.

Re: New chunks

2014-03-13 Thread Richard Gaskin
On 3/13/14 9:51 AM, Benjamin Beaumont wrote: Dear List Members, This discussion has been very interesting and there have been a lot of suggestions made. Ultimately I do not feel that now is the correct time to change the fundamental meaning of any of the syntax of LiveCode. My team and I have be

Re: New chunks

2014-03-13 Thread J. Landman Gay
On 3/13/14, 10:45 AM, Bob Sneidar wrote: Things like this make me ponder in what scenario would it be necessary to isolate sentences at all. I've had more than one case where I wanted to extract portions of a paragraph. Right now lines = paragraphs and it gets messy trying to do that. Parse a

Re: New chunks

2014-03-13 Thread Bob Sneidar
I had to read it 3 times to get it right! No computer is THAT advanced! Bob On Mar 13, 2014, at 07:31 , Peter Brigham wrote: > On Wed, Mar 12, 2014 at 9:32 PM, Paul Dupuis wrote: > >> Can someone probably construct some sequence of characters that could be >> called a sentence that might get

Re: New chunks

2014-03-13 Thread John Craig
Will point 4 - 'part' synonym break anything that already uses 'part' as a synonym of 'control'? I've only got one project that uses this keyword - not a big deal for me to change it, but others may have more substantial work to change. On 13/03/2014 16:54, Benjamin Beaumont wrote: Dear lis

Re: New chunks

2014-03-13 Thread Mark Wieder
John- Thursday, March 13, 2014, 3:25:19 PM, you wrote: > Will point 4 - 'part' synonym break anything that already uses 'part' as > a synonym of 'control'? I've only got one project that uses this > keyword - not a big deal for me to change it, but others may have more > substantial work to cha

Re: New chunks

2014-03-13 Thread Stephen MacLean
On Mar 13, 2014, at 1:10 PM, Richard Gaskin wrote: > So let's please do our part to ensure that the current Release Candidate > fully meets our needs. If we test thoroughly enough it may even be the most > solid release ever, which benefits everyone. > > 6.6 RC1 is available at the top of th

Re: New chunks

2014-03-13 Thread Monte Goulding
On 14/03/2014, at 2:05 PM, Stephen MacLean wrote: > Everything If tested so far works fine, but mergMK seems to have a problem > when when you go to set it's rect to the rect of a graphic that's been > scaled, it doesn't see it as scaled and is drawn at a quarter of the size. Hmm... LCInterfa

Re: New chunks

2014-03-14 Thread Benjamin Beaumont
Hi Stephen, Monte. We're looking at this today. Warm regards, Ben On 14 March 2014 03:41, Monte Goulding wrote: > > On 14/03/2014, at 2:05 PM, Stephen MacLean wrote: > > > Everything If tested so far works fine, but mergMK seems to have a > problem when when you go to set it's rect to the re

Re: New chunks

2014-03-14 Thread Stephen MacLean
Thanks Ben! Best, Steve On Mar 14, 2014, at 6:35 AM, Benjamin Beaumont wrote: > Hi Stephen, Monte. > > We're looking at this today. > > Warm regards, > > Ben > > > On 14 March 2014 03:41, Monte Goulding wrote: > >> >> On 14/03/2014, at 2:05 PM, Stephen MacLean wrote: >> >>> Everything

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 13, 2014, at 10:17 AM, J. Landman Gay wrote: > Sure, that's what I'd need to do, convert each instance to items. I was > bemoaning the sheer number of times it would have been necessary in just one > night's work. It does point out the necessity for me to have a global switch > of some

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 13, 2014, at 11:51 AM, Jim Hurley wrote: > I have a poor man’s version that is workable. > > However just ran across a sentence in today’s NYT that I will have to include: > >(“What they’re offering people is a full stomach and an empty soul.”) > > I had dealt with the quote beyo

Re: New chunks

2014-03-15 Thread Jim Hurley
> > Message: 3 > Date: Sat, 15 Mar 2014 11:30:06 -0400 > From: "Peter M. Brigham" > To: How to use LiveCode > Subject: Re: New chunks > Message-ID: > Content-Type: text/plain; charset=windows-1252 > > On Mar 13, 2014, at 11:51 AM, Jim Hurley wrote: &

Re: New chunks

2014-03-15 Thread Jim Hurley
Just thought of a situation I didn’t deal with. The apostrophe is tricky. In doing a word search, you would want to kept the apostrophe in the word “don’t” but probably not in “Howard’s best friend is his dog.” “Howard’s” should appear in the found sentence but not in the word search, i.e. If y

Re: New chunks

2014-03-15 Thread Peter Haworth
Hi Jim, Have you ever tried the sqlite fts virtual table? It's specifically designed for text searching and is lightening fast. Certainly an extra step load stuff into a sneak but might be worth a look. Pete lcSQL Software On Mar 15, 2014 4:25 PM, "Jim Hurley" wrote: > Just thought of a situat

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 15, 2014, at 7:05 PM, Jim Hurley wrote: > The following app allows one to put any text into a field and search for any > or all words (whole words of patricidal) in a sentence or paragraph in > whatever order, even MS Word with all their curly quotes and apostrophes etc. > It handles de

Re: New chunks

2014-03-15 Thread stephen barncard
On Sat, Mar 15, 2014 at 7:19 PM, Peter M. Brigham wrote: > I get "script compile error: Expression bad factor" in the message box, > and when I click on the link itself I get a 404 in the browser. yeah something odd about the URL here too. *--* *Stephen Barncard - San Francisco Ca. USA - Deeds

Re: New chunks

2014-03-16 Thread Jim Hurley
Sorry Peter and Stephen, That should have been: open stack url “https://dl.dropboxusercontent.com/u/47044230/FindSentences2.livecode"; Jim > Message: 6 > Date: Sat, 15 Mar 2014 22:19:20 -0400 > From: "Peter M. Brigham" > To: How to use LiveCode > Subjec

Re: New chunks

2014-03-16 Thread Jim Hurley
Hi Pete, I’ve never gotten into SQL. Will it search for any or all words in any order in a sentence? Jim > Message: 5 > Date: Sat, 15 Mar 2014 17:02:20 -0700 > From: Peter Haworth > To: How to use LiveCode > Subject: Re: New chunks > Message-ID: > > Content-T

Re: New chunks

2014-03-16 Thread Peter Haworth
Date: Sat, 15 Mar 2014 17:02:20 -0700 > > From: Peter Haworth > > To: How to use LiveCode > > Subject: Re: New chunks > > Message-ID: > > < > cagdt7ep0pdwcchnhoxcfpxdjgpy4yxq9pnmf0brw73je4vc...@mail.gmail.com> > > Content-Type: text/plain; charse