Re: New chunks

2014-03-16 Thread Jim Hurley
Subject: Re: New chunks Message-ID: 5865b76a-362c-44a2-97e6-1bf49181a...@gmail.com Content-Type: text/plain; charset=windows-1252 On Mar 15, 2014, at 7:05 PM, Jim Hurley wrote: The following app allows one to put any text into a field and search for any or all words (whole words of patricidal

Re: New chunks

2014-03-16 Thread Jim Hurley
Hi Pete, I’ve never gotten into SQL. Will it search for any or all words in any order in a sentence? Jim Message: 5 Date: Sat, 15 Mar 2014 17:02:20 -0700 From: Peter Haworth p...@lcsql.com To: How to use LiveCode use-livecode@lists.runrev.com Subject: Re: New chunks Message-ID

Re: New chunks

2014-03-16 Thread Peter Haworth
...@lcsql.com To: How to use LiveCode use-livecode@lists.runrev.com Subject: Re: New chunks Message-ID: cagdt7ep0pdwcchnhoxcfpxdjgpy4yxq9pnmf0brw73je4vc...@mail.gmail.com Content-Type: text/plain; charset=ISO-8859-1 Hi Jim, Have you ever tried the sqlite fts virtual table? It's

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 13, 2014, at 10:17 AM, J. Landman Gay wrote: Sure, that's what I'd need to do, convert each instance to items. I was bemoaning the sheer number of times it would have been necessary in just one night's work. It does point out the necessity for me to have a global switch of some

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 13, 2014, at 11:51 AM, Jim Hurley wrote: I have a poor man’s version that is workable. However just ran across a sentence in today’s NYT that I will have to include: (“What they’re offering people is a full stomach and an empty soul.”) I had dealt with the quote beyond the

Re: New chunks

2014-03-15 Thread Jim Hurley
Message: 3 Date: Sat, 15 Mar 2014 11:30:06 -0400 From: Peter M. Brigham pmb...@gmail.com To: How to use LiveCode use-livecode@lists.runrev.com Subject: Re: New chunks Message-ID: d04520e3-36df-4bf6-9ced-72b363e19...@gmail.com Content-Type: text/plain; charset=windows-1252 On Mar 13

Re: New chunks

2014-03-15 Thread Jim Hurley
Just thought of a situation I didn’t deal with. The apostrophe is tricky. In doing a word search, you would want to kept the apostrophe in the word “don’t” but probably not in “Howard’s best friend is his dog.” “Howard’s” should appear in the found sentence but not in the word search, i.e. If

Re: New chunks

2014-03-15 Thread Peter Haworth
Hi Jim, Have you ever tried the sqlite fts virtual table? It's specifically designed for text searching and is lightening fast. Certainly an extra step load stuff into a sneak but might be worth a look. Pete lcSQL Software On Mar 15, 2014 4:25 PM, Jim Hurley jhurley0...@sbcglobal.net wrote:

Re: New chunks

2014-03-15 Thread Peter M. Brigham
On Mar 15, 2014, at 7:05 PM, Jim Hurley wrote: The following app allows one to put any text into a field and search for any or all words (whole words of patricidal) in a sentence or paragraph in whatever order, even MS Word with all their curly quotes and apostrophes etc. It handles

Re: New chunks

2014-03-15 Thread stephen barncard
On Sat, Mar 15, 2014 at 7:19 PM, Peter M. Brigham pmb...@gmail.com wrote: I get script compile error: Expression bad factor in the message box, and when I click on the link itself I get a 404 in the browser. yeah something odd about the URL here too. *--* *Stephen Barncard - San Francisco

Re: New chunks

2014-03-14 Thread Benjamin Beaumont
Hi Stephen, Monte. We're looking at this today. Warm regards, Ben On 14 March 2014 03:41, Monte Goulding mo...@sweattechnologies.com wrote: On 14/03/2014, at 2:05 PM, Stephen MacLean wrote: Everything If tested so far works fine, but mergMK seems to have a problem when when you go to

Re: New chunks

2014-03-14 Thread Stephen MacLean
Thanks Ben! Best, Steve On Mar 14, 2014, at 6:35 AM, Benjamin Beaumont b...@runrev.com wrote: Hi Stephen, Monte. We're looking at this today. Warm regards, Ben On 14 March 2014 03:41, Monte Goulding mo...@sweattechnologies.com wrote: On 14/03/2014, at 2:05 PM, Stephen

Re: New chunks

2014-03-13 Thread J. Landman Gay
On 3/12/14, 9:25 PM, Dick Kriesel wrote: How about revword for the pre-7 behavior? That's not bad. I wouldn't mind it. As an aside, I'm working late tonight and doing a lot of text parsing. I've got some text like this: 3/13/14 tab text tab more text and without even thinking about it,

Re: New chunks

2014-03-13 Thread Fraser Gordon
On 13 Mar 2014, at 04:48, Jim Hurley jhurley0...@sbcglobal.net wrote: So I really can't see the purpose of RR's sentence chunk. I wish they would explain. We'd be using ICU's sentence breaking code. They include a whole bunch of language-related knowledge with the library and can use

Re: New chunks

2014-03-13 Thread Alan Stenhouse
Wouldn't it be simple to set the itemDel to tab put item 1 of line x of fld y into tDate Granted, slightly more work, but since itemDel changes are limited to only the local handler, it's not too big a deal, is it? Or am I missing something else? Re: the change to word - I'd love for it to

Re: New chunks

2014-03-13 Thread Peter Brigham
On Wed, Mar 12, 2014 at 9:32 PM, Paul Dupuis p...@researchware.com wrote: Can someone probably construct some sequence of characters that could be called a sentence that might get mis-parsed? Possibly - I am familiar with the library RunRev is using only by reputation, so I can't say for

Re: New chunks

2014-03-13 Thread J. Landman Gay
Sure, that's what I'd need to do, convert each instance to items. I was bemoaning the sheer number of times it would have been necessary in just one night's work. It does point out the necessity for me to have a global switch of some sort. Think of all the swapping you'd have to do within

Re: New chunks

2014-03-13 Thread Bob Sneidar
Office products stumble at name and standard street abbreviations as well. Floating point numbers however are ignored. Apparently what they are looking for is period-space or period-break of some kind. Not very comprehensive, but of course nothing could accommodate all abbreviations. The

Re: New chunks

2014-03-13 Thread Jim Hurley
Subject: Re: New chunks Message-ID: a9cfc885-71fb-4f78-b7df-df5954511...@runrev.com Content-Type: text/plain; charset=us-ascii On 13 Mar 2014, at 04:48, Jim Hurley jhurley0...@sbcglobal.net wrote: So I really can't see the purpose of RR's sentence chunk. I wish they would explain

Fwd: New chunks

2014-03-13 Thread Benjamin Beaumont
Dear List Members, This discussion has been very interesting and there have been a lot of suggestions made. Ultimately I do not feel that now is the correct time to change the fundamental meaning of any of the syntax of LiveCode. My team and I have been very carefully crafting the 7.0 release so

Re: New chunks

2014-03-13 Thread Benjamin Beaumont
Dear list member, Thank you for your input on this important matter. Having taken all the posts into account and discussed it internally we are inclined to implement the following in LiveCode 7.0: 1) Add a 'trueWord' chunk type to provide access to the Unicode Standard's notion of word-breaking.

Re: New chunks

2014-03-13 Thread Richard Gaskin
On 3/13/14 9:51 AM, Benjamin Beaumont wrote: Dear List Members, This discussion has been very interesting and there have been a lot of suggestions made. Ultimately I do not feel that now is the correct time to change the fundamental meaning of any of the syntax of LiveCode. My team and I have

Re: New chunks

2014-03-13 Thread J. Landman Gay
On 3/13/14, 10:45 AM, Bob Sneidar wrote: Things like this make me ponder in what scenario would it be necessary to isolate sentences at all. I've had more than one case where I wanted to extract portions of a paragraph. Right now lines = paragraphs and it gets messy trying to do that. Parse

Re: New chunks

2014-03-13 Thread Bob Sneidar
I had to read it 3 times to get it right! No computer is THAT advanced! Bob On Mar 13, 2014, at 07:31 , Peter Brigham pmb...@gmail.com wrote: On Wed, Mar 12, 2014 at 9:32 PM, Paul Dupuis p...@researchware.com wrote: Can someone probably construct some sequence of characters that could be

Re: New chunks

2014-03-13 Thread John Craig
Will point 4 - 'part' synonym break anything that already uses 'part' as a synonym of 'control'? I've only got one project that uses this keyword - not a big deal for me to change it, but others may have more substantial work to change. On 13/03/2014 16:54, Benjamin Beaumont wrote: Dear

Re: New chunks

2014-03-13 Thread Mark Wieder
John- Thursday, March 13, 2014, 3:25:19 PM, you wrote: Will point 4 - 'part' synonym break anything that already uses 'part' as a synonym of 'control'? I've only got one project that uses this keyword - not a big deal for me to change it, but others may have more substantial work to

Re: New chunks

2014-03-13 Thread Stephen MacLean
On Mar 13, 2014, at 1:10 PM, Richard Gaskin ambassa...@fourthworld.com wrote: So let's please do our part to ensure that the current Release Candidate fully meets our needs. If we test thoroughly enough it may even be the most solid release ever, which benefits everyone. 6.6 RC1 is

Re: New chunks

2014-03-13 Thread Monte Goulding
On 14/03/2014, at 2:05 PM, Stephen MacLean wrote: Everything If tested so far works fine, but mergMK seems to have a problem when when you go to set it's rect to the rect of a graphic that's been scaled, it doesn't see it as scaled and is drawn at a quarter of the size. Hmm...

Re: New chunks

2014-03-12 Thread Richmond
I think this might be useful for anyone who wants to detect word breaks in Unicode, although, personally, it makes me feel very queasy indeed: http://www.unicode.org/Public/6.3.0/ucd/auxiliary/WordBreakTest.html This comment; The material here is informative, not normative. doesn't exactly

Re: New chunks

2014-03-12 Thread Fraser Gordon
On 11 Mar 2014, at 19:26, Richmond richmondmathew...@gmail.com wrote: Well; in theory that looks good until you start to think about languages which are written (such as Sanskrit) with no obvious word boundaries and both vowel mutation (Sandhi) at what would be word boundaries, and

Fwd: New chunks

2014-03-12 Thread Björnke von Gierke
LiveCode use-livecode@lists.runrev.com Subject: New chunks Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) sentence

Re: New chunks

2014-03-12 Thread Richmond
On 12/03/14 12:12, Fraser Gordon wrote: On 11 Mar 2014, at 19:26, Richmond richmondmathew...@gmail.com wrote: Well; in theory that looks good until you start to think about languages which are written (such as Sanskrit) with no obvious word boundaries and both vowel mutation (Sandhi) at what

Re: New chunks

2014-03-12 Thread J. Landman Gay
On 3/12/14, 5:12 AM, Fraser Gordon wrote: The library that we use for low-level Unicode stuff Just curious, do you know how much this will inflate the file size of a standalone? (Not that it matters really.) I'm thinking about the dictionary size(s) as well as the library itself. --

Re: New chunks

2014-03-12 Thread Richmond
On 12/03/14 18:07, J. Landman Gay wrote: On 3/12/14, 5:12 AM, Fraser Gordon wrote: The library that we use for low-level Unicode stuff Just curious, do you know how much this will inflate the file size of a standalone? (Not that it matters really.) I'm thinking about the dictionary size(s) as

Re: New chunks

2014-03-12 Thread Fraser Gordon
On 12 Mar 2014, at 17:02, Richmond richmondmathew...@gmail.com wrote: On 12/03/14 18:07, J. Landman Gay wrote: On 3/12/14, 5:12 AM, Fraser Gordon wrote: The library that we use for low-level Unicode stuff Just curious, do you know how much this will inflate the file size of a standalone?

Re: New chunks

2014-03-12 Thread Bob Sneidar
livecode-...@lists.runrev.com, How to use LiveCode use-livecode@lists.runrev.com Subject: New chunks Message-ID: CADd0_Txbhdem4PbKXifXUsujqPLs9HROME6vKhF=sk1znp2...@mail.gmail.com Content-Type: text/plain; charset=ISO-8859-1 Hi All, We're in the process of adding some new

Re: New chunks

2014-03-12 Thread Vokey, John
As they are unicode “words”, why not: wurd? put the 2nd wurd of “word word” into naturalword. Just a thought. (Now ducking and running because this email is obviously about cheese). -- Please avoid sending me Word or PowerPoint attachments. See

Re: New chunks

2014-03-12 Thread Paul Dupuis
+ From: Benjamin Beaumont b...@runrev.com To: LiveCode Developer List livecode-...@lists.runrev.com,How to use LiveCode use-livecode@lists.runrev.com Subject: New chunks Message-ID: CADd0_Txbhdem4PbKXifXUsujqPLs9HROME6vKhF=sk1znp2...@mail.gmail.com Content-Type: text/plain

Re: New chunks

2014-03-12 Thread Dick Kriesel
How about revword for the pre-7 behavior? + frees word to make learning easier post-6 + tribute to heritage + easily recognizable and sensible reference for current and future livecoders + no reference to encoding + no reference to versions + no unexpected u + precedent for similar changes in the

Re: New chunks

2014-03-12 Thread dunbarx
-livecode use-livecode@lists.runrev.com Sent: Wed, Mar 12, 2014 1:07 pm Subject: Re: New chunks Let me try this again. In “sentence (breaks on unicode sentence boundaries)” how does RR distinguish between the multiple uses of the period? Jim ___ use

Re: New chunks

2014-03-12 Thread Jerry Jensen
+1 for revword. But I still think wurd is the funniest. .Jerry On Mar 12, 2014, at 7:25 PM, Dick Kriesel dick.krie...@mail.com wrote: How about revword for the pre-7 behavior? + frees word to make learning easier post-6 + tribute to heritage + easily recognizable and sensible reference for

Re: New chunks

2014-03-12 Thread Jim Hurley
to use LiveCode use-livecode@lists.runrev.com Subject: Re: New chunks Message-ID: e3ab1833-c8fc-47f0-af8e-358e6ec80...@iotecdigital.com Content-Type: text/plain; charset=Windows-1252 Pretty sure Livecode is going to do a simple delimiter on period. You would have to prep the data first

New chunks

2014-03-11 Thread Benjamin Beaumont
Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) sentence (breaks on unicode sentence boundaries) paragraph (Same behaviour as

Re: New chunks

2014-03-11 Thread Roger Eller
uword or wordup (up = unicode part) ~Roger On Tue, Mar 11, 2014 at 2:15 PM, Benjamin Beaumont b...@runrev.com wrote: Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are:

Re: New chunks

2014-03-11 Thread dunbarx
Eller roger.e.el...@sealedair.com To: How to use LiveCode use-livecode@lists.runrev.com Sent: Tue, Mar 11, 2014 2:41 pm Subject: Re: New chunks uword or wordup (up = unicode part) ~Roger On Tue, Mar 11, 2014 at 2:15 PM, Benjamin Beaumont b...@runrev.com wrote: Hi All, We're in the process

Re: New chunks

2014-03-11 Thread Richmond
On 11/03/14 20:15, Benjamin Beaumont wrote: Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) Well; in theory that looks good

Re: New chunks

2014-03-11 Thread Peter M. Brigham
On Mar 11, 2014, at 2:15 PM, Benjamin Beaumont wrote: Hi All, We're in the process of adding some new chunk types in LiveCode 7 and we would appreciate suggestions for a particular chunk name. The new chunk types are: naturalword (breaks on unicode word boundaries) snip Will this be

Re: New chunks

2014-03-11 Thread Mark Wieder
Richmond richmondmathewson@... writes: How are you going to work out what marks a sentence boundary in every language that one can write with Unicode? And there are languages where the idea of a 'sentence' is absent. My initial take on this was the same as yours. But unicode separators

Re: New chunks

2014-03-11 Thread Mark Wieder
Benjamin Beaumont ben@... writes: 'naturalword' is the clearest internal suggestion at the moment and we'd love to get the input from community members if there is an even clearer option. My preferences, in order would be: 1. change the parser so that it accepts modifiers, and specifically

Re: New chunks

2014-03-11 Thread Jim Hurley
To: LiveCode Developer List livecode-...@lists.runrev.com, How to use LiveCode use-livecode@lists.runrev.com Subject: New chunks Message-ID: CADd0_Txbhdem4PbKXifXUsujqPLs9HROME6vKhF=sk1znp2...@mail.gmail.com Content-Type: text/plain; charset=ISO-8859-1 Hi All, We're in the process

Re: New chunks

2014-03-11 Thread Phil Davis
What are the pros and cons of making the 'useUnicode' property persistent and true by default, and let it determine the meaning of 'word'? To my own immature understanding, that seems like a better approach that trying to come up with a new vocabulary item. Phil Davis On 3/11/14, 11:15 AM,

Re: New chunks

2014-03-11 Thread Phil Davis
Or maybe introduce a persistent 'useUnicodeChunks' property that controls which kind of chunking is used. On 3/11/14, 3:42 PM, Phil Davis wrote: What are the pros and cons of making the 'useUnicode' property persistent and true by default, and let it determine the meaning of 'word'? To my own

Re: New chunks

2014-03-11 Thread Phil Davis
Sorry, I didn't read all emails before suggesting this. :-) On 3/11/14, 3:44 PM, Phil Davis wrote: Or maybe introduce a persistent 'useUnicodeChunks' property that controls which kind of chunking is used. On 3/11/14, 3:42 PM, Phil Davis wrote: What are the pros and cons of making the

Re: New chunks

2014-03-11 Thread Mark Wieder
Phil Davis revdev@... writes: What are the pros and cons of making the 'useUnicode' property persistent and true by default, and let it determine the meaning of 'word'? To my own immature understanding, that seems like a better approach that trying to come up with a new vocabulary item.

Re: New chunks

2014-03-11 Thread Mark Schonewille
Hi Ben, I think that the chunk names for unicode should be exactly the same as for english text, but they should act differently if either the text being operated on is unicode text or the useUnicode has been set to true. Note that other development tools don't have this problem at all.