Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Karl Kleinpaste
On 03/06/2017 09:06 PM, DM Smith wrote: > Does setting CLANG (or whatever it is) in the env help? In unix you > have to tell the program what charset you are using. They already come along for the ride for free as a result of logging in, per default specification when system was installed. $

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread DM Smith
Does setting CLANG (or whatever it is) in the env help? In unix you have to tell the program what charset you are using. Cent from my fone so theer mite be tipos. ;) > On Mar 6, 2017, at 7:52 PM, Karl Kleinpaste wrote: > >> On 03/06/2017 05:25 PM, Greg Hellings wrote: >>

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Troy A. Griffitts
Yeah, so this page shows that c11x regex is still mostly unsupported in gcc: http://gcc.gnu.org/onlinedocs/libstdc++/manual/status.html#status.iso.tr1 (see section 7) And the old school gnu regex we use otherwise I don't think knows anything about wide chars. It simply compares bytes and

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Karl Kleinpaste
On 03/06/2017 05:25 PM, Greg Hellings wrote: > being off by 2 would seem strange to me I don't understand this question at all. 0xE2 = 226 = 0342 0x80 = 128 = 0200 0x93 = 147 = 0223 There's no off-by error at all. "od" is the "octal dump" tool; given -c, it tries to dump characters, but outside

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Greg Hellings
On Mon, Mar 6, 2017 at 4:15 PM, David Haslam wrote: > Are we sure it's an "off by 2" error and not just an email typo? > I'm not sure of that at all. It was my first guess, but being off by 2 would seem strange to me, as I would expect a "fat finger" error to produce an

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread David Haslam
Are we sure it's an "off by 2" error and not just an email typo? I wasn't expecting decimal, I just didn't parse it as octal. David -- View this message in context: http://sword-dev.350566.n4.nabble.com/diatheke-search-type-regex-and-the-dot-tp4656879p4656914.html Sent from the SWORD Dev

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Greg Hellings
147 = 0223 (octal) 128 = 0200 (octal) 226 = 0340 (octal) So it's off by 2 in the top order byte. Not sure why, but it seems you're expecting decimal but the tool is obviously giving out octal. --Greg On Mon, Mar 6, 2017 at 3:02 PM, David Haslam wrote: > Thanks Karl, > >

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread David Haslam
Thanks Karl, All the "hyphenated" names in the KJV OT use the *en dash* character U+2013 which has 3 UTF-8 bytes E2 80 93. In decimal, these are 226 128 147 so we might well wonder how your tool gave 342 200 223 ? Best regards, David -- View this message in context:

Re: [sword-devel] diatheke search type regex and the dot ?

2017-03-06 Thread Karl Kleinpaste
On 03/03/2017 09:16 PM, Troy A. Griffitts wrote: > SWORD supports compiling with a variety of regex engines I have an interesting result. My previous build of sword used --with-cxx11regex, and that failed to find Abednego in any circumstance. Reconfiguring without that option and rebuilding, I