Hi Elias,

the UTF8_constructors look OK, but it can be tricky to properly interpret indices into UTF8-encoded strings (the elements of sub in your code), i.e. whether they mean code points or byte offsets.

My feeling is that you should avoid UTF8_strings completely and go for the UTF32 option of the library (assuming that UTF32 means codepoints encoded as 32-bit integers). APL character strings are almost UTF32 strings (except for gaps between the codepoints), and they avoid all the bit shifting needed for UTF8 strings.

Best Regards,
/// Jürgen

On 09/21/2017 12:09 PM, Elias Mårtenson wrote:
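
To illustrate the index ambiguity mentioned above, here is a minimal sketch in Python (not GNU APL code, and not the regex library's API). The sample string and the helper name `byte_offset_to_codepoint_index` are made up for illustration. It shows that the same match position is a different number depending on whether it counts UTF8 bytes or code points, and that the two coincide again once every code point occupies one 32-bit unit.

```python
def byte_offset_to_codepoint_index(utf8: bytes, byte_offset: int) -> int:
    """Convert a byte offset into UTF-8 data to a code point index.

    Hypothetical helper: counts every byte before the offset that is not
    a continuation byte (bit pattern 10xxxxxx), i.e. every byte that
    starts a new code point.
    """
    return sum(1 for b in utf8[:byte_offset] if (b & 0xC0) != 0x80)


text = "a⍴b"                       # U+0061, U+2374, U+0062
utf8 = text.encode("utf-8")        # 1 + 3 + 1 bytes
utf32 = text.encode("utf-32-le")   # 4 bytes per code point, no BOM

# Position of "b" as a byte offset in the UTF-8 data
byte_pos = utf8.find("b".encode("utf-8"))

# Position of "b" as a code point index
cp_pos = text.index("b")

# In UTF-32 the index is simply the byte offset divided by 4
utf32_pos = utf32.find("b".encode("utf-32-le")) // 4

print(byte_pos, cp_pos, utf32_pos)
print(byte_offset_to_codepoint_index(utf8, byte_pos))
```

With UTF8, any index that comes back from a byte-oriented matcher needs a conversion like the helper above before it can be used as an index into an APL character vector. With 32-bit code units that step disappears.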
