Re: "ICU - International Components for Unicode"

2020-09-27 Thread Samantha McVey
So MoarVM uses its own database of the UCD. One nice thing is this can probably be faster than calling to the ICU to look up information of each codepoint in a long string. Secondly it implements its own text data structures, so the nice features of the UCD to do that would be difficult to use.

Re: readchars, seek back, and readchars again

2020-04-28 Thread Samantha McVey
On zaterdag 25 april 2020 21:51:41 CEST Joseph Brenner wrote: > > Yary has an issue posted regarding 'display-width' of UTF-16 encoded strings: > > https://github.com/rakudo/rakudo/issues/3461 > > > > I know it might be far-fetched, but what if your UTF-8 issue and > > Yary's UTF-16 issue wer

Re: readchars, seek back, and readchars again

2020-04-28 Thread Samantha McVey
uot; feature, it can be tricky to predict where you're > going to end up, because the point you're starting at depends on what > kind text you've been reading, not just the number of bytes you've > read. > > Is that making any sense? I posted a later code examp