Not super interested in getting into it right now. Will make a PR when it's more ready.
On Mon, Sep 28, 2015 at 4:15 PM, Daniel Carrera <dcarr...@gmail.com> wrote: > Sounds interesting. I am very naive when it comes to strings. What re the > advantages of the planned String type that can hold arbitrary data? What > are the problems with UTF8? > > Cheers, > Daniel. > > On 28 September 2015 at 17:27, Stefan Karpinski <ste...@karpinski.org> > wrote: > >> There's another reason: I am planning to use `String` as the name of a >> standard string type in a future version of Julia (hopefully 0.5). This >> string type will be UTF-8-like but can hold arbitrary data, which will >> distinguish if from the UTF8String type, which will enforce valid UTF-8 >> encoding. >> >> On Mon, Sep 28, 2015 at 9:56 AM, Steven G. Johnson <stevenj....@gmail.com >> > wrote: >> >>> >>> >>> On Monday, September 28, 2015 at 9:55:27 AM UTC-4, Steven G. Johnson >>> wrote: >>>> >>>> The indices passed to strings in Julia are byte offsets, >>>> >>> >>> (Technically, they are codeunit offsets. In UTF-8, the codeunit is a >>> byte; in UTF-16, the codeunit is a 16-bit word.) >>> >> >> >