Thanks Ben, your hint made the whole patch work with umlauts and
accents.

Thanks for the explanation, IOhannes. I am having a hard time
understanding the following:

- Bytes, which have a range between 0 and 255.
- "Unicode points", which are numbers representing characters, including
  umlauts?
- "ASCII characters", which are somehow bytes but also use Unicode points 0-127.
- The "string" message and how it differs from the "text" message in
  [text2d].
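
A minimal Python sketch of the distinction between these terms, for
illustration only. It is Python rather than Pd; the sample word and the
helper name utf8_to_code_points are made up here, and the final list is
just a plain Python stand-in for a "string" message, not Gem's API:

```python
text = "Grüße"

# UTF-8 bytes: every value fits in 0..255; "ü" and "ß" need two bytes each.
utf8_bytes = list(text.encode("utf-8"))

# Unicode points: one number per character, not limited to 255.
code_points = [ord(ch) for ch in text]

print(utf8_bytes)    # [71, 114, 195, 188, 195, 159, 101]
print(code_points)   # [71, 114, 252, 223, 101]

# For pure ASCII text, both representations are the same numbers.
ascii_text = "Gruss"
ascii_bytes = list(ascii_text.encode("utf-8"))
ascii_points = [ord(ch) for ch in ascii_text]
print(ascii_bytes == ascii_points)   # True


def utf8_to_code_points(raw):
    """Hypothetical helper: decode raw UTF-8 byte values into Unicode points."""
    return [ord(ch) for ch in bytes(raw).decode("utf-8")]


# Stand-in for the advice "convert the bytes, then prefix with string".
string_message = ["string"] + utf8_to_code_points(utf8_bytes)
print(string_message)   # ['string', 71, 114, 252, 223, 101]
```

Sending the seven byte values directly would hand over 195 and 188 as if
they were two separate characters, where the single Unicode point 252 is
meant.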

All the best, Peter

* IOhannes m zmölnig via Pd-list <[email protected]> [2026-01-28 19:51]:
> On 28 January 2026 19:08:15 CET, "Peter P." <[email protected]> wrote:
> >
> >> basically, my advice is to use the [string( message with unicode points:
> >> - use [file] to read the raw UTF-8 bytes
> >> - use [unicode/utf82codenumber] to convert the UTF-8 bytes into
> >> unicode-points
> >> - prefix [string(
> >
> >I tried to implement this (see attached) but [text2d] renders the actual
> >bytes as numbers... 
> 
> I haven't checked your patch yet, but apart from the obvious (as Ben pointed 
> out), I'd like to reiterate:
> 
> the "string" message does **not** take bytes but Unicode points (there are 
> about 2^20 of them), which are numbers (between 0 and ~1000000).
> bytes can only have values between 0 and 255.
> 
> for ASCII characters (which only use Unicode points between 0 and 127), the 
> UTF-8 representation is identical to the Unicode point representation.
> 
> 
> mfg.sfg.jfd
> IOhannes
> ---
> [email protected] - the Pure Data mailinglist
> https://lists.iem.at/hyperkitty/list/[email protected]/message/3HTQ2OT4USBHFLMCZR7HFV7EZA3TEETC/
> 
> To unsubscribe send an email to [email protected] mailing list
> UNSUBSCRIBE and account-management -> https://lists.iem.at/

---
[email protected] - the Pure Data mailinglist
https://lists.iem.at/hyperkitty/list/[email protected]/message/7HZC2LYUHLSYIVSG5L2ZM3ZLVE6A375E/
