Also, to extract unicode characters as single elements of an array, I
would use 9 u:
#9 u:a.{~40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
11
,.9 u:a.{~40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
(
͡
°
͜
ʖ
͡
°
)
(However, when writing to a file, I'd convert those back to a utf-8
byte stream, using 8 u:)
((That said, note also that unicode is not a single standard but a
suite of standards. And https://blog.unicode.org/ might be a good
place to get an overview of some of those things.))
FYI,
--
Raul
On Sun, Feb 5, 2023 at 10:26 AM Elijah Stone <[email protected]> wrote:
>
> A single unicode code point may be encoded using multiple utf8 code units.
>
> 205 161 {a.
> ͡
>
> On Sun, 5 Feb 2023, Brian Schott wrote:
>
> > Consider the following experiments (beware of line wrap).
> > Why do the first and last entries "work", but not the intermediate ones?
> > How could the original "face" have been constructed?
> >
> > a. i. '( ͡° ͜ʖ ͡°)'
> > 40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
> > 205{a.
> > ?
> > 161{a.
> > ?
> > 40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41{a.
> > ( ͡° ͜ʖ ͡°)
> > --
> > (B=)
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm