Also, to extract unicode characters as single elements of an array, I
would use 9 u:

   #9 u:a.{~40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
11

   ,.9 u:a.{~40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
(

͡
°

͜
ʖ

͡
°
)

(However, when writing to a file, I'd convert those back to a utf-8
byte stream, using 8 u:)

((That said, note also that unicode is not a single standard but a
suite of standards. And https://blog.unicode.org/ might be a good
place to get an overview of some of those things.))

FYI,

-- 
Raul

On Sun, Feb 5, 2023 at 10:26 AM Elijah Stone <[email protected]> wrote:
>
> A single unicode code point may be encoded using multiple utf8 code units.
>
>     205 161 {a.
>   ͡
>
> On Sun, 5 Feb 2023, Brian Schott wrote:
>
> > Consider the following experiments (beware of line wrap).
> > Why do the first and last entries "work", but not the intermediate ones?
> > How could the original "face" have been constructed?
> >
> >   a. i. '( ͡° ͜ʖ ͡°)'
> > 40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41
> >   205{a.
> > ?
> >   161{a.
> > ?
> >   40 32 205 161 194 176 32 205 156 202 150 32 205 161 194 176 41{a.
> > ( ͡° ͜ʖ ͡°)
> > --
> > (B=)
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm

Reply via email to