Dear LuaTeX developers,

assume the following plain TeX document `example.tex`:

    \newwrite\outfile
    \openout\outfile\jobname.out
    \write\outfile{^^^^fffd}
    \closeout\outfile
    \bye

Running `luatex example` will correctly produce file `example.out` with the
UTF-8 encoding of U+FFFD: 0xEF, 0xBF, and 0xBD.

    $ hexdump -C
    00000000  ef bf bd 0a                                       |....|
    00000004

Now, let's change `example.tex` as follows:

    \input\jobname.out
    \bye

Running `luatex example` produces the following error:

    ! String contains an invalid utf-8 sequence.

I would expect that LuaTeX would treat ^^^^fffd and the byte sequence 0xEF,
0xBF, and 0xBD the same. This issue was co-discovered by @lostenderman at
<https://github.com/lostenderman/markdown/issues/34>.

Best,
Vitek

Attachment: signature.asc
Description: PGP signature

Reply via email to