Hello,

On Fri, Sep 27, 2024 at 07:55:58 CEST, Johannes Schlatow wrote:
> I noticed that your .txt file is UTF-8 encoded, which apparently breaks 
> gosh's title and summary detection.
> Converting the file to ASCII solves the issue.

>From my experience, the UTF-8 support in gosh is quite good. The issue
with Pirmins text file stems from the use of a BOM [1] in combination
with UTF-8, which is not required as UTF-8 is not affected by byte
order and also not recommended by the Unicode standard.

Nevertheless, we may improve gosh to ignore ef bb bf at the beginning
of a text file. AFAIK the byte sequence produces garbage in Latin1 and
Windows-1252.

[1] https://en.wikipedia.org/wiki/Byte_order_mark

Regards
-- 
Christian Helmuth
Genode Labs

https://www.genode-labs.com/ · https://genode.org/
https://floss.social/@genode · https://genodians.org/

Genode Labs GmbH · Amtsgericht Dresden · HRB 28424 · Sitz Dresden
Geschäftsführer: Dr.-Ing. Norman Feske, Christian Helmuth
_______________________________________________
users mailing list -- users@lists.genode.org
To unsubscribe send an email to users-le...@lists.genode.org
Archived at 
https://lists.genode.org/mailman3/hyperkitty/list/users@lists.genode.org/message/KUUROGO7F2QKVSUXERCPRMAMLFWKJWAO/

Reply via email to