Hello,

On Fri, Sep 27, 2024 at 07:55:58 CEST, Johannes Schlatow wrote:
> I noticed that your .txt file is UTF-8 encoded, which apparently breaks
> gosh's title and summary detection.
> Converting the file to ASCII solves the issue.

From my experience, the UTF-8 support in gosh is quite good. The issue
with Pirmin's text file stems from the use of a BOM [1] in combination
with UTF-8. A BOM is not required there, as UTF-8 is not affected by
byte order, and the Unicode standard does not recommend it either.
Nevertheless, we may improve gosh to ignore the bytes ef bb bf at the
beginning of a text file. AFAIK, the byte sequence produces garbage
when interpreted as Latin1 or Windows-1252.

[1] https://en.wikipedia.org/wiki/Byte_order_mark

Regards
-- 
Christian Helmuth
Genode Labs

https://www.genode-labs.com/ · https://genode.org/
https://floss.social/@genode · https://genodians.org/

Genode Labs GmbH · Amtsgericht Dresden · HRB 28424 · Sitz Dresden
Geschäftsführer: Dr.-Ing. Norman Feske, Christian Helmuth
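
To illustrate the BOM handling discussed above, here is a minimal
sketch in Python. It is not gosh's code (gosh itself is not written in
Python), and the function name strip_utf8_bom is hypothetical. It only
shows the idea of skipping the bytes ef bb bf at the start of a file,
and what those bytes look like when read as Latin1.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (ef bb bf), if present.

    Hypothetical helper for illustration; not part of gosh.
    """
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# A text file as an editor might save it "with BOM".
sample = codecs.BOM_UTF8 + "Title\n\nSummary text\n".encode("utf-8")

# With the BOM removed, the first line is the plain title again.
text = strip_utf8_bom(sample).decode("utf-8")

# Interpreting the three BOM bytes as Latin1 yields three stray characters.
garbage = sample[:3].decode("latin-1")
```

When reading files in Python directly, the built-in "utf-8-sig" codec
has the same effect: it drops a leading BOM if there is one and
otherwise behaves like plain UTF-8.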