Your message dated Sun, 20 Feb 2022 01:41:22 +0100
with message-id <[email protected]>
and subject line Re: atril: does not support PDF bookmarks encoded in UTF-16
has caused the Debian Bug report #998018,
regarding atril: does not support PDF bookmarks encoded in UTF-16
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)
--
998018: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=998018
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: atril
Version: 1.24.0-1+b1
Severity: normal
Most bookmarks from
http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2731.pdf
appear as "þÿ" (see attached png file), which is actually
U+FEFF ZERO WIDTH NO-BREAK SPACE, i.e. the BOM in UTF-16BE,
interpreted as ISO-8859-1.
"pdftk n2731.pdf dump_data" seems to handle that correctly,
e.g.
BookmarkBegin
BookmarkTitle: 1 þÿScope
BookmarkLevel: 1
BookmarkPageNumber: 17
-- System Information:
Debian Release: bookworm/sid
APT prefers unstable-debug
APT policy: (500, 'unstable-debug'), (500, 'stable-updates'), (500,
'stable-security'), (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1,
'experimental')
Architecture: amd64 (x86_64)
Kernel: Linux 5.14.0-3-amd64 (SMP w/8 CPU threads)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE,
TAINT_UNSIGNED_MODULE
Locale: LANG=POSIX, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled
Versions of packages atril depends on:
ii atril-common 1.24.0-1
ii dconf-gsettings-backend [gsettings-backend] 0.40.0-2
ii libatk1.0-0 2.36.0-2
ii libatrildocument3 1.24.0-1+b1
ii libatrilview3 1.24.0-1+b1
ii libc6 2.32-4
ii libcaja-extension1 1.24.0-1+b1
ii libgdk-pixbuf-2.0-0 2.42.6+dfsg-2
ii libglib2.0-0 2.70.0-3
ii libgtk-3-0 3.24.30-3
ii libice6 2:1.0.10-1
ii libsecret-1-0 0.20.4-2
ii libsm6 2:1.2.3-1
ii libxml2 2.9.12+dfsg-5
ii shared-mime-info 2.0-1
Versions of packages atril recommends:
ii dbus-user-session [default-dbus-session-bus] 1.12.20-3
ii dbus-x11 [dbus-session-bus] 1.12.20-3
ii gvfs 1.48.1-2
Versions of packages atril suggests:
pn caja <none>
ii poppler-data 0.4.11-1
pn unrar <none>
-- no debconf information
--
Vincent Lefèvre <[email protected]> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
--- End Message ---
--- Begin Message ---
On 2022-01-14 15:06:32 +0100, Vincent Lefevre wrote:
> On 2021-10-28 17:44:43 +0200, Vincent Lefevre wrote:
> > Most bookmarks from
> >
> > http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2731.pdf
> >
> > appear as "þÿ" (see attached png file), which is actually
> > U+FEFF ZERO WIDTH NO-BREAK SPACE, i.e. the BOM in UTF-16BE,
> > interpreted as ISO-8859-1.
>
> According to https://bugzilla.mozilla.org/show_bug.cgi?id=1750123
> (as Firefox has a similar issue), the bug would be in the PDF file.
The issue in the N2731 PDF was confirmed by WG14 (N2921).
Closing.
--
Vincent Lefèvre <[email protected]> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
--- End Message ---