Package: libpoppler118
Version: 22.02.0-3
Severity: wishlist
Tags: patch upstream
Forwarded: https://gitlab.freedesktop.org/poppler/poppler/-/merge_requests/1215

LaTeX with the amssymb package generates lessorequalslant and
greaterorequalslant character names for \leqslant and \geqslant.

These character names are currently not supported by poppler.
For instance, on the attached PDF file (consisting of these two
characters only), pdftotext gives "6>", which is meaningless.

With the attached patch, I get "⩽⩾" as expected; these correspond
to the Unicode characters
  U+2A7D LESS-THAN OR SLANTED EQUAL TO
  U+2A7E GREATER-THAN OR SLANTED EQUAL TO

(Alternatively, the non-slanted versions U+2264 and U+2265 could be
used instead, as they are better supported by fonts, but the right
solution would be to fix the fonts.)
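
To illustrate what the two new table entries enable, here is a minimal
sketch of a glyph-name-to-Unicode lookup. It is not poppler's actual
code: the GlyphEntry type, the kGlyphTable excerpt, and the functions
lookupGlyphName() and toUtf8() are hypothetical names chosen for this
example, and the binary search is only one possible way to query a
name-sorted table.

```cpp
// Simplified sketch of a glyph-name-to-Unicode lookup; NOT poppler's code.
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <cstring>
#include <iterator>
#include <string>

// Hypothetical entry type, modelled on the { code point, name } rows
// that the patch adds to NameToUnicodeTable.h.
struct GlyphEntry {
    std::uint32_t unicode;
    const char *name;
};

// Small excerpt, kept in strcmp() order so that it can be binary-searched.
static const GlyphEntry kGlyphTable[] = {
    { 0x2265, "greaterequal" },
    { 0x2a7e, "greaterorequalslant" }, // added by the patch
    { 0x2264, "lessequal" },
    { 0x2a7d, "lessorequalslant" },    // added by the patch
};

// Orders two entries by glyph name.
static bool nameLess(const GlyphEntry &a, const GlyphEntry &b) {
    return std::strcmp(a.name, b.name) < 0;
}

// Returns the code point for a glyph name, or 0 if the name is unknown.
std::uint32_t lookupGlyphName(const char *name) {
    const GlyphEntry *first = std::begin(kGlyphTable);
    const GlyphEntry *last = std::end(kGlyphTable);
    assert(std::is_sorted(first, last, nameLess));
    const GlyphEntry *it = std::lower_bound(
        first, last, name,
        [](const GlyphEntry &e, const char *key) {
            return std::strcmp(e.name, key) < 0;
        });
    if (it != last && std::strcmp(it->name, name) == 0) {
        return it->unicode;
    }
    return 0;
}

// Encodes a code point below U+10000 as UTF-8 (enough for this example).
std::string toUtf8(std::uint32_t cp) {
    std::string out;
    if (cp < 0x80) {
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
    return out;
}
```

In this sketch, lookupGlyphName("lessorequalslant") returns 0x2A7D, and
a name missing from the table returns 0, which stands in for the
"unsupported name" case described above.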

Bug forwarded to the merge request I've submitted upstream.

-- System Information:
Debian Release: bookworm/sid
  APT prefers unstable-debug
  APT policy: (500, 'unstable-debug'), (500, 'stable-updates'), (500, 'stable-security'), (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 5.17.0-3-amd64 (SMP w/8 CPU threads; PREEMPT)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=POSIX, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages libpoppler118 depends on:
ii  libc6            2.33-7
ii  libfontconfig1   2.13.1-4.4
ii  libfreetype6     2.12.1+dfsg-2
ii  libjpeg62-turbo  1:2.1.2-1
ii  liblcms2-2       2.12~rc1-2
ii  libnspr4         2:4.34-1
ii  libnss3          2:3.79-1
ii  libopenjp2-7     2.4.0-6
ii  libpng16-16      1.6.37-5
ii  libstdc++6       12.1.0-4
ii  libtiff5         4.4.0-2
ii  zlib1g           1:1.2.11.dfsg-4

Versions of packages libpoppler118 recommends:
ii  poppler-data  0.4.11-1

libpoppler118 suggests no packages.

-- no debconf information

-- 
Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)
Description: add ToUnicode support for {greater,less}orequalslant
 Character names lessorequalslant and greaterorequalslant are generated
 from \leqslant and \geqslant in LaTeX with the amssymb package. They
 correspond to the Unicode characters
   U+2A7D LESS-THAN OR SLANTED EQUAL TO
   U+2A7E GREATER-THAN OR SLANTED EQUAL TO
 (alternatively, the non-slanted versions U+2264 and U+2265 could be
 used instead, as they are better supported by fonts).
Author: Vincent Lefevre <vinc...@vinc17.net>
Last-Update: 2022-06-17

Index: poppler-22.02.0/poppler/NameToUnicodeTable.h
===================================================================
--- poppler-22.02.0.orig/poppler/NameToUnicodeTable.h
+++ poppler-22.02.0/poppler/NameToUnicodeTable.h
@@ -2044,6 +2044,7 @@ static const struct NameToUnicodeTab nam
                                                                 { 0x2265, "greaterequal" },
                                                                 { 0x22db, "greaterequalorless" },
                                                                 { 0xff1e, "greatermonospace" },
+                                                                { 0x2a7e, "greaterorequalslant" },
                                                                 { 0x2273, "greaterorequivalent" },
                                                                 { 0x2277, "greaterorless" },
                                                                 { 0x2267, "greateroverequal" },
@@ -2536,6 +2537,7 @@ static const struct NameToUnicodeTab nam
                                                                 { 0x2264, "lessequal" },
                                                                 { 0x22da, "lessequalorgreater" },
                                                                 { 0xff1c, "lessmonospace" },
+                                                                { 0x2a7d, "lessorequalslant" },
                                                                 { 0x2272, "lessorequivalent" },
                                                                 { 0x2276, "lessorgreater" },
                                                                 { 0x2266, "lessoverequal" },

Attachment: slant.pdf
Description: Adobe PDF document