Package: libpoppler118
Version: 22.02.0-3
Severity: wishlist
Tags: patch upstream
Forwarded: https://gitlab.freedesktop.org/poppler/poppler/-/merge_requests/1215

LaTeX with the amssymb package generates the character names
lessorequalslant and greaterorequalslant for \leqslant and \geqslant.
poppler does not currently support these character names. For
instance, on the attached PDF file (consisting of these two characters
only), pdftotext gives "6>", which is meaningless. With the attached
patch, I get "⩽⩾" as expected, which corresponds to the Unicode
characters

  U+2A7D LESS-THAN OR SLANTED EQUAL TO
  U+2A7E GREATER-THAN OR SLANTED EQUAL TO

(Alternatively, the non-slanted versions U+2264 and U+2265 could be
used instead, as they are better supported by fonts, but the right
solution would be to fix the fonts.)

Bug forwarded to the merge request I've submitted upstream.

-- System Information:
Debian Release: bookworm/sid
  APT prefers unstable-debug
  APT policy: (500, 'unstable-debug'), (500, 'stable-updates'), (500, 'stable-security'), (500, 'unstable'), (500, 'testing'), (500, 'stable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 5.17.0-3-amd64 (SMP w/8 CPU threads; PREEMPT)
Kernel taint flags: TAINT_PROPRIETARY_MODULE, TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=POSIX, LC_CTYPE=C.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages libpoppler118 depends on:
ii  libc6            2.33-7
ii  libfontconfig1   2.13.1-4.4
ii  libfreetype6     2.12.1+dfsg-2
ii  libjpeg62-turbo  1:2.1.2-1
ii  liblcms2-2       2.12~rc1-2
ii  libnspr4         2:4.34-1
ii  libnss3          2:3.79-1
ii  libopenjp2-7     2.4.0-6
ii  libpng16-16      1.6.37-5
ii  libstdc++6       12.1.0-4
ii  libtiff5         4.4.0-2
ii  zlib1g           1:1.2.11.dfsg-4

Versions of packages libpoppler118 recommends:
ii  poppler-data  0.4.11-1

libpoppler118 suggests no packages.

-- no debconf information

-- 
Vincent Lefèvre <vinc...@vinc17.net> - Web: <https://www.vinc17.net/>
100% accessible validated (X)HTML - Blog: <https://www.vinc17.net/blog/>
Work: CR INRIA - computer arithmetic / AriC project (LIP, ENS-Lyon)

Description: add ToUnicode support for {greater,less}orequalslant
 Character names lessorequalslant and greaterorequalslant are generated
 from \leqslant and \geqslant in LaTeX with the amssymb package. They
 correspond to the Unicode characters
   U+2A7D LESS-THAN OR SLANTED EQUAL TO
   U+2A7E GREATER-THAN OR SLANTED EQUAL TO
 (alternatively, the non-slanted versions U+2264 and U+2265 could be
 used instead, as they are better supported by fonts).
Author: Vincent Lefevre <vinc...@vinc17.net>
Last-Update: 2022-06-17

Index: poppler-22.02.0/poppler/NameToUnicodeTable.h
===================================================================
--- poppler-22.02.0.orig/poppler/NameToUnicodeTable.h
+++ poppler-22.02.0/poppler/NameToUnicodeTable.h
@@ -2044,6 +2044,7 @@ static const struct NameToUnicodeTab nam
  { 0x2265, "greaterequal" },
  { 0x22db, "greaterequalorless" },
  { 0xff1e, "greatermonospace" },
+ { 0x2a7e, "greaterorequalslant" },
  { 0x2273, "greaterorequivalent" },
  { 0x2277, "greaterorless" },
  { 0x2267, "greateroverequal" },
@@ -2536,6 +2537,7 @@ static const struct NameToUnicodeTab nam
  { 0x2264, "lessequal" },
  { 0x22da, "lessequalorgreater" },
  { 0xff1c, "lessmonospace" },
+ { 0x2a7d, "lessorequalslant" },
  { 0x2272, "lessorequivalent" },
  { 0x2276, "lessorgreater" },
  { 0x2266, "lessoverequal" },
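
For readers who want to check the mapping outside poppler, here is a minimal Python sketch of the kind of lookup the patched table enables. It is an illustration only, not poppler's C++ code: the dictionary NAME_TO_UNICODE and the helpers glyph_name_to_char and describe are hypothetical names, and the dictionary holds just four entries taken from the patch above. The Unicode character names come from Python's standard unicodedata module.

```python
import unicodedata

# Hypothetical stand-in for a small slice of a glyph-name-to-Unicode table.
# The four entries are copied from the patch; everything else is omitted.
NAME_TO_UNICODE = {
    "greaterequal": 0x2265,
    "greaterorequalslant": 0x2A7E,  # added by the patch
    "lessequal": 0x2264,
    "lessorequalslant": 0x2A7D,  # added by the patch
}


def glyph_name_to_char(name):
    """Return the mapped character, or None when the name is unknown."""
    code_point = NAME_TO_UNICODE.get(name)
    return None if code_point is None else chr(code_point)


def describe(name):
    """Format a mapping as 'name: U+XXXX UNICODE NAME', or mark it unmapped."""
    char = glyph_name_to_char(name)
    if char is None:
        return f"{name}: unmapped"
    return f"{name}: U+{ord(char):04X} {unicodedata.name(char)}"


if __name__ == "__main__":
    for glyph in ("lessorequalslant", "greaterorequalslant"):
        print(describe(glyph))
```

A name that is missing from the table, as the two slanted names were before the patch, comes back as None in this sketch, so the caller has no Unicode value to emit for it.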
slant.pdf
Description: Adobe PDF document