https://bugs.kde.org/show_bug.cgi?id=334068
--- Comment #8 from Albert Astals Cid ---
I am pretty sure poppler supports tagged PDF; maybe it's just not exported. Do
you think you could have a look? I'm also the poppler maintainer so it should
not be a problem getting patches in (any more than my
--- Comment #7 from Jaan Vajakas ---
When testing with some PDF documents on my hard drive, I found that improving
this bug would cause a regression for some PDFs (OCR'ed papers) from JSTOR
which have slightly wrong bounding rectangles; for those docume
--- Comment #6 from Albert Astals Cid ---
Jaan, poppler supports tagged PDF; you can always have a look at it.
Improvements that would make this bug go away are always welcome :-)
--
You are receiving this mail because:
You are the assignee for the
--- Comment #5 from Dmitry ---
Albert, Jaan, thank you for your comments!
Jaan, although (1) would not help with the current file, you said that
you can modify Okular's layout detection algorithm so that it will be able to
detect text like we have
--- Comment #4 from Jaan Vajakas ---
The problem with this file is that the bounding boxes of "T" and "A" overlap
and Okular's layout detection algorithm only considers two glyphs to belong to
the same word if the second one's bounding box touches the f
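
The word-grouping rule described in this comment can be illustrated with a minimal sketch. This is not Okular's actual code: the `Box` type, the function names, the tolerance values, and the glyph coordinates are all hypothetical, and "touches" is assumed here to mean that the second box starts where the previous one ends, within a small tolerance. The second predicate shows one possible relaxation that also accepts a limited overlap, as happens with a kerned "T" and "A".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Horizontal extent of a glyph's bounding box (hypothetical type)."""
    left: float
    right: float


def touches_strict(a: Box, b: Box, eps: float = 0.5) -> bool:
    # Assumed strict rule: b must start where a ends, within eps.
    return abs(b.left - a.right) <= eps


def touches_or_overlaps(a: Box, b: Box, eps: float = 0.5,
                        max_overlap: float = 0.5) -> bool:
    # Relaxed rule (an assumption, not a confirmed fix): also accept b
    # starting inside a, as long as the overlap is at most max_overlap
    # times the width of the narrower of the two boxes.
    gap = b.left - a.right
    if gap > eps:
        return False
    narrower = min(a.right - a.left, b.right - b.left)
    return -gap <= max_overlap * narrower + eps


def group_words(boxes, same_word):
    """Group glyph boxes, given in reading order, into words."""
    words = []
    for box in boxes:
        if words and same_word(words[-1][-1], box):
            words[-1].append(box)
        else:
            words.append([box])
    return words


# Made-up boxes for "TAD": kerning pulls "A" partly under "T".
T, A, D = Box(0.0, 10.0), Box(8.0, 18.0), Box(18.0, 27.0)

strict_words = group_words([T, A, D], touches_strict)
relaxed_words = group_words([T, A, D], touches_or_overlaps)
```

With these made-up coordinates, the strict rule splits "T" from "AD", while the relaxed rule keeps all three glyphs in one word. A relaxation of this kind can merge glyphs that should stay apart when the bounding rectangles themselves are inaccurate, so the overlap limit would need tuning against real documents.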
Albert Astals Cid changed:
What    |Removed      |Added
CC      |             |jaanvaja...@hot.ee
--- Comment #3 from Albe
Albert Astals Cid changed:
What    |Removed      |Added
Status  |UNCONFIRMED  |CONFIRMED
CC|
--- Comment #1 from Dmitry ---
Created attachment 86345
--> https://bugs.kde.org/attachment.cgi?id=86345&action=edit
PDF document in which Okular failed to find an occurrence of the "tad" string