[jira] [Commented] (PDFBOX-5143) Refactor/Simplify CFF parsing

2022-04-12 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521488#comment-17521488 ] ASF subversion and git services commented on PDFBOX-5143: - Commi

Re: 2.0.26 release

2022-04-12 Thread Tilman Hausherr
Yeah, PDFBOX-5413 fixes that one as well. 👍 Tilman On 12.04.2022 at 19:26, Tilman Hausherr wrote: Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf . There is some sort of problem with an incremental save: a part of the multi-content stream is missing / has a new object number. Let's wait

[jira] [Commented] (PDFBOX-5403) Blurry / distorted rendering

2022-04-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521487#comment-17521487 ] Tilman Hausherr commented on PDFBOX-5403: - Thanks, the Schleuse file is better,

[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2022-04-12 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521478#comment-17521478 ] ASF subversion and git services commented on PDFBOX-4892: - Commi

[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2022-04-12 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521479#comment-17521479 ] ASF subversion and git services commented on PDFBOX-4892: - Commi

[jira] [Commented] (PDFBOX-5413) Field text missing

2022-04-12 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521473#comment-17521473 ] ASF subversion and git services commented on PDFBOX-5413: - Commi

[jira] [Commented] (PDFBOX-5413) Field text missing

2022-04-12 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521472#comment-17521472 ] ASF subversion and git services commented on PDFBOX-5413: - Commi

[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521382#comment-17521382 ] Tim Allison commented on PDFBOX-5415: - Michael Demey's diagnosis: https://twitter.c

[jira] [Updated] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-5415: Attachment: PDFBOX-5415-TIKA-3718-p10.pdf > Infinite loop in ExtractText in 2.x branch on

[jira] [Updated] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5415: Affects Version/s: 2.0.26 > Infinite loop in ExtractText in 2.x branch on a specific pdf > ---

[jira] [Updated] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5415: Component/s: Parsing > Infinite loop in ExtractText in 2.x branch on a specific pdf >

[jira] [Created] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5415: --- Summary: Infinite loop in ExtractText in 2.x branch on a specific pdf Key: PDFBOX-5415 URL: https://issues.apache.org/jira/browse/PDFBOX-5415 Project: PDFBox

Re: 2.0.26 release

2022-04-12 Thread Tilman Hausherr
Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf . There is some sort of problem with an incremental save: a part of the multi-content stream is missing / has a new object number. Let's wait and see whether it is related to PDFBOX-5413. (The other one, HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5.pdf is an im

Re: 2.0.26 release

2022-04-12 Thread Tilman Hausherr
Only commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M and commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5 have a different text extraction. With the other two it's attachment file names or doc info. Tilman On 12.04.2022 at 08:16, Tilman Hausherr wrote: After having looked at the content difference