[
https://issues.apache.org/jira/browse/PDFBOX-5143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521488#comment-17521488
]
ASF subversion and git services commented on PDFBOX-5143:
-
Commi
Yeah, PDFBOX-5413 fixes that one as well. 👍
Tilman
Am 12.04.2022 um 19:26 schrieb Tilman Hausherr:
Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf .
There is some sort of problem with an incremental save, a part of the
multi-content stream is missing / has a new object number. Lets wait
[
https://issues.apache.org/jira/browse/PDFBOX-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521487#comment-17521487
]
Tilman Hausherr commented on PDFBOX-5403:
-
Thanks, the Schleuse file is better,
[
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521478#comment-17521478
]
ASF subversion and git services commented on PDFBOX-4892:
-
Commi
[
https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521479#comment-17521479
]
ASF subversion and git services commented on PDFBOX-4892:
-
Commi
[
https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521473#comment-17521473
]
ASF subversion and git services commented on PDFBOX-5413:
-
Commi
[
https://issues.apache.org/jira/browse/PDFBOX-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521472#comment-17521472
]
ASF subversion and git services commented on PDFBOX-5413:
-
Commi
[
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17521382#comment-17521382
]
Tim Allison commented on PDFBOX-5415:
-
Michael Demey's diagnosis:
https://twitter.c
[
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tilman Hausherr updated PDFBOX-5415:
Attachment: PDFBOX-5415-TIKA-3718-p10.pdf
> Infinite loop in ExtractText in 2.x branch on
[
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated PDFBOX-5415:
Affects Version/s: 2.0.26
> Infinite loop in ExtractText in 2.x branch on a specific pdf
> ---
[
https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated PDFBOX-5415:
Component/s: Parsing
> Infinite loop in ExtractText in 2.x branch on a specific pdf
>
Tim Allison created PDFBOX-5415:
---
Summary: Infinite loop in ExtractText in 2.x branch on a specific
pdf
Key: PDFBOX-5415
URL: https://issues.apache.org/jira/browse/PDFBOX-5415
Project: PDFBox
Only one left: 7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M.pdf .
There is some sort of problem with an incremental save, a part of the
multi-content stream is missing / has a new object number. Lets wait
whether it is related to PDFBOX-5413 .
(The other one, HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5.pdf is an im
Only
commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M
commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5
have a different text extraction
With the other two it's attachment file names or doc info.
Tilman
Am 12.04.2022 um 08:16 schrieb Tilman Hausherr:
After having looked at the content difference
14 matches
Mail list logo