[ https://issues.apache.org/jira/browse/PDFBOX-5397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17513980#comment-17513980 ]
Tobias Hugendubel commented on PDFBOX-5397:
-------------------------------------------

We have now manually added a timeout to our placeholder search and successfully use a fallback for problematic files when the timeout is hit. We tried a validate method of PDFBox, but it did not let us really identify the problematic files; this might be one of your future improvements. For the initial problem we have now found a solution. Thanks a lot for your feedback, that helped. The ticket is closed.

> Certain PDF cannot be processed
> -------------------------------
>
> Key: PDFBOX-5397
> URL: https://issues.apache.org/jira/browse/PDFBOX-5397
> Project: PDFBox
> Issue Type: Bug
> Components: Rendering
> Affects Versions: 2.0.24, 2.0.25
> Reporter: Tobias Hugendubel
> Priority: Blocker
> Fix For: 2.0.26, 3.0.0 PDFBox
>
> Attachments: TET_5_4xxx_GR_00_00_XX_14_F.pdf,
> TSA_5_4xxx_SH_XX_05_XX_03_F.pdf, TSA_5_4xxx_SH_XX_05_XX_03_F1.jpg,
> TSA_5_BF2x_GR_-1_01_XX_09_F.pdf, image-2022-03-23-12-25-56-963.png,
> image-2022-03-23-12-29-03-735.png
>
>
> !https://cdn.discordapp.com/attachments/381016918703996928/955833631484563566/unknown.png|width=570,height=291!
> We use PDFBox to open a PDF, scan it for predefined dummy QR codes, and then
> replace each dummy with a real QR code. For certain PDFs we either get the
> above error, or the process does not terminate.
> Sample files are TET_5_4xxx_GR_00_00_XX_14_F.pdf and
> TSA_5_BF2x_GR_-1_01_XX_09_F.pdf.
> They both lead to the above problem.
> Our own analysis so far is that it might be the same issue as mentioned in
> [https://stackoverflow.com/questions/69237146/pdfbox-renderimagewithdpi-hangs-sometimes]
>
> For our company PMG Projektraum GmbH, Munich, Germany, this is an essential
> function. Currently users cannot download PDFs in most cases because of this:
> the download tries to add a QR code and never ends.
> It could be that the QR code on these PDFs causes issues because it is in a
> certain layer.
> We observed that in Adobe Reader we see a QR code, but with our
> own viewer it is invisible:
> This is TSA_5_4xxx_SH_XX_05_XX_03_F.pdf,
> where we have the same issues.
> Any idea what this might be and how to solve it?
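
For readers who land on this ticket with the same symptom, the timeout-plus-fallback workaround mentioned in the comment could look roughly like the sketch below. It is not the reporter's actual code. The class `TimedSearch`, the method `runWithTimeout`, and the `search` and `fallback` parameters are hypothetical names chosen for illustration, and the sketch uses only `java.util.concurrent`, with no PDFBox calls. The potentially hanging placeholder search is assumed to be wrapped in a `Callable`.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

// Hypothetical helper, not part of PDFBox.
final class TimedSearch {

    private TimedSearch() {}

    /**
     * Runs `search` on a worker thread and waits at most `timeoutMillis`.
     * If the search times out, throws, or the caller is interrupted,
     * the value produced by `fallback` is returned instead.
     */
    static <T> T runWithTimeout(Callable<T> search, long timeoutMillis, Supplier<T> fallback) {
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "placeholder-search");
            t.setDaemon(true); // a stuck search must not keep the JVM alive
            return t;
        });
        Future<T> future = executor.submit(search);
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Interrupts the worker; this only stops it if the task reacts to interruption.
            future.cancel(true);
            return fallback.get();
        } catch (ExecutionException e) {
            // The search itself failed; treat the file as problematic.
            return fallback.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt status
            future.cancel(true);
            return fallback.get();
        } finally {
            executor.shutdownNow();
        }
    }
}
```

A call might look like `TimedSearch.runWithTimeout(() -> findPlaceholders(file), 30_000, () -> defaultPositions())`, where `findPlaceholders` and `defaultPositions` are again hypothetical. Note that `future.cancel(true)` only requests interruption: a CPU-bound loop that never checks the interrupt flag may keep running in the background, which is why the worker is a daemon thread here.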