[ 
https://issues.apache.org/jira/browse/PDFBOX-5397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17513980#comment-17513980
 ] 

Tobias Hugendubel commented on PDFBOX-5397:
-------------------------------------------

We have now manually added a timeout to our placeholder search and successfully 
use a fallback for problematic files when that timeout is hit.
We also tried a validate method of PDFBox, but it did not let us reliably 
identify problematic files; this might be one of your future improvements.

We have now found a solution for the initial problem. Thanks a lot for your 
feedback, that helped.
The ticket is closed.
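
For readers who hit the same hang, the timeout-plus-fallback idea mentioned 
in this comment could look roughly like the sketch below. This is not our 
actual code and it is not a PDFBox API: the class name TimeoutFallback, the 
method callWithTimeout, and the thread name are hypothetical, and the slow 
task in main only simulates a placeholder search that never returns. It uses 
only java.util.concurrent from the JDK.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

// Hypothetical helper: runs a task with a time limit and falls back if it
// does not finish in time or fails.
final class TimeoutFallback {

    static <T> T callWithTimeout(Callable<T> task, long timeoutMillis, Supplier<T> fallback) {
        // Daemon thread, so a task that ignores interruption cannot keep the JVM alive.
        ExecutorService executor = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "placeholder-search");
            t.setDaemon(true);
            return t;
        });
        Future<T> future = executor.submit(task);
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Request cancellation; this only interrupts the worker thread.
            future.cancel(true);
            return fallback.get();
        } catch (InterruptedException e) {
            // Restore the caller's interrupt flag before falling back.
            Thread.currentThread().interrupt();
            future.cancel(true);
            return fallback.get();
        } catch (ExecutionException e) {
            // The task itself threw; treat the file as problematic.
            return fallback.get();
        } finally {
            executor.shutdownNow();
        }
    }

    public static void main(String[] args) {
        // Simulated search that finishes quickly.
        String fast = callWithTimeout(() -> "placeholder found", 1000, () -> "fallback");
        // Simulated search that takes far longer than the limit.
        String slow = callWithTimeout(() -> {
            Thread.sleep(60_000);
            return "placeholder found";
        }, 200, () -> "fallback");
        System.out.println(fast);
        System.out.println(slow);
    }
}
```

One caveat with this approach: Future.cancel(true) only interrupts the 
worker thread. Code that never checks the interrupt flag keeps running in 
the background and keeps using CPU until it ends on its own, so the timeout 
protects the caller but does not by itself stop the underlying work.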

> Certain PDF cannot be processed
> -------------------------------
>
>                 Key: PDFBOX-5397
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5397
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 2.0.24, 2.0.25
>            Reporter: Tobias Hugendubel
>            Priority: Blocker
>             Fix For: 2.0.26, 3.0.0 PDFBox
>
>         Attachments: TET_5_4xxx_GR_00_00_XX_14_F.pdf, 
> TSA_5_4xxx_SH_XX_05_XX_03_F.pdf, TSA_5_4xxx_SH_XX_05_XX_03_F1.jpg, 
> TSA_5_BF2x_GR_-1_01_XX_09_F.pdf, image-2022-03-23-12-25-56-963.png, 
> image-2022-03-23-12-29-03-735.png
>
>
> !https://cdn.discordapp.com/attachments/381016918703996928/955833631484563566/unknown.png|width=570,height=291!
> For certain PDFs where we use PDFBox to open a PDF, scan for defined dummy QR 
> codes on it, and then replace the dummy with a real QR code, we either get 
> the above error, or the process does not terminate.
> Sample files are TET_5_4xxx_GR_00_00_XX_14_F.pdf and 
> TSA_5_BF2x_GR_-1_01_XX_09_F.pdf.
> They both lead to the above problem.
> Our own analysis so far is that it might be the same issue as mentioned in 
> [https://stackoverflow.com/questions/69237146/pdfbox-renderimagewithdpi-hangs-sometimes]
>  
> For our company PMG Projektraum GmbH, Munich, Germany, this is an essential 
> function. Currently users cannot download PDFs in most cases because of this: 
> The download tries to add a QR code and never ends.
> It could be that the QR code on these PDFs causes issues because it is in a 
> certain layer. We observed that in Adobe Reader we see a QR code, but with 
> our own viewer it is invisible:
> This is TSA_5_4xxx_SH_XX_05_XX_03_F.pdf,
> where we have the same issues.
> Any idea what this might be and how to solve it?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
