[ 
https://issues.apache.org/jira/browse/PDFBOX-4297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17257709#comment-17257709
 ] 

Tilman Hausherr edited comment on PDFBOX-4297 at 1/3/21, 10:55 AM:
-------------------------------------------------------------------

I was able to check your file with streams in the same amount of time. Btw the 
proposed method doesn't work because it returns a closed input stream. There is 
more work to do, the method you mentioned, and then the check for 
"adbe.x509.rsa_sha1" (update: turns out that this one isn't part of our 
repository, because it contains code from another project).


was (Author: tilman):
I was able to check your file with streams in the same amount of time. Btw the 
proposed method doesn't work because it returns a closed input stream. There is 
more work to do, the method you mentioned, and then the check for 
"adbe.x509.rsa_sha1".

> Allow to space efficiently analyse large PDFs
> ---------------------------------------------
>
>                 Key: PDFBOX-4297
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4297
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>            Reporter: Ralf Hauser
>            Priority: Major
>         Attachments: programWinter2015_20210103_091853-sig_LTV.pdf
>
>
> Assume you get a 300+MB large pdf and need to know
> 1) the file names of embedded files if any
> 2) whether it is encrypted (symmetric or asymmetric)
> 3) certification level (and whether it is signed)
> This should not use more than 5 MB (extra) memory
>  
> P.S.: seems to an exampe of https://pdfbox.apache.org/ideas.html  "Handle 
> large PDF files"
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to