Alfred created PDFBOX-4869:
------------------------------
Summary: Reading standard 14 fonts is slow
Key: PDFBOX-4869
URL: https://issues.apache.org/jira/browse/PDFBOX-4869
Project: PDFBox
Issue Type: Improvement
Components: Parsing, Text extraction
Affects Versions: 3.0.0 PDFBox
Reporter: Alfred
I am testing text extraction from PDF files and profiling the execution.
I found that the second biggest time consumer is the static initializer in
Standard14Fonts, which loads fonts from the PDFBox jar.
The culprit seems to be the direct use of the stream returned by
getResourceAsStream.
Wrapping it in a buffered stream reduces the load time a lot.
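
A minimal sketch of the idea, not the actual PDFBox code: the class
BufferedResourceSketch and its helpers (CountingStream, drainByteByByte,
openBuffered) are hypothetical names made up for illustration. It assumes the
consumer reads the stream one byte at a time, and it counts how many read
calls reach the underlying stream with and without a BufferedInputStream in
between. An in-memory stream stands in for the jar resource.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

public class BufferedResourceSketch {

    /** Hypothetical helper: counts the read calls that reach the wrapped stream. */
    public static final class CountingStream extends InputStream {
        private final InputStream in;
        public int calls = 0;

        public CountingStream(InputStream in) {
            this.in = in;
        }

        @Override
        public int read() throws IOException {
            calls++;
            return in.read();
        }

        @Override
        public int read(byte[] b, int off, int len) throws IOException {
            calls++;
            return in.read(b, off, len);
        }
    }

    /** Reads the stream one byte at a time and returns the number of bytes read. */
    public static long drainByteByByte(InputStream in) throws IOException {
        long n = 0;
        while (in.read() != -1) {
            n++;
        }
        return n;
    }

    /** The proposed change in one place: wrap the resource stream before use. */
    public static InputStream openBuffered(Class<?> anchor, String name) {
        InputStream raw = anchor.getResourceAsStream(name);
        return raw == null ? null : new BufferedInputStream(raw);
    }

    public static void main(String[] args) throws IOException {
        byte[] data = new byte[64 * 1024]; // stand-in for a font resource

        // Direct use: every single-byte read hits the underlying stream.
        CountingStream direct = new CountingStream(new ByteArrayInputStream(data));
        drainByteByByte(direct);

        // Buffered use: the underlying stream is only hit when the buffer refills.
        CountingStream wrapped = new CountingStream(new ByteArrayInputStream(data));
        drainByteByByte(new BufferedInputStream(wrapped));

        System.out.println("underlying reads, direct:   " + direct.calls);
        System.out.println("underlying reads, buffered: " + wrapped.calls);
    }
}
```

With a resource inside a jar, each call on the underlying stream goes through
the decompression layer, so reducing the number of calls is where the saving
would be expected to come from. Actual timings depend on the JVM and on how
the font parser reads its input.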
--
This message was sent by Atlassian Jira
(v8.3.4#803005)