Matti Oinas created PDFBOX-4728: ----------------------------------- Summary: Broken PDF after load and save Key: PDFBOX-4728 URL: https://issues.apache.org/jira/browse/PDFBOX-4728 Project: PDFBox Issue Type: Bug Components: Parsing, Writing Affects Versions: 2.0.18, 3.0.0 PDFBox Reporter: Matti Oinas
If read was done using WINDOWS-1252 charset and writing is done using UTF-8 then resulting PDF will be broken after load and save operations. {{PDDocument document = PDDocument.load(sourcePath);}} {{document.save(targetPath);}} If source PDF contains XObject dictionary reference whose name isn't encoded in UTF-8. For example. /L#f8vetann 16 0 R That is read using WINDOWS-1252 encoding. Now if write operation is using UTF-8 then the resulting name will be /L#3Fvetann 16 0 R And resulting PDF is broken and image is missing. FIX in pull request: https://github.com/apache/pdfbox/pull/77 -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org