[ https://issues.apache.org/jira/browse/PDFBOX-5848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17862867#comment-17862867 ]
Tilman Hausherr edited comment on PDFBOX-5848 at 7/3/24 6:15 PM: ----------------------------------------------------------------- I'm testing with 3.0.2 and it's working very slowly... I'm at page 170. However there's a different problem, lots of orphan pages. The reason is that some annotations have a /Parent entry which has a /Kids entry whose children are annotations on *different* pages. Opening and saving it with Adobe Reader brings a much smaller file, where the /Parent entry value is set to null. !screenshot-1.png! was (Author: tilman): I'm testing with 3.0.2 and it's working very slowly... I'm at page 170. However there's a different problem, lots of orphan pages. The reason is that some annotations have a /Parent entry which has a /Kids entry whose children are annotations on *different* pages. !screenshot-1.png! > Infinite loop processing PDF > ---------------------------- > > Key: PDFBOX-5848 > URL: https://issues.apache.org/jira/browse/PDFBOX-5848 > Project: PDFBox > Issue Type: Bug > Affects Versions: 3.0.2 PDFBox > Reporter: Joan Fisbein > Priority: Major > Attachments: cbc0018b-5659-4ae3-9887-0e0a2d9a62a7.pdf, > screenshot-1.png > > > I use PDFBox to split hundreds of PDFs per day, generally, everything works > flawlessly but I just received a PDF that generates an infinite loop when I > try to split it. > > I used this Java code to reproduce it using PDFBox 3.0.2 (haven't tried other > versions): > {code:java} > private static void splitPdf(File fileToSplit) { > try (PDDocument document = Loader.loadPDF(fileToSplit)) { > int documentPages = document.getNumberOfPages(); > Splitter splitter = new Splitter(); > List<PDDocument> Pages = splitter.split(document); > Iterator<PDDocument> iterator = Pages.listIterator(); > while (iterator.hasNext()) { > PDDocument pd = iterator.next(); > pd.save(fileToSplit.getName() + "-" + Pages.indexOf(pd) + ".pdf"); > pd.close(); > } > } catch (IOException e) { > throw new RuntimeException(e); > } > } {code} > The PDF file is attached to the issue -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org