for working with bookmarks you can look at http://pdfbox.apache.org/userguide/bookmarks.html for how to split a PDF you could use/review org.apache.pdfbox.util.Splitter
Kind regards Maruan Sahyoun Am 14.11.2012 um 11:30 schrieb Tzali Maimon <tzali.mai...@eqsquest.com>: > Thanks for the attention. > I assume I would like to take a look on both API and decide. It sounds > though that splitting according to bookmarks is easy so Can you tell me how > to pull that off plz? > > > On Wed, Nov 14, 2012 at 12:22 PM, Maruan Sahyoun > <sahy...@fileaffairs.de>wrote: > >> it's possible to split a PDF using PDFBOX. The question is how to retrieve >> the information where to split the PDF. That was the reason for me asking >> how your sections are stored e.g. bookmarks, links, text … . If it's a >> bookmark you need to get the information in a different way than sections >> being normal text where a human can see the division but retrieving that >> with a program might be difficult. There are APIS for retrieving bookmarks >> and text though. >> >> Kind regards >> >> >> Maruan Sahyoun >> >> >> >> Am 14.11.2012 um 11:10 schrieb Tzali Maimon <tzali.mai...@eqsquest.com>: >> >>> PDFs are sometimes divided into sections or subjects. >>> for example: >>> >>> Title 1: >>> Sub-title: >>> some text... >>> >>> sub title: >>> some title >>> >>> sub-sub-title: >>> >>> >>> I want to split my PDF not by pages but by the this tree of titles. I >> dont >>> know if PDF forces each subject to be a bookmark. >>> >>> >>> On Wed, Nov 14, 2012 at 12:05 PM, Maruan Sahyoun <sahy...@fileaffairs.de >>> wrote: >>> >>>> Hi - what do you mean with sections and subsections. Are these bookmarks >>>> in PDF? >>>> >>>> With kind regards >>>> >>>> Maruan >> >>