Sorry to say this Scott but with POI you cannot do this; that is get at a document page by page. Within a Word file, the document is not stored as a series of pages, rather as runs of text and other elements that combine to form the document. When you open the doc or docx file using a word processor such as Word, the application renders the document into pages - by calculating where each element fits onto the page. It is only at that time that you will be able to see where the text fits onto each page.
It is possible to control Word using COM or LibreOffice through the UNO interface and manipulate an instance of either. You could then open the document and read through it page by page but each option poses challenges for you. Alternativly, you could create your own document renderer but this would, in itself, pose quite a few challenges. There may be commercail options - Aspose is the one that most readilly springs to mind - that offer the ability to page through a Word document, but I do not know if this is the case. -- View this message in context: http://apache-poi.1045710.n5.nabble.com/How-to-get-text-from-each-page-tp5719036p5719040.html Sent from the POI - User mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
