Maruan,

Thank you for your response. I used the pdfStripper() method to get the text from the PDF and then a substring to pull out the characters I want. Since I wrote the reports in the application, I know what text is in them, and the text I want will always be at column 20. As I said, this works fine on a PDF I created using Word and "Save As": it correctly pulls the text at that position. Something is different about the PDF files created by the application, and I don't know what it is. I was hoping to get some insight from the community.
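
For readers following the thread, the approach described above can be sketched roughly as follows. This is a minimal illustration, not the actual program: the class name `ColumnSlice`, the helper `sliceColumns`, the sample text, and the choice of 0-based positions are all assumptions. The input string stands in for whatever `PDFTextStripper.getText(document)` returns, so the sketch runs without PDFBox on the classpath.

```java
public class ColumnSlice {

    /**
     * Returns the characters in [start, end) of one line of extracted text.
     * Positions are 0-based here (an assumption; the thread does not say).
     * Out-of-range lines or columns yield an empty or shortened result
     * instead of throwing StringIndexOutOfBoundsException.
     */
    static String sliceColumns(String text, int lineIndex, int start, int end) {
        String[] lines = text.split("\\r?\\n", -1);
        if (lineIndex < 0 || lineIndex >= lines.length) {
            return "";
        }
        String line = lines[lineIndex];
        // Clamp both bounds to the actual length of the line.
        int from = Math.min(Math.max(start, 0), line.length());
        int to = Math.min(Math.max(end, from), line.length());
        return line.substring(from, to);
    }

    public static void main(String[] args) {
        // Hypothetical stand-in for the text returned by the stripper.
        String extracted = "Report header\n"
                + "Account number:     0123456789ABCDEF trailing\n";
        // Pull positions 20 through 35 from the second line.
        System.out.println(sliceColumns(extracted, 1, 20, 36));
    }
}
```

Because the slice depends on both the line index and the column offset, any difference in how a given PDF's text is ordered or spaced on extraction will shift the result, which is consistent with the symptom of getting text from a different line.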
Thanks,
Karen

On Tue, Feb 4, 2014 at 3:18 AM, Maruan Sahyoun <[email protected]> wrote:

> Hi Karen,
>
> unfortunately I don't fully understand the issue you are describing, but
> you can't rely on the fact that text is at a specific position within a
> PDF file.
>
> BR
> Maruan Sahyoun
>
> On 03.02.2014 at 22:57, Karen Lindholm <[email protected]> wrote:
>
> > I have written a program using PDFBox that looks into a PDF file and
> > pulls out characters from the 20th through 36th position in the file.
> > It works fine with a simple PDF file I created using Word 2010 and
> > saving as PDF. I have an application that is exporting to a PDF file
> > and my program is pulling some text from the third line of the file.
> > I am not sure why this is happening. Does anyone know of any issues
> > trying to look into a PDF file that was saved in an older format?
> >
> > Thanks,
> > Karen

