Maruan,
Thank you for your response. I used the pdfStripper() method to get the
text from the PDF and then a substring to pull out the characters I want.
Since I wrote the reports in the application, I know what text is in them
and the text I want will always be at column 20. As I said, this works fine
on a PDF I created using Word and "Save As." It correctly pulls the text at
that position. Something is different about the PDF files that the
application creates, and I don't know what it is. I was hoping to
get some insight from the community.
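
To make the difference concrete, here is a minimal sketch of the two ways of picking the value out of text that has already been extracted (for example, the String returned by PDFTextStripper.getText). Everything in it is an assumption for illustration: the class name, the helper names, the "Account Number:" label, and the sample strings are hypothetical and do not come from the real reports. The first helper is the fixed-position substring described above; the second looks for a known label instead, which is a different technique that does not depend on where the line lands in the extracted text.

```java
import java.util.Optional;

// Hypothetical helper, not part of PDFBox. It works on text that has
// already been extracted from the PDF into a String.
final class ExtractedTextSketch {

    // Fixed-position approach: characters firstPos through lastPos
    // (1-based, inclusive) of the whole extracted text.
    // Returns empty when the text is too short or the range is invalid.
    static Optional<String> byOffset(String text, int firstPos, int lastPos) {
        if (firstPos < 1 || lastPos < firstPos || text.length() < lastPos) {
            return Optional.empty();
        }
        return Optional.of(text.substring(firstPos - 1, lastPos));
    }

    // Label-based approach: find the first line containing a known label
    // and return the trimmed remainder of that line.
    static Optional<String> afterLabel(String text, String label) {
        for (String line : text.split("\\R")) {
            int i = line.indexOf(label);
            if (i >= 0) {
                return Optional.of(line.substring(i + label.length()).trim());
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Made-up sample text: the second string has extra lines before
        // the value, so the fixed positions no longer line up.
        String fromWord = "Account Number:    1234567890ABCDEFG\nName: Example\n";
        String fromApp = "Report Title\n\nAccount Number: 1234567890ABCDEFG\n";

        System.out.println(byOffset(fromWord, 20, 36));
        System.out.println(byOffset(fromApp, 20, 36));
        System.out.println(afterLabel(fromWord, "Account Number:"));
        System.out.println(afterLabel(fromApp, "Account Number:"));
    }
}
```

Printing the full extracted text from both kinds of PDF and comparing them side by side would show whether the application's files simply produce extra or reordered lines, which is what the second sample string imitates.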

Thanks,
Karen


On Tue, Feb 4, 2014 at 3:18 AM, Maruan Sahyoun <[email protected]> wrote:

> Hi Karen,
>
> unfortunately I don't fully understand the issue you are describing, but
> you can't rely on the fact that text is at a specific position within a PDF
> file.
>
> BR
> Maruan Sahyoun
>
> On 03.02.2014 at 22:57, Karen Lindholm <[email protected]> wrote:
>
> > I have written a program using PDFBox that looks into a PDF file and
> > pulls out characters from the 20th through 36th position in the file. It
> > works fine with a simple PDF file I created using Word 2010 and saving as
> > PDF. I have an application that exports to a PDF file, and with that file
> > my program pulls some text from the third line instead. I am not sure why
> > this is happening. Does anyone know of any issues with reading a PDF file
> > that was saved in an older format?
> >
> > Thanks,
> > Karen
>
>
