Hi,

I found a mailing list thread from 2012-Jun-14 where this problem has already
been discussed, but my case is rather different.

I also get the warning "Did not found XRef object at specified startxref
position xxx" when running the main method of the org.apache.pdfbox.ExtractText
class. In my case, though, some text in the PDF is also skipped and does not
appear in the output TXT file. Acrobat Reader displays this same text, and the
user can select and copy it as text from there.

If the "-nonSeq" option is used, a "java.io.IOException: Error: Expected a long
type, actual=..." is thrown instead, which stops the text extraction.

Please, is there any way to make it work?

Thanks,

Rodrigo
