Paul,
I think I am aware the difference of
"stripper.setSortByPosition(true)" and
"stripper.setSortByPosition(false)". It is best explained when you try
to extract a PDF who has multiple columns, e.g. two columns.

When you have "stripper.setSortByPosition(false)", the extraction
result is usually the reading procedure which is fine. But when you
have "stripper.setSortByPosition(true)", PDFBox will extract text from
top to bottom, ignoring the columns, which is not expected by me.

Regards,

Reply via email to