Paul, I think I am aware the difference of "stripper.setSortByPosition(true)" and "stripper.setSortByPosition(false)". It is best explained when you try to extract a PDF who has multiple columns, e.g. two columns.
When you have "stripper.setSortByPosition(false)", the extraction result is usually the reading procedure which is fine. But when you have "stripper.setSortByPosition(true)", PDFBox will extract text from top to bottom, ignoring the columns, which is not expected by me. Regards,

