Sorry if this get this twice, I accidentally sent this to the wrong list first.

I have been looking around and I saw where the issue with extracted Arabic words being written in reverse was fixed, but I'm seeing an issue where the extracted Arabic text of a sentence is in reverse. I assume this is due to Arabic being a left-to-right language. Is there anyway to detect this and have pdfbox extract the text in the correct order?

Expected Arabic Text:
??????? ?????? ?????? ??????? ??????? ??????

Returned Arabic Text:
?????? ?????? ??????? ?????? ????? ???????

I am using the latest version (0.8.0-incubating).
Is there something else that I am missing?
--
Matt

Reply via email to