Hello,

fresh new user of pdfbox, I’ve got some problems extracting the text of pdfs with Chinese characters in it.

I use pdfbox from the command line with the command : java -jar C:/pdfbox-app.jar ExtractText C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt

Result text only contains question marks..


Here is the document : 
--- Begin Message ---
Hello,

fresh new user of pdfbox, I’ve got some problems extracting the text of pdfs with Chinese characters in it.

I use pdfbox from the command line with the command : java -jar C:/pdfbox-app.jar ExtractText C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt

Result text only contains question marks..


Here is the document : 

Attachment: Test_Pdfbox.pdf
Description: Adobe PDF document




Thanks for your help,
Renaud

--- End Message ---

Attachment: Test_Pdfbox.pdf
Description: Adobe PDF document




Thanks for your help,
Renaud

Reply via email to