Hello,
1. Does text extraction work with Adobe Reader?
2. Could you upload the file to a public location?
Tilman
Am 25.07.2015 um 09:42 schrieb 牛小伟:
Dear team:
We are using your product pdfbox 1.6 to do text extraction.
But when we are processing the encoding(UniJIS-UCS2-HW-H),
it appears unreadable code like this(????????????????????????3?????????????).
We have tried some other ways to process it. But they don't work.
We also have some doc with the encoding(GBK-EUC-H),the pdfbox
can work perfectly. I also tried the pdfbox 1.8, it also didn't work.
I checked the charset of the pdfbox. It contains both of the encoding.
I don't know why one is working, another is not working.
Hope your support for this .Very thanks.
Best Regard.
the docsnapshot of the encoding:
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]