Hello 牛小伟,

When the issues targeted at 2.0 have been resolved:
https://issues.apache.org/jira/issues/?jql=project%20%3D%20PDFBOX%20AND%20fixVersion%20%3D%202.0.0%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20due%20ASC%2C%20priority%20DESC%2C%20created%20ASC

I can't tell you when this will be. We're all volunteers. Be patient :-)

Tilman

On 27.07.2015 at 03:13, 牛小伟 wrote:
Dear Tilman,
Thanks. Do you know when version 2.0 will be released?

Best regards
Niu Xiaowei

--
Sent from my NetEase Mail mobile app


At 2015-07-26 22:07:29, "牛小伟" <[email protected]> wrote:
Dear Tilman,
Thanks for your support. The original file is at the company,
so I can't get it. But I made a simple one using iText.
It uses the same encoding as the original, and PDFBox can't process it either.
Please check the attachment.


Thanks,
Best Regards,
Niu X








At 2015-07-25 15:42:55, "牛小伟" <[email protected]> wrote:
Dear team:
We are using your product PDFBox 1.6 for text extraction.
But when we process a document with the encoding UniJIS-UCS2-HW-H,
the output is unreadable, like this: ????????????????????????3?????????????
We have tried some other ways to process it, but they don't work.
We also have some documents with the encoding GBK-EUC-H, and PDFBox
handles them perfectly. I also tried PDFBox 1.8, and it didn't work either.
I checked the character sets included in PDFBox. It contains both encodings.
I don't know why one works and the other does not.
I hope you can help with this. Thank you very much.


Best regards,


A snapshot of the document's encoding:







---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]



