[ 
https://issues.apache.org/jira/browse/PDFBOX-2950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979992#comment-14979992
 ] 

John Hewson commented on PDFBOX-2950:
-------------------------------------

Ok, I have the problem with "SimSun-ExtB" mostly figured out, it's related to 
how we implemented the more complex cases of toUnicode for CIDFonts. Fix coming 
soon.

I've isolated the problem with "ArialUnicodeMS" to fontbox's cmap table parsing 
(as suspected). CmapSubtable#processSubtype4() is parsing only 96 code -> glyph 
entries, however there are 38917 in the font. 

> Chinese font substitution issue
> -------------------------------
>
>                 Key: PDFBOX-2950
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2950
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Rendering
>    Affects Versions: 1.8.10, 2.0.0
>         Environment: Windows 8, JDK 1.7
>            Reporter: WuYu
>            Assignee: John Hewson
>             Fix For: 2.1.0
>
>         Attachments: 20150829A01-18.jpg, 20150829A01-20.jpg, 
> 20150829A01_pdf.pdf, PDFBOX-2950-reduced.pdf, acrobat_mac.png, 
> pdfbox20_mac.png
>
>
> java -jar pdfbox-app-1.8.10.jar PDFToImage 20150829A01_pdf.pdf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to