[
https://issues.apache.org/jira/browse/PDFBOX-420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Takashi Komatsubara updated PDFBOX-420:
---------------------------------------
Attachment: TestFilesForJapaneseGarbledIssue.zip
Hi Brian,
Before my change, I mean, the current PdfBox can handle Japanese character if
the pdf file is version 1.2 or 1.3.
If the version of the pdf is 1.4, this garble issue is happening.
My changing is to correct this issue targeting the version 1.4.
If the pdf file is version 1.5 or 1.6, sometimes this issue is happening again.
I'm trying to fix this issue.
BTW, the attached file is good sample.
Thank you, again.
Takashi.
> Japanese Characters are garbled.
> --------------------------------
>
> Key: PDFBOX-420
> URL: https://issues.apache.org/jira/browse/PDFBOX-420
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Takashi Komatsubara
> Priority: Critical
> Attachments: supportJapanese-fontbox.patch, supportJapanese.patch,
> TestFilesForJapaneseGarbledIssue.zip
>
>
> The extracted Japanese characters are completely garbled.
> This issue is very critical for Japanese users.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.