[
https://issues.apache.org/jira/browse/PDFBOX-283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13280127#comment-13280127
]
Bernd Köster commented on PDFBOX-283:
-------------------------------------
I did some work on the PDAppearace. You need to use the code above on every
substring in the convertMulitline method.
> Character encoding/appearance issues when filling forms
> -------------------------------------------------------
>
> Key: PDFBOX-283
> URL: https://issues.apache.org/jira/browse/PDFBOX-283
> Project: PDFBox
> Issue Type: Bug
> Components: PDModel.AcroForm
> Priority: Minor
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1735902
> Originally submitted by scop on 2007-06-12 10:23.
> When filling a text field with non-ASCII characters such as in my surname
> "Skyttä" and saving the document in a UTF-8 environment, something goes
> wrong with the appearance of the text.
> The value itself seems to be stored correctly, but when opening the doc, the
> appearance of "ä" is not that, but rather something which happens when UTF-8
> is mistakenly treated as ISO-8859-1 (two garbage characters).
> PDAppearance uses the platform default encoding in quite a few places which
> apparently has potential to mess things up. In particular,
> insertGeneratedAppearance() generates a PrintWriter from an OutputStream
> without specifying the encoding. In fact, if I hack that to use ISO-8859-1,
> the appearance of my "ä" case is correct, but that won't obviously work with
> anything else than chars that are valid ISO-8859-1.
> In which char encoding should the value be written to the appearance stream
> (at end of insertGeneratedAppearance())?
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira