I have a question: how does HSSF handle the conversion of the Windows-1252
charset (the Microsoft 'customized' version of ISO Latin-1) to Unicode
strings?

I just ran a test reading an xls-format spreadsheet that had a cell
with character values from 0x80 to 0x9f. All of them are non-printing
control characters in Latin-1/Unicode, but most of them have printable
symbols assigned in Windows-1252.

When I wrote the characters out to a file using FileOutputStream I found
that most of them were being written as 0x3f ("?"), with a few of them
having their original values (0x81, 0x8d, 0x8f, 0x90, 0x9d).
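
To illustrate what I think I am seeing, here is a minimal sketch. It rests
on assumptions I have not confirmed: that the cell text reaches me as a Java
String in which 0x80 has already become U+20AC and 0x81 has become U+0081,
and that the output ends up encoded as ISO-8859-1. The class and method
names are made up for this example and are not part of HSSF.

```java
import java.nio.charset.StandardCharsets;

class EncodingCheck {
    // Encode a string as ISO-8859-1. String.getBytes replaces characters
    // the charset cannot represent with '?' (0x3f).
    static byte[] toLatin1(String s) {
        return s.getBytes(StandardCharsets.ISO_8859_1);
    }

    // Render bytes as space-separated two-digit hex values.
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02x ", b & 0xff));
        }
        return sb.toString().trim();
    }

    public static void main(String[] args) {
        // Hypothetical cell contents (assumption): U+20AC is the symbol
        // Windows-1252 assigns to 0x80; U+0081 stands in for 0x81, which
        // has no symbol assigned in Windows-1252.
        String cellText = "\u20AC\u0081";
        System.out.println(hex(toLatin1(cellText)));
    }
}
```

If those assumptions hold, the euro sign comes out as 0x3f and U+0081 comes
out as 0x81, which matches the pattern I described above.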

Is a 1252->Unicode encoding conversion being done (I am running on
Unix)?

I'm trying to get a handle on the encoding conversion issues here.

Thanks,

Carey Sublette
