Hi Barry,
You are probably using a POI 2.* version. However, HPSF's codepage
support is in the CVS repository's HEAD revision only. You have to get
the source code from the CVS and compile it yourself. See
<http://jakarta.apache.org/site/cvsindex.html> for details on getting
the code.
The bad news is that HSSF in the CVS HEAD is broken. The good news is
that you should be able to replace the HPSF subdirectory in POI 2.5 with
the one from the CVS HEAD. If you do that, please let me know whether it
really works or whether you encounter any problems!
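
As a rough illustration of why the codepage matters for 8-bit (VT_LPSTR)
strings, here is a minimal sketch in plain Java. It is not HPSF code: the
class CodepageDemo and the helpers charsetForCodepage and decode are
hypothetical names made up for this example, and the codepage table covers
only a handful of identifiers. It shows that bytes written as UTF-8
(codepage 65001) survive only if they are decoded with that codepage rather
than with some other character set.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class CodepageDemo {

    // Hypothetical helper (not part of HPSF): maps a few Windows codepage
    // identifiers to Java charsets. A real table would need many more entries.
    static Charset charsetForCodepage(int codepage) {
        switch (codepage) {
            case 65001: return StandardCharsets.UTF_8;
            case 1200:  return StandardCharsets.UTF_16LE;
            case 20127: return StandardCharsets.US_ASCII;
            case 28591: return StandardCharsets.ISO_8859_1;
            default:
                throw new IllegalArgumentException("Unmapped codepage: " + codepage);
        }
    }

    // Decodes the raw bytes of an 8-bit string using the given codepage
    // instead of the platform's default character set.
    static String decode(byte[] raw, int codepage) {
        return new String(raw, charsetForCodepage(codepage));
    }

    public static void main(String[] args) {
        // Three Japanese characters, written as escapes to keep this source ASCII.
        String title = "\u65e5\u672c\u8a9e";
        byte[] raw = title.getBytes(StandardCharsets.UTF_8);

        // Simulates a platform whose default character set is not UTF-8.
        String wrong = new String(raw, StandardCharsets.ISO_8859_1);
        String right = decode(raw, 65001);

        System.out.println("decoded with codepage 65001 matches: " + right.equals(title));
        System.out.println("decoded as ISO-8859-1 matches:       " + wrong.equals(title));
    }
}
```

The ISO-8859-1 decoding stands in for "the platform's default character
set" from the FIXME comment; on a machine whose default happens to be UTF-8
the problem would not show up at all.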
Best regards
Rainer Klute
Rainer Klute IT-Consulting GmbH
Dipl.-Inform.
Rainer Klute E-Mail: [EMAIL PROTECTED]
Körner Grund 24 Telefon: +49 172 2324824
D-44143 Dortmund Telefax: +49 231 5349423
On Mon, 2004-08-09 at 19:17, Molof, Barry C wrote:
> Hello,
>
> I have a scenario in which the properties of a Word document are
> Japanese characters. Using HPSF to get these properties does not give
> me the results that I need. I have been debugging through the code and
> came upon a comment in org.apache.poi.hpsf.TypeReader at around line
> 105. This comment is:
>
> * FIXME: Reading an 8-bit string should pay attention
> * to the codepage. Currently the byte making out the
> * property's value are interpreted according to the
> * platform's default character set.
>
> While debugging the code, I have verified that the codepage read was
> 65001 (UTF-8) and that the type of the property being read is 30
> (VT_LPSTR). Is this FIXME comment the reason why I am not getting the
> property values back correctly? I am guessing that POI is using my
> character set which is not UTF-8.
>
> Are there any plans to fix this in the future?
>
> Thank you,
>
>
> Barry Molof
> Computer Associates
> Programmer
> tel: +1 631 342 3234
> [EMAIL PROTECTED]