Re: [kaffe] What UnicodeLittle should be?

2005-06-24 Thread Ito Kazumitsu
From: Ito Kazumitsu [EMAIL PROTECTED]
Subject: Re: [kaffe] What UnicodeLittle should be?
Date: Fri, 24 Jun 2005 12:00:41 +0900 (JST)

 But http://java.sun.com/j2se/1.5.0/docs/guide/intl/encoding.doc.html
 says about UnicodeLittle,
 
   Sixteen-bit Unicode Transformation Format, little-endian byte order, 
   with byte-order mark
 
 So, gnu.java.nio.charset.UnicodeLittle must be correct.

Correct, but buggy.

I submitted a bug report to GNU Classpath.
http://savannah.gnu.org/bugs/?func=detailitemitem_id=13532

___
kaffe mailing list
kaffe@kaffe.org
http://kaffe.org/cgi-bin/mailman/listinfo/kaffe


[kaffe] What UnicodeLittle should be?

2005-06-23 Thread Ito Kazumitsu
Hi,

I have found that the famous Java Excel API
(http://www.andykhan.com/jexcelapi/) does not work with Kaffe
these days: Japanese text cannot be extracted from Excel worksheets.

This seems to be due to the difference between iconv
and gnu.java.nio.charset on the handling of UnicodeLittle.

iconv, which kaffe used before importing gnu.java.nio.charset,
treats UnicodeLittle as an alias of UCS-2LE (WITHOUT byte order mark).
But gnu.java.nio.charset.UnicodeLittle treats it as UTF-16
little endian WITH byte order mark.

I do not know which is correct, but seeing the behavior of
Java Excel API, I think iconv's implementation is correct.

___
kaffe mailing list
kaffe@kaffe.org
http://kaffe.org/cgi-bin/mailman/listinfo/kaffe


Re: [kaffe] What UnicodeLittle should be?

2005-06-23 Thread Ito Kazumitsu
From: Ito Kazumitsu [EMAIL PROTECTED]
Subject: [kaffe] What UnicodeLittle should be?
Date: Fri, 24 Jun 2005 11:37:54 +0900 (JST)

 iconv, which kaffe used before importing gnu.java.nio.charset,
 treats UnicodeLittle as an alias of UCS-2LE (WITHOUT byte order mark).
 But gnu.java.nio.charset.UnicodeLittle treats it as UTF-16
 little endian WITH byte order mark.
 
 I do not know which is correct, but seeing the behavior of
 Java Excel API, I think iconv's implementation is correct.

But http://java.sun.com/j2se/1.5.0/docs/guide/intl/encoding.doc.html
says about UnicodeLittle,

  Sixteen-bit Unicode Transformation Format, little-endian byte order, 
  with byte-order mark

So, gnu.java.nio.charset.UnicodeLittle must be correct.

___
kaffe mailing list
kaffe@kaffe.org
http://kaffe.org/cgi-bin/mailman/listinfo/kaffe