Re: Shall we change our file.encoding

Nathan Beyer Tue, 14 Jul 2009 22:44:33 -0700

Is the IBM VME dealing with this correctly? Do we just need to fix DRLVM?


On Wed, Jul 15, 2009 at 12:25 AM, Regis<[email protected]> wrote:
> Kevin Zhou wrote:
>>
>> Yea, from luniglob.c, CL attempts to read the "file.encoding" property
>> adown
>> VM but fails to get the correct encoding.
>>
>> Regis, do you know any other specific ways that CL can gain the right
>> property?
>
> We can get from OS directly. Maybe just read env variables on Linux?
>
>>
>> Wed, Jul 15, 2009 at 9:59 AM, Regis <[email protected]> wrote:
>>
>>> Charles Lee wrote:
>>>
>>>> Hi Nanthan,
>>>>
>>>> If the file encoding derive from the OS, it should be the some bugs in
>>>> it
>>>> because on my LINUX machine the locale is en_US.UTF-8. Our default codec
>>>> is
>>>> still ISO8859-1. Do you know where can we found such codes?
>>>>
>>> Classlib expected vm do this and set the property, but it didn't, so we
>>> have to do this by ourselves.
>>>
>>>
>>>
>>>> On Tue, Jul 14, 2009 at 10:17 PM, Nathan Beyer <[email protected]> wrote:
>>>>
>>>>  Are we talking about windows or linux?the default file encoding should
>>>>>
>>>>> derive from the OS. I believe that's defined by the specs.
>>>>>
>>>>> Sent from my iPhone
>>>>>
>>>>>
>>>>> On Jul 14, 2009, at 5:51 AM, Charles Lee <[email protected]> wrote:
>>>>>
>>>>>  On Tue, Jul 14, 2009 at 6:12 PM, Jimmy,Jing Lv <[email protected]>
>>>>>
>>>>>> wrote:
>>>>>>
>>>>>>  Hi,
>>>>>>
>>>>>>>  Charles, I believe UTF-8 is the default encoding for RI, and it
>>>>>>> sounds
>>>>>>> reasonable.
>>>>>>>  BTW, it may encounter some compatibility problem, maybe we need to
>>>>>>> run
>>>>>>> more tests to verify?
>>>>>>>
>>>>>>> 2009/7/14 Charles Lee <[email protected]>
>>>>>>>
>>>>>>>  Hi guys:
>>>>>>>
>>>>>>>> I am doing some test cases on the ant junit test case and meeting
>>>>>>>> some
>>>>>>>> encoding problems. I find they are maybe caused by the different
>>>>>>>> default
>>>>>>>> encoding from RI and harmony. My local is en_US.UTF-8, RI default is
>>>>>>>>
>>>>>>>>  UTF-8
>>>>>>>
>>>>>>>  but harmony is 8859-1. And then I have encountered
>>>>>>>>
>>>>>>>> HARMONY-3736<https://issues.apache.org/jira/browse/HARMONY-3736>,
>>>>>>>> and the two diffs attached on that issue. It seems we always get
>>>>>>>> 8859-1.
>>>>>>>> Because: (correct me if wrong :-)
>>>>>>>>
>>>>>>>> 1. we remove the set code in the vm. we will always get null if we
>>>>>>>> call
>>>>>>>>
>>>>>>>>  vm
>>>>>>>
>>>>>>>  method
>>>>>>>>
>>>>>>>> 2. we set the file.encode in the libglob.c, if we got null from vm,
>>>>>>>> we
>>>>>>>>
>>>>>>>>  set
>>>>>>>
>>>>>>>  Sorry, it should be luniglob.c
>>>>>>>
>>>>>>  8859-1.
>>>>>>>>
>>>>>>>> 3. we can not set file.encode on the run time.
>>>>>>>>
>>>>>>>> ant use UTF-8 to encode filename which contains the non-ascii
>>>>>>>> character.
>>>>>>>> So why we use iso8859-1 as our unchangeable default?
>>>>>>>> From the wiki http://en.wikipedia.org/wiki/ISO8859-1, it says "In
>>>>>>>> computing
>>>>>>>> applications, encodings that provide full UCS support (such as
>>>>>>>> UTF-8<http://en.wikipedia.org/wiki/UTF-8>and
>>>>>>>> UTF-16 <http://en.wikipedia.org/wiki/UTF-16>) are finding increasing
>>>>>>>>
>>>>>>>>  favor
>>>>>>>
>>>>>>>  over encodings based on ISO 8859-1." Should we simply change
>>>>>>> iso8859-1
>>>>>>>>
>>>>>>>> to
>>>>>>>> utf-8?
>>>>>>>>
>>>>>>>> --
>>>>>>>> Yours sincerely,
>>>>>>>> Charles Lee
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>> --
>>>>>>>
>>>>>>> Best Regards!
>>>>>>>
>>>>>>> Jimmy, Jing Lv
>>>>>>> China Software Development Lab, IBM
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>> --
>>>>>> Yours sincerely,
>>>>>> Charles Lee
>>>>>>
>>>>>>
>>>>
>>> --
>>> Best Regards,
>>> Regis.
>>>
>>
>
>
> --
> Best Regards,
> Regis.
>

Re: Shall we change our file.encoding

Reply via email to