Re: [svg-developers] Unicode and SVG

2010-10-22 Thread Terry Riegel
Us ASCII will always be us ASCII in both encodings.

UTF-8 is what you want so convert any non-utf to utf and you'll be fine

Terry

Sent from my iPhone

On Oct 22, 2010, at 12:44 PM, JC Ahangama  wrote:

> I agree, sort of.
> 
> The question is still why does US-ASCII letters show inside an HTML file
> declared as charset utf-8 and letters like ð,þ, á show as glyph not found. I
> did not *convert* US-ASCII. You will understand the problem only if you open
> the attached HTML files on 3 tabs and compare.
> 
> I still believe that somebody forgot something somewhere about 2004.
> 
> Thanks.
> 
> JC
> 
> On Fri, Oct 22, 2010 at 5:29 AM, Terry Riegel
> wrote:
> 
>> 
>> 
>> I think you are confused about utf-8
>> 
>> declaring iso-8859-1 text to be utf-8 doesn't make it utf-8
>> 
>> If you want it to be utf-8 you need to convert it to that.
>> 
>> Terry
>> 
>> 
>> On Oct 21, 2010, at 11:53 PM, JC Ahangama 
>> >
>> wrote:
>> 
>>> I believe I know what is going on.
>>> 
>>> The treatment of ISO-8859-1 set by Unicode is the culprit, at least in
>> the
>>> Windows machines. Please check the three versions of an HTML file for the
>>> same text given at the bottom of the page. Characters outside ASCII that
>> are
>>> still within ISO-8859-1 (codepoints 128 thru 255) are not included in the
>>> Unicode repertoire (as the last sample illustrates). The HTML pages do
>> not
>>> declare a font, and therefore, uses the Last Resort font of the system,
>>> which demonstrably *has* the letters that UTF-8 set thinks are missing.
>>> 
>>> If you are working with people that want to use Indic, it is best that
>> they
>>> transliterate their languages to ISO-8859-1 and display them by means of
>>> orthographic fonts. Here are two web sites that illustrates it (The
>> language
>>> is Sinhala):
>>> http://www.ahangama.com/ -- My Wordpress blog has both English and
>> Sinhala
>>> http://www.lovatasinhala.com/ -- has only *one* graphic, that of the
>> lion.
>>> 
>>> Regards,
>>> 
>>> JC
>>> 
>>> 
>>> 1. No character set declaration. Shows the text correctly because
>> ISO-8859-1
>>> is the default charset.
>>> ==
>>> 
>>> 
>>> 
>>> ISO-8859-1 but no character set declared
>>> 
>>> 
>>> ASCII lc:
>>> 
>>> abcdefghijklmnopqrstuvwxyz
>>> 
>>> Some non-English letters:
>>> 
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> 
>>> 
>>> 
>>> ==
>>> 
>>> 2. character declared as iso-8859-1. The text shows correctly
>>> ==
>>> 
>>> 
>>> 
>>> ISO-8859-1 charset explicitly declared
>>> 
>>> 
>>> 
>>> ASCII lc:
>>> 
>>> abcdefghijklmnopqrstuvwxyz
>>> 
>>> Some non-English letters:
>>> 
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> 
>>> 
>>> 
>>> ==
>>> 
>>> 3. characters set declared as UTF-8. No European characters!
>>> ==
>>> 
>>> 
>>> 
>>> ISO-8859-1 charset explicitly declared
>>> 
>>> 
>>> 
>>> ASCII lc:
>>> 
>>> abcdefghijklmnopqrstuvwxyz
>>> 
>>> Some non-English letters:
>>> 
>>> ðþææéúíóáðçøçµëûüïöäÐçôçñ
>>> 
>>> 
>>> 
>>> ===
>>> 
>>> 
>>> [Non-text portions of this message have been removed]
>>> 
>>> 
>>> 
>>> 
>>> 
>>> -
>>> To unsubscribe send a message to:
>> svg-developers-unsubscr...@yahoogroups.com
>>> -or-
>>> visit http://groups.yahoo.com/group/svg-developers and click "edit my
>> membership"
>>> Yahoo! Groups Links
>>> 
>>> 
>>> 
>> 
>> 
> 
> 
> [Non-text portions of this message have been removed]
> 
> 
> 
> 
> 
> -
> To unsubscribe send a message to: svg-developers-unsubscr...@yahoogroups.com
> -or-
> visit http://groups.yahoo.com/group/svg-developers and click "edit my 
> membership"
> Yahoo! Groups Links
> 
> 
> 




-
To unsubscribe send a message to: svg-developers-unsubscr...@yahoogroups.com
-or-
visit http://groups.yahoo.com/group/svg-developers and click "edit my 
membership"
Yahoo! Groups Links

<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/svg-developers/

<*> Your email settings:
Individual Email | Traditional

<*> To change settings online go to:
http://groups.yahoo.com/group/svg-developers/join
(Yahoo! ID required)

<*> To change settings via email:
svg-developers-dig...@yahoogroups.com 
svg-developers-fullfeatu...@yahoogroups.com

<*> To unsubscribe from this group, send an email to:
svg-developers-unsubscr...@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/



Re: [svg-developers] Unicode and SVG

2010-10-22 Thread Terry Riegel
I think you are confused about utf-8

declaring iso-8859-1 text to be utf-8 doesn't make it utf-8

If you want it to be utf-8 you need to convert it to that.

Terry




On Oct 21, 2010, at 11:53 PM, JC Ahangama  wrote:

> I believe I know what is going on.
> 
> The treatment of ISO-8859-1 set by Unicode is the culprit, at least in the
> Windows machines. Please check the three versions of an HTML file for the
> same text given at the bottom of the page. Characters outside ASCII that are
> still within ISO-8859-1 (codepoints 128 thru 255) are not included in the
> Unicode repertoire (as the last sample illustrates). The HTML pages do not
> declare a font, and therefore, uses the Last Resort font of the system,
> which demonstrably *has* the letters that UTF-8 set thinks are missing.
> 
> If you are working with people that want to use Indic, it is best that they
> transliterate their languages to ISO-8859-1 and display them by means of
> orthographic fonts. Here are two web sites that illustrates it (The language
> is Sinhala):
> http://www.ahangama.com/ -- My Wordpress blog has both English and Sinhala
> http://www.lovatasinhala.com/ -- has only *one* graphic, that of the lion.
> 
> Regards,
> 
> JC
> 
> 
> 1. No character set declaration. Shows the text correctly because ISO-8859-1
> is the default charset.
> ==
> 
> 
> 
> ISO-8859-1 but no character set declared
> 
> 
> ASCII lc:
> 
> abcdefghijklmnopqrstuvwxyz
> 
> Some non-English letters:
> 
> ðþææéúíóáðçøçµëûüïöäÐçôçñ
> 
> 
> 
> ==
> 
> 2. character declared as iso-8859-1. The text shows correctly
> ==
> 
> 
> 
> ISO-8859-1 charset explicitly declared
> 
> 
> 
> ASCII lc:
> 
> abcdefghijklmnopqrstuvwxyz
> 
> Some non-English letters:
> 
> ðþææéúíóáðçøçµëûüïöäÐçôçñ
> 
> 
> 
> ==
> 
> 3. characters set declared as UTF-8. No European characters!
> ==
> 
> 
> 
> ISO-8859-1 charset explicitly declared
> 
> 
> 
> ASCII lc:
> 
> abcdefghijklmnopqrstuvwxyz
> 
> Some non-English letters:
> 
> ðþææéúíóáðçøçµëûüïöäÐçôçñ
> 
> 
> 
> ===
> 
> 
> [Non-text portions of this message have been removed]
> 
> 
> 
> 
> 
> -
> To unsubscribe send a message to: svg-developers-unsubscr...@yahoogroups.com
> -or-
> visit http://groups.yahoo.com/group/svg-developers and click "edit my 
> membership"
> Yahoo! Groups Links
> 
> 
> 




-
To unsubscribe send a message to: svg-developers-unsubscr...@yahoogroups.com
-or-
visit http://groups.yahoo.com/group/svg-developers and click "edit my 
membership"
Yahoo! Groups Links

<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/svg-developers/

<*> Your email settings:
Individual Email | Traditional

<*> To change settings online go to:
http://groups.yahoo.com/group/svg-developers/join
(Yahoo! ID required)

<*> To change settings via email:
svg-developers-dig...@yahoogroups.com 
svg-developers-fullfeatu...@yahoogroups.com

<*> To unsubscribe from this group, send an email to:
svg-developers-unsubscr...@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/