I believe I know what is going on.

The treatment of ISO-8859-1 set by Unicode is the culprit, at least in the
Windows machines. Please check the three versions of an HTML file for the
same text given at the bottom of the page. Characters outside ASCII that are
still within ISO-8859-1 (codepoints 128 thru 255) are not included in the
Unicode repertoire (as the last sample illustrates). The HTML pages do not
declare a font, and therefore, uses the Last Resort font of the system,
which demonstrably *has* the letters that UTF-8 set thinks are missing.

If you are working with people that want to use Indic, it is best that they
transliterate their languages to ISO-8859-1 and display them by means of
orthographic fonts. Here are two web sites that illustrates it (The language
is Sinhala):
http://www.ahangama.com/ -- My Wordpress blog has both English and Sinhala
http://www.lovatasinhala.com/ -- has only *one* graphic, that of the lion.

Regards,

JC


1. No character set declaration. Shows the text correctly because ISO-8859-1
is the default charset.
======================================================================
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<title>ISO-8859-1 but no character set declared</title>
</head>
<body>
ASCII lc:<br>
<span style="font-size:20px;letter-spacing:8px;">
abcdefghijklmnopqrstuvwxyz
</span><br><br>
Some non-English letters:<br>
<span style="font-size:20px;letter-spacing:8px;">
ðþææéúíóáðçøçµëûüïöäÐçôçñ
</span>
</body>
</html>
======================================================================

2. character declared as iso-8859-1. The text shows correctly
======================================================================
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<title>ISO-8859-1 charset explicitly declared</title>
<meta http-equiv="Content-Type" content="text/html; Charset=iso-8859-1">
</head>
<body>
ASCII lc:<br>
<span style="font-size:20px;letter-spacing:8px;">
abcdefghijklmnopqrstuvwxyz
</span><br><br>
Some non-English letters:<br>
<span style="font-size:20px;letter-spacing:8px;">
ðþææéúíóáðçøçµëûüïöäÐçôçñ
</span>
</body>
</html>
======================================================================

3. characters set declared as UTF-8. No European characters!
======================================================================
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>
<head>
<title>ISO-8859-1 charset explicitly declared</title>
<meta http-equiv="Content-Type" content="text/html; Charset=utf-8">
</head>
<body>
ASCII lc:<br>
<span style="font-size:20px;letter-spacing:8px;">
abcdefghijklmnopqrstuvwxyz
</span><br><br>
Some non-English letters:<br>
<span style="font-size:20px;letter-spacing:8px;">
ðþææéúíóáðçøçµëûüïöäÐçôçñ
</span>
</body>
</html>
===================================


[Non-text portions of this message have been removed]



------------------------------------

-----
To unsubscribe send a message to: svg-developers-unsubscr...@yahoogroups.com
-or-
visit http://groups.yahoo.com/group/svg-developers and click "edit my 
membership"
----Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/svg-developers/

<*> Your email settings:
    Individual Email | Traditional

<*> To change settings online go to:
    http://groups.yahoo.com/group/svg-developers/join
    (Yahoo! ID required)

<*> To change settings via email:
    svg-developers-dig...@yahoogroups.com 
    svg-developers-fullfeatu...@yahoogroups.com

<*> To unsubscribe from this group, send an email to:
    svg-developers-unsubscr...@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/

Reply via email to