On Thu, June 19, 2008 12:40 am, T. R. Valentine wrote:

> Yes, Patrick is correct.
> 
> I would add one
caveat. If you use UTF-8 (personally, I see no reason
> to
anything else), you should not use ASCII characters (hex) 81-9F /
> (dec) 129-159 which includes stuff like &151; for an em dash
and &150;
> for an en dash. Instead, either use the character
directly or use
> — and – for the em dash and
en dash respectively.
> 

My understanding is that since
HTML 4.0 all numerical character references are defined in terms of the
document character set. For HTML4 onwards the document character set is
always Unicode regardless of the character encoding of the document. 

So in HTML4 onwards  en dash and em dash are – and
&#8212

You'd have to go back to HTML 3.2 for &150; and
&151; to be considered en-dash and em-dash characters. And even then
HTML 3.2 used ISO-8859-1 specifically, so &150; and &151; would be
technically undefined.

-- 
Andrew Cunningham
Research
and Development Coordinator
Vicnet
State Library of Victoria
Australia

[EMAIL PROTECTED]


*******************************************************************
List Guidelines: http://webstandardsgroup.org/mail/guidelines.cfm
Unsubscribe: http://webstandardsgroup.org/join/unsubscribe.cfm
Help: [EMAIL PROTECTED]
*******************************************************************

Reply via email to