Quoting Nick Wilson <[EMAIL PROTECTED]>:
> > Is there any way around Microsoft's broken ISO-8859-1 character set?
> > The ` or ' characters show up in Mutt as \222 (contractions) \223
> > (left single quote) and \224 (right single quote).  In the original
> > file they're 0x92, 0x93 and 0x94 (not ASCII, obviously).
> 
> That looks exactly like what I've been getting.
> Do you also get '?' chars coming up unexpectedly?

That sounds familiar, but I'm not sure where I've seen that.  It's
certainly not as common as the \222 - \224 issue.  It looks like sed
won't do the trick because it doesn't understand octal (\222 - \224) or
hexadecimal (\x92 - \x94) escapes.  Perl will, but that's an awful lot
of overhead.
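
A byte-level substitution of this kind can also be sketched in Python.
This is only an illustration, not something tested against Mutt: the
function name ascii_quotes and the choice of ASCII stand-ins are
assumptions, and the byte meanings are taken from the CP1252 table.

```python
# Hypothetical helper: the names and the ASCII stand-ins are assumptions.
# Byte meanings follow the CP1252 table (0x91-0x94 are the curly quotes).
SMART_QUOTES = {
    0x91: b"`",    # left single quote
    0x92: b"'",    # right single quote, also used as the apostrophe
    0x93: b"``",   # left double quote
    0x94: b"''",   # right double quote
}


def ascii_quotes(data: bytes) -> bytes:
    """Replace the CP1252 quote bytes 0x91-0x94 with ASCII stand-ins."""
    for byte, replacement in SMART_QUOTES.items():
        data = data.replace(bytes([byte]), replacement)
    return data
```

To use it as a filter, read the message from sys.stdin.buffer, pass the
bytes through ascii_quotes(), and write the result to sys.stdout.buffer.
Working on bytes rather than decoded text avoids having to trust the
charset the message claims.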

I was also mistaken in my description of the characters -- 0x91 is `,
0x92 is ', 0x93 is ``, and 0x94 is ''.  See:

    http://czyborra.com/charsets/iso8859.html#CP1252

for details on Microsoft's ``extension'' of ISO-8859-1, properly
called CP1252, but often advertised as iso-8859-1 in email and HTML
pages generated by Microsoft software.
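
If the goal is to keep the real quote characters rather than flatten
them, one option is to decode text labelled iso-8859-1 as CP1252
instead.  A minimal sketch in Python, assuming the hypothetical helper
name decode_mislabelled; the codec names "cp1252" and "iso-8859-1" are
the ones Python's standard library provides.

```python
def decode_mislabelled(raw: bytes) -> str:
    """Decode bytes advertised as iso-8859-1 but probably CP1252.

    Hypothetical helper.  CP1252 leaves a few bytes undefined (0x81,
    for example), so fall back to true ISO-8859-1, which accepts any
    byte value.
    """
    try:
        return raw.decode("cp1252")
    except UnicodeDecodeError:
        return raw.decode("iso-8859-1")
```

With this, the byte 0x92 in "don\x92t" becomes U+2019 (right single
quotation mark) rather than an unprintable control character.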

Chris
-- 
Christopher S. Swingley             phone: 907-474-2689
Computer / Network Manager          email: [EMAIL PROTECTED]
IARC -- Frontier Program            GPG and PGP keys at my web page:
University of Alaska Fairbanks      www.frontier.iarc.uaf.edu/~cswingle
