Quoting Nick Wilson <[EMAIL PROTECTED]>:
> > Is there any way around Microsoft's broken ISO-8859-1 character set?
> > The ` or ' characters show up in Mutt as \222 (contractions) \223
> > (left single quote) and \224 (right single quote). In the original
> > file they're 0x92, 0x93 and 0x94 (not ASCII, obviously).
>
> That looks exactly like what I've been getting.
> Do you also get '?' chars coming up unexpectedly?
That sounds familiar, but I'm not sure where I've seen that. It's
certainly not as common as the \222 - \224 issue. It looks like sed
won't do the trick because it doesn't understand octal (\222 - \224) or
hexidecimal (\x92 - \x93) escapes. Perl will, but that's an awful lot
of overhead.
I was also mistaken in my description of the characters -- 0x91 is `,
0x92 is ', 0x93 is ``, and 0x94 is ''. See:
http://czyborra.com/charsets/iso8859.html#CP1252
for details on Microsoft's ``extension'' of ISO-8859-1, properly
called CP1252, but often advertised as iso-8859-1 in email and HTML
pages generated by Microsoft software.
Chris
--
Christopher S. Swingley phone: 907-474-2689
Computer / Network Manager email: [EMAIL PROTECTED]
IARC -- Frontier Program GPG and PGP keys at my web page:
University of Alaska Fairbanks www.frontier.iarc.uaf.edu/~cswingle