Hello Mutt users, like most people, I often receive eMails from Windows users. These eMails tend to contain some of those characters from the Windows-1252 character set that are not part of the iso-8859-1 standard (aka Latin-1). You know.
Since Windows-1252 is actually a small extension to iso-8859-1, it
should be easy to define a mapping to iso-8859-1 codes approximating
those funny characters.
Right, that's what the excellent programs iconv and recode are for,
aren't they? The problem is, many Windows user send bogus Content-Type
header lines, so where the line should read,
Content-Type: text/plain; charset=Windows-1252
I often find,
Content-Type: text/plain; charset=iso-8859-1
or even,
Content-Type: text/plain; charset=us-ascii
instead. A quick fix would be to filter text/plain messages through a
sed or tr script doing the substitutions. Some code to trigger this
could be added in ~/.mailcap or ~/.procmailrc perhaps.
Is this a nonsense idea? Has anybody solved this problem already? I
could not find anything about this in the list archive at Yahoo. If
anybody finds this idea useful, I'll go ahead with this idea and
notify the list once I get my solution working. Or should this feature
be included into Mutt proper?
It might still be the case that this problem is just a
misconfiguration issue. Here is my configuration:
o Mutt:
System: Linux 2.4.4-4GB (i586) [using ncurses 5.2]
Einstellungen bei der Compilierung:
-DOMAIN
+DEBUG
-HOMESPOOL -USE_SETGID +USE_DOTLOCK -DL_STANDALONE
+USE_FCNTL -USE_FLOCK
-USE_POP -USE_IMAP -USE_GSS -USE_SSL -USE_SASL
+HAVE_REGCOMP -USE_GNU_REGEX
+HAVE_COLOR +HAVE_START_COLOR +HAVE_TYPEAHEAD +HAVE_BKGDSET
+HAVE_CURS_SET +HAVE_META +HAVE_RESIZETERM
+HAVE_PGP -BUFFY_SIZE -EXACT_ADDRESS -SUN_ATTACHMENT
+ENABLE_NLS -LOCALES_HACK +HAVE_WC_FUNCS +HAVE_LANGINFO_CODESET
++HAVE_LANGINFO_YESEXPR
+HAVE_ICONV -ICONV_NONTRANS +HAVE_GETSID -HAVE_GETADDRINFO
ISPELL="/usr/bin/ispell"
SENDMAIL="/usr/sbin/sendmail"
MAILPATH="/var/mail"
PKGDATADIR="/usr/local/share/mutt"
SYSCONFDIR="/usr/local/etc"
EXECSHELL="/bin/sh"
-MIXMASTER
o iconv: iconv (GNU libc) 2.2.2
(which knows a lot of names for the character sets I've just
mentioned. Hence, I don't think any charset-hook is necessary)
o locale: my shell environment contains LANG=de_DE.ISO-8859-1,
and I have not set locale or charset in any Muttrc.
(Setting LANG=de_DE.UTF-8 makes things even worse, even though
other UTF-8 enabled apps like w3m-m17n work just fine. I'll
post a separate message about my experiment with UTF-8.)
o my *nix: SuSE Linux 7.2 (Linux 2.4.4-4GB)
Thanks for you attention.
Cristian
--
}{ Cristian Pietsch
}{ http://www.interling.de
msg21177/pgp00000.pgp
Description: PGP signature
