#3608: Character Š wrong in index and status on Mac OS X
--------------------+-------------------------------------------------------
Reporter: kolcon | Owner: me
Type: defect | Status: accepted
Priority: minor | Milestone: 1.6
Component: mutt | Version: 1.5.21
Keywords: |
--------------------+-------------------------------------------------------
Changes (by me):
* owner: mutt-dev => me
* status: new => accepted
* milestone: => 1.6
Comment:
It appears the problem is that the rfc822 parsing code is not unicode-
aware. In particular, the next_token() function using ISSPACE, and what
is likely happening is that the second byte of the Š character is being
stripped, which causes mutt_paddstr() to substitute the replacement char �
when mbrtowc() fails.
Since non-ascii characters are not permitted to appear in header fields,
this is normally not a problem (and if it is, we blame the sender's broken
software for not rfc2047-encoding it).
However, in this case, the user's muttrc has an alias with utf-8. And
this is probably similar to the situation in #2956 when the user edits the
message header, because we are not cruel enough to let the user directly
edit the RFC2047 encoding.
More thought is required here.
--
Ticket URL: <http://dev.mutt.org/trac/ticket/3608#comment:11>
Mutt <http://www.mutt.org/>
The Mutt mail user agent