Re[2]: Slovenian (once again)

2001-12-12 Thread Alastair Scott

Jernej wrote:

 Hello Alastair,

 12. december 2001, 10:31:38, you wrote:

AS That anything comprehensible at all can be derived from this is a
AS miracle - so what's stopping the "c-hacek" from being right too? [And
AS where did the backslashes and quotes come from?]

 I think that it's the encoding's fault - in Win-1250 encoding "c is in
 the same place as 'e in "normal" Windows encoding (which CP is that
 anyway?)... Check, how this message comes through - it's ISO-8859-2
 encoded.

It looks exactly the same as the first try :(

The line in the source, this time, looks like:

From: =?ISO-8859-2?Q?Jernej_Simon=E8i=E8?= [EMAIL PROTECTED]

and, in the (UK) ISO 8859-1 character set, #E8 is e-grave :(

What seems to be happening is that something, somewhere, in The Bat!
isn't understanding that it needs to switch from ISO 8859-1 to ISO
8859-2 when displaying that line.

(I presume that there is a mapping from IS0 8859-x to two-byte Unicode
characters somewhere in the Windows APIs so that, when an application
says, 'This character code is #E8 and the current character set is ISO
8859-2', 'c-hacek' is printed).

I've found two rather useful sites from which I derived this explanation
(?):

http://czyborra.com/charsets/iso8859.html (what the various sets look
like)

http://nl.ijs.si/gnusl/cee/iso8859-2.html (ISO 8859-2 in detail)

Alastair


_
This message has been checked for all known viruses by the 
MessageLabs Virus Scanning Service. For further information visit
http://www.messagelabs.com/stats.asp


-- 
_
Archives   : http://tbbeta.thebat.dutaint.com
Moderators : mailto:[EMAIL PROTECTED]
Unsubscribe: mailto:[EMAIL PROTECTED]
Latest Beta: 1.54 beta/15
Wish List  : http://wish.thebat.dutaint.com




Re[2]: Slovenian (once again) -

2001-12-12 Thread Jernej Simoni

Hello Maurice,

13. december 2001, 1:27:06, you wrote:

JS if the encoding isn't supported:
JS - display the closest approximates of the letters (CSZCDcszcd)

MS Which means that all known character tables whould have to be known to
MS the program, including tables mapping non [A-Z]|[a-z] characters to
MS their counterparts.

FAR manager (Norton Commander clone for Windows) does that in it's
viewer/editor (I've seen it on NT4, which had no CE-encoding support).

MS If I analyse TB!'s behaviour correctly, I come to the conclusion that,
MS given that a font supporting the needed characters is selected, TB!
MS forgets to put the font in the right characterset before displaying
MS the header information.

I've noticed interesting thing on Linux (under Wine): if message's
charset was anything other than None, the message text and headers
(both in the listing and those above the body) appeared only as
 squares. To view it I had to View-Character Set-None...

-- 
Jernej Simoncic, [EMAIL PROTECTED]
http://www2.arnes.si/~sopjsimo/
ICQ: 26266467

[The Bat! v1.54 Beta/15 on Windows 98 4.10.67766446. A ]

Any product cut to length will be too short.
   -- Klipstein's Observation


--
_
Archives   : http://tbbeta.thebat.dutaint.com
Moderators : mailto:[EMAIL PROTECTED]
Unsubscribe: mailto:[EMAIL PROTECTED]
Latest Beta: 1.54 beta/15
Wish List  : http://wish.thebat.dutaint.com