Re: OT: languages and spellings
Hi! Peter Steiner [EMAIL PROTECTED] about "OT: languages and spellings": Perhaps we should use the phonetic alphabet of the IPA (international phonetic association), Unicode characters 0x0250..0x02af, but then i know just know one font, Lucida Sans Unicode, having these, and where is a mail client supporting UTF8...? BTW, Office 2000 comes with Arial Unicode MS which supports the *whole* Unicode charset - and has a size of 24 MB! -- Christian GassmannPGP keys: RSA 0x6975E6D3 - DH/DSS 0x83BD18C5 Internet Mail: [EMAIL PROTECTED] ICQ UIN: 12893571 Using The Bat! 1.36 under Windows NT 4.0 Build 1381 Service Pack 5 -- -- View the TBUDL archive at http://tbudl.thebat.dutaint.com To send a message to the list moderation team click here: mailto:[EMAIL PROTECTED] To Unsubscribe from TBUDL, click below and send the generated message. mailto:[EMAIL PROTECTED] --
Re: Re[2]: OT: languages and spellings
Hi there! On 19 Oct 99, at 1:28, Thomas Fernandez wrote about "Re[2]: OT: languages and spellings": Perhaps we should use the phonetic alphabet of the IPA (international phonetic association), Unicode characters 0x0250..0x02af, but then i know just know one font, Lucida Sans Unicode, having these, and where is a mail client supporting UTF8...? CG BTW, Office 2000 comes with Arial Unicode MS which supports the CG *whole* Unicode charset - and has a size of 24 MB! Anybody knows whether this is covered by some RFCs or not? I've got a number of OL-made messages lately, charset was set to UTF-8 there. It was just plain gibberish. I mean, all the higher- bit letters were replaced by "?"s... BTW, that was OL'97... Sounds like Unicode is the future. Forget about all the other encoding systems? Is that what you're saying? In that case, I hope TB will support Unicode. I am not sure that we need it. What will we gain from this, after all? I'll bet, nothing but a heavy mess currently:-( SY, Alex (St.Petersburg, Russia) -- Thought for the day: ``I came, I saw, SHE conquered.'' (The original Latin seems to have been garbled.) --- PGP public keys on keyservers: 0xA2194BF9 (RSA); 0x214135A2 (DH/DSS) fingerprints: F222 4AEF EC9F 5FA6 7515 910A 2429 9CB1 (RSA) A677 81C9 48CF 16D1 B589 9D33 E7D5 675F 2141 35A2 (DH/DSS) --- -- -- View the TBUDL archive at http://tbudl.thebat.dutaint.com To send a message to the list moderation team click here: mailto:[EMAIL PROTECTED] To Unsubscribe from TBUDL, click below and send the generated message. mailto:[EMAIL PROTECTED] --
Re: OT: languages and spellings
Hello Alexander On Monday, October 18, 1999, 10:04:50 PM, Alexander V. Kiselev wrote: Perhaps we should use the phonetic alphabet of the IPA (international phonetic association), Unicode characters 0x0250..0x02af, but then i know just know one font, Lucida Sans Unicode, having these, and where is a mail client supporting UTF8...? And what is that IPA you're talking about? The IPA seems to be an organisation that has defined a special alphabet for transcribing the phonetics of words. My English-German dictionary uses it to explain the pronounciation of the English words. If you have Windows NT, you can see the glyphs with charmap.exe in any Unicode font in the subset 'IPA extensions' Peter -- Peter Steiner [EMAIL PROTECTED] "Schtill! Was ziberlet dert näbem Tobelöhli z grachtige n uuf u aab?" - Franz Hohler -- -- View the TBUDL archive at http://tbudl.thebat.dutaint.com To send a message to the list moderation team click here: mailto:[EMAIL PROTECTED] To Unsubscribe from TBUDL, click below and send the generated message. mailto:[EMAIL PROTECTED] --
Re: OT: languages and spellings
Hello! On Monday, October 18, 1999, 10:01:11 PM, Alexander V. Kiselev wrote: Anybody knows whether this is covered by some RFCs or not? It's covered in the Unicode Standard (ISO 10646), but to find an online explanation is somewhat difficult... I've got a number of OL-made messages lately, charset was set to UTF-8 there. It was just plain gibberish. I mean, all the higher- bit letters were replaced by "?"s... BTW, that was OL'97... Was it UTF-8 or UTF-7? I vaguely remember having seen OL97 messages with UTF-7 (where some characters take up 4 bytes, but never use bit 8)... Let's try to explain: Unicode normally uses 2 bytes for each character; and the first 128 ones are the ascii ones. UTF-8 is a compression method for saving bandwidth or memory. All plain ASCII characters are left alone (and use only one byte), the Unicodes 128..2047 use two bytes (but somewhat transformed) and all other Unicodes use three bytes. Here's a table: Unicode hexadecimal Unicode as Bits UTF-8 as Bits 0x..0x007f 0xxx 0xxx 0x0080..0x07ff 0xxx 110x 10xx 0x0800..0x 1110 10xx 10xx Just fill in the bits at the x positions from one column into the other column without changing the order. Lets do an example: The famous seven-german-letters letter ý (schtsch, if i really recall it correctly) with Unicode 1065 (0x0429) would be transformed to UTF-8 'ò©' (0xd0a9) Sounds like Unicode is the future. Forget about all the other encoding systems? Is that what you're saying? In that case, I hope TB will support Unicode. I am not sure that we need it. What will we gain from this, after all? I'll bet, nothing but a heavy mess currently:-( It will give us the ability to mix characters from all all these codepages in one message. Now it's not possible to put cyrillic letters (koi8-r) together with the german umlauts (ISO 8859-1). Look at my signature in this message (in KOI8-R enconding) and compare it to the one of my other reply... ð£ÔÒ [thats about all i know in your language: my name :-( ] -- Peter Steiner [EMAIL PROTECTED] "Schtill! Was ziberlet dert nÄbem TobelÃhli z grachtige n uuf u aab?" - Franz Hohler -- -- View the TBUDL archive at http://tbudl.thebat.dutaint.com To send a message to the list moderation team click here: mailto:[EMAIL PROTECTED] To Unsubscribe from TBUDL, click below and send the generated message. mailto:[EMAIL PROTECTED] --
Re: OT: languages and spellings
Hi Thomas, TF In that case, I hope TB will support Unicode. That's from Stefan' letter: AD Is it possible to read messages written with AD UTF-8 (Unicode) charset? Not yet, but it is planned... -- Alexander Drunin Using The Bat! 1.36 under Windows NT 4.0 Build 1381 Service Pack 5 -- -- View the TBUDL archive at http://tbudl.thebat.dutaint.com To send a message to the list moderation team click here: mailto:[EMAIL PROTECTED] To Unsubscribe from TBUDL, click below and send the generated message. mailto:[EMAIL PROTECTED] --