Saying goodbye
Dear Unicode community, I wanted to take a moment to let you know that I am leaving my Administrative Director position at Unicode effective November 15, 2011. I have greatly enjoyed my twelve year tenure with Unicode and I truly appreciate having had the opportunity to work with all of you. Thank you for the support, guidance, and encouragement you have provided me during my time here and for many lasting friendships that were shaped along the way. I will miss my Unicode family a lot but there is an end to every fun ride and mine just happens to stop now. If you need to contact the Unicode office, please continue to use the reporting form at http://www.unicode.org/reporting.html as you have done in the past. Thanks again for some unforgettable years and wishing each one of you a happy and successful future. Sincerely, Magda Danish Magda Danish Sr. Administrative Director Unicode, Inc. ma...@unicode.org
FW: Subj: Displaying Chinese characters and Chu Nom characters
-Original Message- Date/Time:Sun Dec 12 18:57:05 CST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Opt Subject: Displaying Chinese characters and Chu Nom characters Dear Unicode, I am using Windows Xp pro, and have the Chinese simplified and traditional, as well as Japanese and Korean fonts and IME's installed on my computer. My question is if it is possible to have Chu Nom characters display in my browser (Mozilla Firefox v1.0). An example of the character can be found at this link, http://www.nomfoundation.org/nomdb/nom_details.php?codepoint=21a38img=1 ,,the Unicode number assigned to this character is U+21a38, and when inputting this number into the Unicode look-up chart, the box is blank. Is there a way i can make characters up in this block of Unicode display in my browser, and if so, what would i have to do? thank you for your time, Paul -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
New translation posted
A new translation of our web page What is Unicode? has been posted today. What is Unicode in Bulgarian was prepared by Mr. Ivan Neytchev and can be viewed at http://www.unicode.org/standard/translations/bulgarian.html. Thanks to all those who volunteered their time and effort, the number of languages covered by What is Unicode? page has more than doubled since its conception in early 2001. --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
[no subject]
A new translation has been posted on the Unicode website: What is Unicode? in Slovenian http://www.unicode.org/standard/translations/slovenian.html --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
New translation.
A new translation is now available now on the Unicode website, What is Unicode? in Vietnamese http://www.unicode.org/standard/translations/vietnamese.html Many thanks to James Do for his contribution. --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
Unicode announces a new membership level
Mountain View, CA, July 23, 2004 - The Unicode(r) Consortium announced today a restructuring of its membership levels, to provide smaller companies with an enhanced opportunity to actively participate in the ongoing development of the Unicode Standard. For more information, please see http://www.unicode.org/press/pr-sbm.html --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
Web Form: Subj: Jawi letters
-Original Message- From: Magda Danish (Unicode) Sent: Tuesday, July 20, 2004 10:02 AM To: '[EMAIL PROTECTED]' Subject: FW: Web Form: Subj: Jawi letters Yasmin, I am posting your letter to the Unicode mailing list. I have also subscribed you to the list so that you can receive answers directly. --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED] -Original Message- Date/Time:Tue Jul 20 09:40:55 CDT 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Opt Subject: Jawi letters Hi! I've tried to submit to the mailing list but somehow I managed to fail it. So I'm trying here instead. First a short presentation of the subject: I'm currently trying to write a program where kids can learn how to spell in a language called Bahasa Jawi (or Jawi in short). This is a language based on the Arabic letters. The letters can be found on this link: http://ms.wikipedia.org/wiki/Tulisan_Jawi For this program I have tried using the following letters: cha = \u0686 nga =\u06A0 pa = \u06A4 ga = \u06AC nya = \u06BD First I must say that the letter cha has worked perfectly as I want it to and I'm very pleased with it. However, neither the letter nga nor the letter nya have worked in a satisfying way due to their inability to bond to neighbouring letters. nga is supposed to behave just as it's closest sibblings ein (\u0639) and ghein (\u063A) and nya is supposed to behave as the letters ba (\u0628),ta (\u062A) and sa (\u062B) except that its dots are moved from being on top to be at the bottom (with the single dot at the rearest bottom and the two other dots just above it (like a ba with three dots). As for the letter ga you don't seem to have the real ga since it really should look like the letter kaf (\u06A9) but with one single dot above the base character as you can see in the link I posted above. Is it possible for you to create such a symbol anytime soon? Or could you possibly tell me how I can create it for my own use? And last but not least, I am very sorry to say that your letter pa doesn't work at all! All I see when the letter is written is a sqare or rectangle. Do you have any idea why this happens? I know it might be hard to do anything about these problems but I hope that you could help me with these problems by telling me how I could correct them myself or at least answering this letter so that I know that something is happening. Sincerely, Yasmin Khan -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Unicode Press Release
The Unicode(R) Consortium announced today the release of new versions of the Common Locale Data Repository (CLDR 1.1) and the Locale Data Markup Language specification (LDML 1.1), providing key building blocks for software to support the world's languages. This new release contains data for 247 locales, covering 78 languages and 118 countries. There are also 36 draft locales in the process of being developed, covering an additional 17 languages and 7 countries. For more information, see http://news.google.com/news?q=CLDR
RE: unicode site problem
Frank, It's fixed now. Magda Danish Sr. Administrative Director The Unicode Consortium From: [EMAIL PROTECTED] on behalf of Frank Yung-Fong Tang Sent: Thu 4/22/2004 9:27 AM To: Unicode Mailing List Subject: unicode site problem any one know who can fix http://www.unicode.org/reports/index.html ? all the links are broken
FW: Web Form: Subj: Unicode conversion- Microsoft Visual C++ compiler
Mino, I am sending your question to the Unicode public email list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list subscribers. Regards, --- Magda Danish Sr. Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED] -Original Message- Date/Time:Mon Apr 19 05:09:20 EDT 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Opt Subject: Unicode conversion I would like to convert a 2 byte Unicode code into its corresponding Unicode character (for instance the decimal 1063 or the hexadecimal 0427 into 'Ч'). Is there a C function in order to make the conversion? What file .h do I need to include in the C program? Can I use the 6.0 version of the Microsoft Visual C++ compiler, or do i need a newer version? Thanks a lot in advance. Mino Napoletano -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
New Translation
A new Croatian translation of What is Unicode? has been posted. Check it out at http://www.unicode.org/standard/translations/croatian.html and many thanks to the translator: Stjepan Brbot. --- Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
RE: New What is Unicode translation.
Don, Offers to translate What is Unicode? to a particular language should be addressed to the Unicode office. This can be done through our reporting form http://www.unicode.org/reporting.html or by emailing me directly. Magda PS: For everybody's convenience, we provide an html template for the translation as well as a set of translation formatting instructions. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED] Sent: Thursday, March 18, 2004 6:30 AM To: [EMAIL PROTECTED] Subject: Re: New What is Unicode translation. If someone were interested in translating to an additional language(s), to whom should they write? TIA... Don Osborn Bisharat.net Quoting Magda Danish \\(Unicode\\) [EMAIL PROTECTED]: What is Unicode in Finnish is now online thanks to Jarkko Hietaniemi. Check it out at http://www.unicode.org/standard/translations/finnish.html --- Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
New What is Unicode translation.
What is Unicode in Finnish is now online thanks to Jarkko Hietaniemi. Check it out at http://www.unicode.org/standard/translations/finnish.html --- Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED]
FW: Uyghur translation
Dear list subscribers, Is there anyone interested in taking on the project of turning the Uyghur translation page - referred to in James Kass' email below - into nominal forms of Arabic characters rather than presentation forms? Thanks, Magda Begin forwarded message: -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Wednesday, February 04, 2004 7:13 PM To: Magda Danish (Unicode) Subject: Re: New translation Hi, While checking out the new Turkish translation, it seemed like a good time to look at some of the others, too. http://www.unicode.org/standard/translations/uyghur.html The Uyghur page seems to use presentation forms directly encoded in UTF-8. (From Arabic Presentation Forms A and B). This shows up in BabelPad, which shows all of the code points in the page to be from the U+F... range. (This after copy/pasting from the browser to the editor.) Double-checked with Notepad through the browser's View- Source command, and based on the behavior of the display when spaces were interspersed, it really does look like these are all presentation forms. Cheers, James
FW: Web Form: Feedback about website: Netscape, ISO-8859-1 UTF-8
Steve, I am posting your email to the Unicode mailing list http://www.unicode.org/consortium/distlist.html for possible feedback from one of the list subscribers. --- Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED] -Original Message- Date/Time:Mon Mar 15 11:09:34 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Problems / Feedback about website I have noticed that when browsing pages on the Unicode website, the Character Coding selected by my Netscape 7.1 browser always seems to be ISO-8859-1 rather than UTF-8, despite all the pages having a meta http-equiv=Content-Type content=text/html; charset=utf-8 line in them. It occurred to me that the problem might relate to what HTTP response headers are being output, e.g. if they say ISO-8859-1 then perhaps Netscape selects that rather than what the page's meta tag says. Sure enough, using the Perl lwp-request program to display the response headers from a request for http://www.unicode.org/ I find that there are actually two Content-Type response headers -- the first says ISO-8859-1 and the second says utf-8. Is the output of two different Content-Type headers deliberate? If not, then removing the ISO-8859-1 response line might stop my Netscape browset getting confused. Regards, - Steve Hay -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question, Problem, or Feedback - Lucida Sans Unicode vs. MS Arial Unicode
Neil, I don't have an answer to your question. I am forwarding your email to the Unicode mailing list http://www.unicode.org/consortium/distlist.html. Hopefully someone on the list will provide you with an explanation. --- Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED] -Original Message- Date/Time:Thu Mar 4 11:49:23 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi people, Excuse my ignorance and lack of jargon, but this is the problem. I've a Word doc.s in the following languages Russian Greek Czech They've been sent to me in Times New Roman font. When I change the font to ariel unicode MS, it all starts to go horribly wrong. The Russian Greek text spreads evenly over the page as if there are no breaks between the words The Czech text Inserts spaces into words for no apparent reason - it's as if it's suggesting hyphenation breaks, without the punctuation appearing. ALSO, I've a Slovakian pdf that has been annotated, if I use the Unicode Exporter to create the Summary table, with the default font of Ariel Unicode MS, it substitutes one of the character's accents. Thereby creating a character that doesn't appear in the Slovakian language. However, all the above problems are solved if I use Lucida sans unicode instead of Ariel unicode. I was under the impression that Ariel Unicode was the font of choice for dealing with foreign languages. So how come Lucida Sans Unicode, seems to be able to do what the Ariel unicode can't ? To the best of my knowledge, I'm running a standard Windows 2000 and Acrobat 5.0 set up. All the documents supplied to me, should have come from the same basic platform. I've checked other sites / discussion groups - they all seem to mention earlier versions of Windows creating these sorts of errors. Any ideas ? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: Does use of Unicode charset in Oracle database affect performance?
Hello, I am posting your question to the Unicode public list for possible answer from one of our list subscribers. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 [EMAIL PROTECTED] -Original Message- Date/Time:Tue Mar 2 23:39:29 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I was wanting to know whether the use of Unicode charset in Oracle database will have any negative effect on database/application Performance. Please mail back ASAP because this information is urgently required. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Fonts display on Palm OS
Bob, I am forwarding your email to the Unicode public list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list's subscribers. --- Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921 -Original Message- Date/Time:Fri Feb 20 15:27:42 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hello and great site. In short - is there a way to modify or make my own 256 character set? I am working on displaying fonts on Palm OS with a program called iSilo. It appears that Palm uses an Ascii Chart very similar to that used by Windows (with a few PDA specific characters added). It also appears that Palm is limited to 256 characters. iSilo is a program that translates web pages into a Palm readable format. Greek letters will not display on the Palm if you use character set 1252 on none at all on your web page. If you set the character set to 1253 (Greek) and use Greek characters on your page, the default Latin characters in those Ascii positions will be displayed. In turn, if you use a custom font with Greek characters in those spots - you get Greek letters on your web page and your Palm. My problem is that I use arrows and other characters (such as the Male and Female symbols) not on any specific Character set I have found. I can make a custom font for these and the Greek letters, but need to make a custom character map. Is there a way of doing this? Thanks for your time. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: Unicode Conversion in MySQL database
Hello, I am posting your problem to the Unicode mailing list http://www.unicode.org/consortium/distlist.html. I hope someone on the list will have a suggestion for your problem. Regards, Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921 -Original Message- Date/Time:Mon Feb 9 08:44:20 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi, we have major Problems with unicode-conversion in use with database-stored content! We have a CMS-System which stores the content in a MySQL-Database with PHP. We build websites in japanese Languages! If an administrator put text-content into the system, the characters are automatically converted into a unicode format by the Database, something like _#_1_2_4_6_4_; (Note! I put a _ between each character to be sure it is not reconverted by your Mail-Client!)! We have no problem, generating dynamic HTML-Pages, because the browser reconverts the characters on-the-fly. We still can see the original conversion in the HTML-Source. But we have a lot of problems, if we want to put the content in a server- generated Mail-Message or in a Flash-Movie! Can you help us with this problem. Or do you know somebody, who can help us? Thank you very much! -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
New translation
Another translation has just been posted to the Unicode site: What is Unicode? in Turkish. Check it out at http://www.unicode.org/standard/translations/turkish.html --- Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921
FW: Web Form: Other Question: Etruscan,Sanscrit Linear B on ibook G4
Giovanni, I am posting your request to the Unicode list http://www.unicode.org/consortium/distlist.html for a possible answer to your query from the Unicode community. Maga Danish Administrative Director Unicode, Inc. Date/Time:Wed Jan 28 08:14:14 EST 2004 Contact: jklp.email.it Report Type: Other Question, Problem, or Feedback Gentle sirs, I ask to you. I study ancient languages at the university of Naples, I need to write with etruscan, sanscrit, linear B characters and so on, I have a Macintosh ibook G4 and there si the unicode gamma of characters but it is not complete, it is not fill and then I can't use even the few characters I have. Nobady here can help me! Can you? Or can you say to me to who I can ask here in Naples? Many thanks Giovanni -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Using Unicode Han in MS Access
My apologies for the double posting. I forgot to copy the originator. Magda -Original Message- From: Magda Danish (Unicode) Sent: Monday, January 26, 2004 10:35 AM To: '[EMAIL PROTECTED]' Subject: FW: Web Form: Using Unicode Han in MS Access Louis, I am posting your question to the Unicode list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list subscribers. --- Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921 -Original Message- Date/Time:Sun Jan 25 06:00:09 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Technical Report or Tech Note issues Through a macro I utilize Han unicode without problems in Word from Office 2000. How could I do to do the same in MSACCESS ? Thanks for your help ! -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Using Unicode Han in MS Access
Louis, I am posting your question to the Unicode list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list subscribers. --- Magda Danish Administrative Director The Unicode Consortium +1 650-693-3921 -Original Message- Date/Time:Sun Jan 25 06:00:09 EST 2004 Contact: [EMAIL PROTECTED] Report Type: Technical Report or Tech Note issues Through a macro I utilize Han unicode without problems in Word from Office 2000. How could I do to do the same in MSACCESS ? Thanks for your help ! -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: General question
Hello, I don't know the answer to your question so I am forwarding it to the Unicode list. I hope someone on the list will reply. Best, Magda -- Magda Danish Administrative Director The Unicode Consortium 650.693.3921 [EMAIL PROTECTED] -Original Message- Date/Time:Fri Nov 21 20:26:56 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback The western alphabet has an equivalent for people who are unsighted. Is there an equivalent for languages that use other characters...Cyrilic, arabic.? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question, Problem, or Feedback
Joel, I am posting your question to the Unicode list www.unicode.org/consortium/distlist.html and hope that one of the subscribers will have an answer to your question. Magda Danish Administrative Director The Unicode Consortium 650-693-3010 [EMAIL PROTECTED] -Original Message- Date/Time:Fri Nov 14 14:36:26 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I am a novice to using Unicode, but I can create simple web-pages. I have some old MSDOS programs that display box elements that I would like to show on my web-site. I saw some info about 'Box' elements using codes of 2510,2511 etc., but I have no idea how to put these on a web-page. One of your pages show an example of using various languages, using TD lang=en. If this is how to do it, please send me an example. Any help will be greatly appreciated. Joel Platt -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question, Problem, or Feedback
Mr. Nikolai, I am forwarding yopur email to the unicode mailing list http://www.unicode.org/consortium/distlist.html for a possible answer from one of the list subscribers. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Tue Oct 21 07:54:01 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hello Unicode-Team, i'm looking for a tool or a tutorial to convert japanese signs in numeric unicode signs (e.g. #30041;). Can you help me? Greetings from Germany T. Nikolai please mail to: [EMAIL PROTECTED] -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Preliminary minutes from UTC 96 (August 2003) posted publicly
The preliminary minutes from UTC 96 (August 2003) have beenpostedfor public access at http://www.unicode.org/consortium/utc-minutes.html Magda Danish Administrative Director The Unicode Consortium 650-693-3921
FW: Web Form: Other Question: CJK
Roberto, I am forwarding your question to the Unicode mailing list for possible answers from the list's subscribers. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Thu Oct 9 10:20:19 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi at all, i have a little question: Characters in the unicode range U+4E00 and U+9FFF are Unified Ideographs for CJK languages. This means that all characters are togheter for Chinense, Japanese and Korean languages? If i take a charcters for, example U+4E01, this is a valid character for all three languages? My problem is to recognize from the 32 bit value of unicode character if this is a chinese character or korean or japanese. How can do this? I develop international application under win98, win200 with Visual Studio 6.0 thanks a lot. Roberto (ITALY) -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: IPA download?
-Original Message- Date/Time:Mon Oct 6 13:56:18 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Madam/Sir, I am currently looking for an International Phonetic Alphabet download/plugin for Flash and do not know where to find this. Can you help? Best wishes, Guy Abbott -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: British pound sign - U+00A3
Jim, I am forwarding your email to the Unicode list http://www.unicode.org/consortium/distlist.html for possible answers from the list subscribers. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Wed Oct 1 05:19:00 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi, I'm a web developer at Oxford University in the UK, and we are considering encoding all our websites in Unicode to allow support of non-western languages. However, we have a problem. [...] Our problem is the representation of the £ sign (British pound sign - U+00A3). When we type this character into our pages and then set the character encoding in our pages to Unicode (UTF-8) (either by setting it directly in the HTTP header, or setting it using the meta http-equiv=Content-Type content=text/html; charset=utf-8 tag), when we view the pages we see the standard ASCII set of characters, but the Pound sign displays as an error. This happens when we use Netscape 7.02, and IE 6.0 (both very modern browsers. Is there something obvious that I am missing? If there is then I would very much appreciate it if you explain it in as simple terms as possible as I am a real novice in this area. Also which version of Unicode does HTML 4.0 support using escape characters (eg. #163)? With this problem with our pages we are seriously considering abandoning Unicode for ISO-8859-1. Thanks in advance, Jim Leek -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: Unicode characters in Form in MSAccess
Hi, I am forwarding your question to the Unicode list for possible answer from one of the list subscrbers. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Tue Sep 23 04:06:15 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Greeings, I have a problem using Unicode outeside the web. while in IE5.0 or higher ,when I choose UTF-8 encoding I can see unicode data easily.These data are stored in a SQL2000 database.But when I want to print them in a Form in MSAccess or Crystal Reports9.0(i.e. Outside IE 5.0) the characters are unreadable. What shall I do to overcome this problem? Best Regards M.Janbeglou -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: Other Question: Java Tool
Anne, I am forwarding your email to the Unicode Public Email list http://www.unicode.org/consortium/distlist.html. I hope someone will be able to answer your question. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Fri Sep 19 05:38:35 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Unicode-Team, my name is Anne Gleitsmann. My task is to implement a tool in java that administers different Ressource-Bundles. In my tool you can choose a master-document and one or more slave-documents, then follows the data-comparision. The document is being displayed in JTables. Now the language-variety has been expanded to include Japanese and Korean - and that is where my problem is: the font of these languages is shown as little squares. I found the information in the internet that I need to add the range of those characters to the font.properties file - but how do I do this? Could you help me or give me advise as to how to do this? Thank you! Greetings from Germany! Sincerely Anne Gleitsmann -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
RE: Mailing lists
Pim Blokland wrote: Where can I find information about which mailing lists there are I think you pretty much listed them all below. The Unicode website only mentions [EMAIL PROTECTED], for members of the Consortium, and this public one, [EMAIL PROTECTED] Recently a new one was added, [EMAIL PROTECTED], which is not mentioned on the website, presumably because it is too new, and I can understand that. But what about all the other email addresses I see coming by? [EMAIL PROTECTED], [EMAIL PROTECTED], [EMAIL PROTECTED], etc? Are those all mailing lists? Offtopic isn't a valid list. It was suggested but I don't think our webmaster actually created it. The others are all valid. If so, are they open to everybody (including non-UC members)? Yes If so, how do I subscribe? Send an email to [EMAIL PROTECTED] with subscribe list name in the subject line. Ex. Subscribe hebrew Are they archived; if so, where? No they are not, at least not that I know of. If you have any trouble subscribing, please let me know. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921
FW: Web Form: Other Question: Mapping table
-Original Message- Date/Time:Tue Aug 19 13:20:07 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Sirs, Is there a mapping table from Unicode 4.0 to the GB 18030-2000 standard available for download? Thanks in advance, Will McKee. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
RE: Pre-orders of The Unicode Standard, Version 4.0
-Original Message- From: John Cowan [mailto:[EMAIL PROTECTED] Sent: Thursday, August 14, 2003 10:20 AM To: Magda Danish (Unicode) Cc: Unicode Core List; [EMAIL PROTECTED] Subject: Re: Pre-orders of The Unicode Standard, Version 4.0 Thanks. Is the Unicode Consortium in any way benefited (or disadvantaged) if non-members order through it rather than through Amazon or BN? The Unicode Consortium has an Associate agreement with both Amazon and BN so we do benefit from members and/or non-members purchasing the book through either of them, as long as they follow the link (to Amazon or BN) from the Unicode website. Magda
Pre-orders of The Unicode Standard, Version 4.0
Dear Unicode and Unicore List Subscribers, The release of the Unicode Standard, Version 4.0 is right around the corner. There is still time to place your individual or group orders and to get the book sent to you directly from the publisher, fresh off the press. Anyone placing bulk orders is highly encouraged to do so by August 20 as this will substantially speed up the delivery time. Full members of the Consortium receive 20% discount, Associate and Specialist members receive 10% off the list price of $74.99. To order, please use the the book order form at http://www.unicode.org/book/bookform.html Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921
FW: Web Form: Other Question: Unicode character in Visual C++6 w/ MSComm control
-Original Message- Date/Time:Wed Aug 6 12:03:43 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hello , I have Problem with Unicode character in Visual C++ 6 with the MSComm control When the OS is using Unicode character (chinese, ...), a com data can't be stablished : because of MSCOMM. In such OS MSCOMM32 transforms some double bytes in one byte when the first byte is 0x80. For exemple : 0xC0 0x21 == 0x3F. Could you help me, Regards, -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW:transform a (UNICODE) accented character to its equivalent (UNICODE) non-accented character
-Original Message- Date/Time:Tue Aug 5 09:19:18 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I'm looking for the easiest and more stable way to transform an (UNICODE) accented character to its equivalent (UNICODE) non-accented character. Do you know any product that do so (windows .NET component if possible) or do you know any technical article providing information on how to do that task ? Thanks for your help -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) IP of Sender: 213.30.159.66
FW: Using UniCode International Charecters with Peachtree Accounting Program
Hi, Can anyone on the list help me respond to this inquiry? Thanks Magda Date/Time:Sat Jul 19 16:38:16 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Using UniCode International Charecters with Peachtree Accounting Program Up to Peachtrre Version 2003 we were able to use Unicode for entering International Charecters, such as Arabic, in the description field of Invoices, Receipts, Purchase and Payments. We enter Arabic text in the description field and were able to use peachtree succesfuly for our company's accounting. With version 2004 Unicode charecters are displayed as empty square boxes. it appears that the font used in input templates has changed and does not allow for foreign characters to display correctly. Unicode, to my understanding, was adopted by Microsoft in Windows 2000 and Windows XP and is a character set for protocols evolving beyond the use of ASCII, that enables a far greater range of names and characters than can be achieved using ASCII or extended ASCII encoding for character data. If we type Unicode in the search box of Windows XP Help, we will find this explained. Windows XP has a language bar that can be used to switch the language (using Unicode) and that makes us able to enter characters in whatever language is installed besides English. My Regional setting in the Control Panel is set correctly and Arabic works in most other Programs including Peachtree Complete 2003. The print driver is set correctly and prints Arabic correctly even with Peachtree Premium 2004 as it did with version 2003 In previous versions of Peachtree up to version 2003 we were able to enter foreign Characters in the description field of Invoices, Receipts, Purchase and Payments. With version 2004, it appears that the font used in input templates has changed and does not allow for foreign characters to display correctly. In PPA 2004 Foreign characters are displayed as empty square boxes when we activate the extended characters by pressing Alt+Shift (or changing the language from the language bar) This problem does not happen with PCA 2003 on the same computer and with all other programs. It appears that the font used in these input templates is not the same font used in previous versions of Peachtree. How can this problem be solved ? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Does the Unix-based LYNX browser have issues diplaying UTF-16?
Does anyone on the Unicode list have an answer to this question? Please make sure to copy [EMAIL PROTECTED] Thanks, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Mon Jun 2 16:26:11 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Error Report Question: Does the Unix-based LYNX browser have issues diplaying UTF-16? I have an internal app within my firm that uses it, and it is displaying the entire source for a set of pages instead of plain text. Please, any information, suggestions or directions would help - Scott Hudnall Lead Developer/Content Manager Corporate Intranet ALLTEL 501.905.4387 [EMAIL PROTECTED] -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) IP of Sender: 198.133.100.130
Announcement: New Unicode Savvy Logo
Title: Announcement: New Unicode Savvy Logo Dear Unicoders, Very often the Unicode Consortium has received requests from webmasters who wished to indicate with a logo or banner that their site supports or uses Unicode. For such purposes we have developed two logos that can be freely displayed on web sites. You can use a Unicode Savvy logo to indicate that a page (or collection of pages) is encoded in Unicode. To learn more and to obtain an image of these logos, please refer to http://www.unicode.org/consortium/unisavvy.html. Thank you and best regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921
FW: Web Form: Problems - Hebrew in Java applet
-Original Message- Date/Time:Sun May 25 18:34:01 EDT 2003 Contact: [EMAIL PROTECTED] Report Type: Problems / Feedback about website Hello, I am trying to include Hebrew into a Java applet and I have a question. Sometimes you need to include a letter with a dash underneath. For example, on page: http://www.unicode.org/charts/PDF/U0590.pdf you may want to combine \u05D0 and \u05B7 into one character. Is it possible to do that? I check your FAQ but could not find an answer. Thank you very much for your help. Jacobo Bulaevsky -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) IP of Sender: 64.172.58.116
FW: Web Form: Other Question - NCR 96 codepage to Unicode Codepage
Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Mon Mar 17 02:59:09 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Sir, We are using a character set conversion table for converting NCR 96 codepage to Unicode Codepage. However when we try to view this converted contents on the internet explorer with Unicode (UTF-8) encoding, we are getting wrong output than what we expected. some parts of the codepage conversion table is given below. 0xA0 NOT USED 0xA1 NOT USED 0xA2 NOT USED 0xA3 0x061F 0xA4 0x060C 0xA5 0x066A 0xA6 0x0640 0xA7 0x066C 0xA8 NOT USED 0xA9 NOT USED 0xAA 0x066D 0xAB NOT USED 0xAC NOT USED 0xAD NOT USED 0xAE NOT USED 0xAF NOT USED 0xB0 0x0660 0xB1 0x0661 0xB2 0x0662 0xB3 0x0663 0xB4 0x0664 0xB5 0x0665 0xB6 0x0666 0xB7 0x0667 0xB8 0x0668 0xB9 0x0669 0xBA 0x0622 0xBB 0x0623 0xBC 0x0624 0xBD 0x0626 0xBE 0x06CE 0xBF 0x0621 I would be very grateful if you can help me solve this problem, Best Regards Shafeeq O P Computer Programer Sharjah Chamber of Commerce and Industry. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) IP of Sender: 213.42.1.171
FW: ZWNJ Persian Collation
Title: Message Please make sure to copy Vladimir[EMAIL PROTECTED] on your reply. Thanks, Magda -Original Message- From: Vladimir Ivanov [mailto:[EMAIL PROTECTED]] Sent: Tuesday, March 11, 2003 6:22 AM To: Magda Danish (Unicode) Subject: ZWNJ Persian Collation Dear Magda, Excuse for bothering you again, but my message was rejected by some server on its way to [EMAIL PROTECTED] . May I ask you to publish my question below? Thank you, Vladimir. Sorting Persian words with a utility, based on version 3.1.1 of tailored Allkeys Table http://www.unicode.org/reports/tr10/#AllKeys, Ive encountered a problem that affects the lexicographical order of the words in a dictionary. To my mind, ZWNJ (zero width non-joiner) U+200C (also found among MS Word Special Characters/No-width Optional Break), was invented to prevent connection of Arabic letters within a word. It is used in Persian to show the morphemic boundary in compound words like xnedri household. The latter consists of the word xne house + verb stem dr hold + suffix i. It can be transliterated like xne + ZWNJ + dri. There are thousands words with similar structure in Persian, Dari, Tajik and neighboring languages. It is clearly seen that there are letters on both sides of ZWNJ within the word boundaries. Placing ZWNJ on an edge of the word doesnt make sense in Persian. From this point of view ZWNJ should be treated as a special character rather than a delimiter. But in Allkeys Table it is placed on line #68 well before other popular delimiters: HORIZONTAL TABULATION line #192, LINE FEED line #193, CARRIAGE RETURN line #196, SPACE line #197 etc. Such an ordering gives wrong sorting results for Persian dictionaries: compound words like xnedri household appear in the list before their components like xne house. Ive sold this problem for myself by placing ZWNJ somewhere after delimiters, but what are the theoretical reasons for putting it before them? In order to get what? In what languages? Is it a Persian specific problem or a global one? Are there languages where ZWNJ marks a word boundary? By the way, the sorting algorithm built into MS Windows puts compound words with ZWNJ AFTER their simple components. So in this respect it acts on the principles different from Allkeys Table. Thank you, Vladimir Ivanov
Unicode VB
-Original Message- Date/Time:Fri Mar 7 12:44:47 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I was wondering when writing code for a program in Visual Basic.NET. Just a very very simple code that converts charactersto Code. could you tell me what the code would be for converting it to or from Unicode. I know that you use Asc(..) for ASCII similarly what do you use for unicode. Thank you. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) IP of Sender: 63.201.36.100
The Unicode Standard V4.0
Title: The Unicode Standard V4.0 Dear Unicode Enthusiast, The Unicode Editorial Committee has moved into high gear with the start of 2003 to bring you The Unicode Standard 4.0 in September of this year. The deadline for members' acknowledgement in the foreword of the new book has been extended to May 1st. If you are not a member yet, please support the Unicode effort by joining us. Logon to http://www.unicode.org/consortium/join.html and learn more about our membership benefits. Thank you, The Unicode team.
Need help writing Rumanian Using Appleworks on iMac
-Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Monday, February 17, 2003 2:27 PM To: Magda Danish (Unicode) Cc: [EMAIL PROTECTED] Subject: Second try Dear Magda, Many thanks for your kind willingness to help me. Here is my problem: I don't know how to place a cedilla under the letters s and/or t, which, in Rumanian, would make them sound sh and tz, respectively. I am using the new iMac (flat screen) which provides me with Appleworks Word Processing. I hope to hear from you soon and remain, Sincerely, Ted Avitahl
Web Form: Old Russian charcaters
Can anyone on the Unicode list help? Thanks, Magda -Original Message- Date/Time:Thu Jan 16 13:11:06 EST 2003 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I am looking for a way to use old Russian charcaters that are no longer used in modern Russian langauage. Can you help or provide a direction in which to look? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
converting devanagari to mangal unicode
-Original Message- Date/Time:Mon Dec 16 11:28:22 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Submission (FAQ, Tech Note) HI I am Gis/Website developer my query is I have a data in devanagri true type font i want to convert this data into mangal unicode. I want to know whether any converter is available for converting devanagari to mangal unicode. Please reply ASAP Sunil -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Mapping from HTML to Unicode
-Original Message- Date/Time:Fri Dec 13 12:53:33 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Problems / Feedback about website Good evening/morning I was in search for a mapping table from HTML entities (such as nbsp;) to unicode charachters (for the eg A0), but had no luck up to now. Maybe you are able to help? Kind regards Hermes Glarner PS: Keep on your work folks (wish I could join that group, but alas, work...) -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Farsi Keheh +06A9 vs. Arabic Kaf +0643 ??
Hello, I received this email and was hoping that someone on the list (Roozbeh, maybe) would shed some light on this issue. Please cc Laura Tull [EMAIL PROTECTED] Thank you, Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 BECOME A MEMBER OF THE UNICODE CONSORTIUM NOW BE AMONG THE FIRST TO RECEIVE YOUR FREE COPY OF THE UNICODE STANDARD V4.0 AND WE'LL ADD YOUR NAME IN THE ACKNOWLEDGEMENT SECTION OF THE NEW BOOK DETAILS SOON ON http://www.unicode.org/unicode/consortium/join.html -Original Message- Date/Time:Wed Dec 11 09:55:25 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I'm trying to find out some information about the extended Arabic character KEHEH (06A9), which the code chart indicates is used in the Persian (Farsi) language. Staff familiar with Persian in the library use the Arabic letter KAF (0643) instead of this and the two characters look the same to them. What is the difference? If no difference, why two code points? This was noticed as soon as we tried using the Farsi(Persian) keyboard in Windows 2000. It contains the KEHEH character but does not have a KAF. I have already made inquiries of Microsoft about this and the character that looks like the KAF to us definitely maps to 06A9. Is there an expert in the Persian language who could help? Laura Tull Systems Librarian Ohio State University -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Anyone who can write Hindi on the Unicode List?
The Unicode office has received this email claiming that our page What is Unicode in Hindi is incorrect. Can anyone verify this. Thank you Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- From: Anirudh Pandya [mailto:[EMAIL PROTECTED]] Sent: Wednesday, November 20, 2002 2:54 AM To: Magda Danish (Unicode) Subject: RE: Web Form: Error Report Importance: High Dear Magda, Thank you for the prompt response. We appreciate it. I am a native Hindi speaker and am sending you this email from India. I am using IE 6 with SP1 installed. I am attaching (an HTML file) the corrected spelling of the transliteration of 'Unicode' on the hindi site. The word Unicode in hindi is not the only word mis-spelt. Most of the errors occur in the usage of the vowel 'e' (Unicode character - 093F). regards, Anirudh Pandya 'Rudy' -Original Message- From: Magda Danish (Unicode) [mailto:[EMAIL PROTECTED]] Sent: Wednesday, November 20, 2002 11:20 AM To: [EMAIL PROTECTED] Subject: FW: Web Form: Error Report Rudy, Thank you for your encouragement. The Hindi translation is correct, it's been verified by native Hindi speakers and we haven't had any complaints so far - It's been up for almost a year now. It is more likely that you need to upgrade yor browser version in order for it to display correctly. Date/Time:Tue Nov 19 01:37:19 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Error Report The translation of the text presented at http://www.unicode.org/unicode/standard/translations/hindi.html; is INCORRECT. Please correct the translated Hindi text or the translation engine software that is translating the data from english. The 'matras' are displayed on the character following the one that needs to be correctly associated with. Otherwise you folks are doing great work!!! Rudy -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) Title: What is Unicode? in Hindi Original (as on website) ? Corrected - ?
Emergency help required!
Title: Message Hi Unicoders, Does anyone have any idea what is causing this person's computer to freeze over a "Unicode" issue. Any help will be greatly appreciated. Magda. -Original Message-From: barbara kolender [mailto:[EMAIL PROTECTED]] Sent: Thursday, November 14, 2002 12:04 PMTo: [EMAIL PROTECTED]Cc: [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED]Subject: error message As per our conversation, The error message: "CAN NOT RUN UNICODE VERSION OF ATL.DLL ON WINDOWS 95. PLEASE INSTALL CURRENT VERSION" appears everytime I open anything on my computer. We are running Windows 98. This message is causing the computer to freeze constantly and I cannot run any new programs. PLEASE HELP. Thanks, Barbara Merrie-Paralegals Law Office of Marc Simon Protect your PC - Click here for McAfee.com VirusScan Online
Double Byte Character Set (DBCS)
Paul, I am forwarding your inquiry to the Unicode list. I hope someone on the list will be able to address your question. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Wed Nov 13 09:32:39 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Sir/Madam, I am writing on behalf of a global software company called Rebus iS. Rebus iS has developed an underwriting package that has been very successful in Europeon markets. The application runs on an AS400 device but was developed primarily for English speaking countries. We are now looking to expand the market for this product into countries such as China. To achieve this I have been informed we need to enable our application for Double Byte Character Set (DBCS). This confuses me when I read of UNICODE. If the AS400 supports UNICODE, and assuming DBCS UNICODE are mutually exclusive, would it not make more sense to enable for UNICODE only from the outset? I would be grateful for your assistance in this matter. Kind Regards Paul Downey Rebus Insurance Systems Limited Registered No. 508212 England Registered Office Suffolk House, 102 - 108 Baxter Avenue, Southend-on-Sea, Essex SS2 6JP United Kingdom Tel:+44 (0) 1702 236691 Fax:+44 (0) 1702 353276 http://www.rebusis.com -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Question: the german umlaut
-Original Message- Date/Time:Fri Nov 8 09:05:40 EST 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hello I just wanted to know how much space in bytes the Latin-1 characters such as the german umlaut characters take up in UTF-8 encoding. Is it still just one byte or does it now require 2 bytes? Regards, Magnus Rosenberg -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Toned Greek Capital Vocals
-Original Message- Date/Time:Fri Oct 25 08:12:22 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Most browsers do not support Toned Greek Capital Vocals and I can't find this code in Uni-Coding. If you can read greek the letters I'm reffered to are: Έ , Ά , Ύ , Ό , Π, Ί and Ή . Is there a code that I can't find? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
CJK-Ext B, Fonts and input methods - Question from SAP
Christian, I am forwarding your question to the Unicode list. If you're not subscribed to the list please go to http://www.unicode.org/unicode/consortium/distlist.html#4 and self-subscribe so you can follow the thread. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Thu Oct 24 05:06:48 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Dear Sir/Madam, in the newer Unicode versions you started to occupy the surrogate area with new characters (e.g. CJK Unified Ideographs Extension B Range: 2–2A6DF The Unicode Standard 3.1). There are several mappings like the Hongkong supplementary character set using those surrogate characters (http://www.info.gov.hk/digital21/eng/hkscs/download/big5-iso. txt), but unfortunately there are no fonts or input methods that support those surrogate mappings. (Actually even the Government of the Hong Kong Special Administrative Region Information Technology Services Department has no plans to develop a surrogate mapping version for fonts and input methods. Instead they use a mapping to the PUA U+E000 to U+F8FF). Do you know of any font and input method that does support the surrogate mapping? For our applications it is very important not to have any duplicate definitions of the same character in the database, whereas we would like to implement the state of the art mapping (surrogate are) rather than having the PUA occupied. If we start with the PUA mapping we cannot change it later. But without the input method and the correct font it is difficult to pull it off. Best regards, Dr. Christian Hansen Server Technology Internationalization SAp AG Walldorf -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
US7ASCII, UTF8 and ZHT16BIG5 in database configurations
Ronald, I am forwarding your question to the Unicode list. If you're not subscribed to the list please go to http://www.unicode.org/unicode/consortium/distlist.html#4 and self-subscribe so you can follow the thread. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Thu Oct 24 05:11:30 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi, This is Ronald TO working on a data warehouse project. Following is a problem that we have. Source systems are using different character sets for their database configurations. They are namely, US7ASCII, UTF8 and ZHT16BIG5, so far! The target repository is tentatively UTF8. We are using a ETL tool called DataStage. A simple test has been performed between UTF8 and ZHT16BIG5. I guess you knew the result already, Chinese characters that move across are wrongly recognized when transfer from UTF8 to ZHT16BIG5 and length error if transfer from ZHT16BIG5 to UTF8. English characters transfer produce no error in either directions. The immediate solution is of course, no Chinese character data. But then source system need not configure their database with ZHT16BIG5. Alternatively, use ZHT16BIG5 as the configuration of the data warehouse. Again system that rely on output from the data warehouse has to be ZHT16BIG5 as well. A simple rule of thumb: given the data warehouse is ZHT16BIG5, source system can be either ZHT16BIG5, UTF8 or perhaps US7ASCII and systems that rely on output from data warehouse has to be ZHT16BIG5. Please enlighten. Regards, Ronald TO -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Question from IBM about CJK
Ashish, I am forwarding your question to the Unicode list. If you're not subscribed to the list please go to http://www.unicode.org/unicode/consortium/distlist.html#4 and self-subscribe so you can follow the thread. Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Thu Oct 24 02:16:47 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi Unicode team, How can I find out the unicode characters for languages like GERMAN, SPANISH, RUSSIAN, JAPANESE, CHINESE, KOREAN, etc. I have all the unicode charts, but they are listed as CJK how will I know which character is Chinese and which one is Japanese. Any help will be highly appreciated. Keep up the good work ! Regards, Ashish A. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Software tool for Greek report designing?
-Original Message- Date/Time:Wed Oct 16 14:18:15 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi. I am developing a desktop application in Visual Basic 6 programming language. All the reporting has to include Greek Unicode characters. Can you suggest any Software tool capable for reports designing that will accept Unicode text labels? Thanks for your time. Regards Panos Adam -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Sorting on number of strokes for Traditional Chinese
-Original Message- Date/Time:Tue Oct 15 05:13:41 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback To whom concerns, I wonder Unicode provide us a way to do sorting on number of strokes for Traditional Chinese characters. This is urgent, please advise. regards Tony -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Hindi keyboard with the Microsoft Hindi font Mangal
-Original Message- Date/Time:Mon Oct 14 19:55:51 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Submission (FAQ, Tech Note) I am trying to use the Hindi keyboard with the Hindi font Mangal provided by Microsoft. I'm not understanding what key I should press to get the ZWJ (Zero-Width Joiner) as well as the Zero-Width-Non-Joiner (ZWNJ). I've looked at various keyboard mappings online and not found how to get these characters. I'm able to use these two in Microsoft Word if I go into the character map and find them by their Unicode number value and then assign a keyboard shortcut to them, but otherwise, I've not met with success. Please advise urgently. Thanks. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Chinese in VB
-Original Message- From: Uni d'Angelo [mailto:[EMAIL PROTECTED]] Sent: Sat 10/12/2002 12:23 PM To: [EMAIL PROTECTED]; [EMAIL PROTECTED]; [EMAIL PROTECTED] Cc: Subject: Web Form: Other Question, Problem, or Feedback -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- Date/Time:Sat Oct 12 15:23:59 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback I am trying to write an application that can read input in Tradisional Chinese but output (printout on papers)in Simplified Chinese, without any 3rd party software (e.g. ChineseStar, TwinBridge). How can I implement Unicode in the coding? The programming language I'm using is Ms Visual Basic 6 Professional Edition. Will appreciate much if you can provide sample source codes and the related mapping tables. Please reply my requery as soon as possible as it is very urgent for me to solve the problem faced. Thank you. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: MR in superscript - Spanish translation for TM
Can someone help me reply to this inquiry. Thanks, Magda. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] Sent: Sunday, November 10, 2002 3:46 AM To: Tex Texin Cc: [EMAIL PROTECTED]; Magda Danish (Unicode) Subject: Re: MR in superscript - Spanish translation for TM Hi Tex. I cannot use html code. People in 3M St. Paul that are loading our pages asked me the following: If you can send me the UTF-8 characters for MR, I can create what we call a scrublet that will fix this problem. Thank you, Cristina - Cristina, In html, you can use the SUP tag to make the enclosed text superscript: SUP...superscript.../SUP SUPMR/SUP Alternatively, you can use CSS styles to make text superscript: SPAN STYLE=vertical-align: super;MR/span (http://www.w3.org/TR/REC-CSS2/visudet.html#propdef-vertical-align) tex Magda Danish (Unicode) wrote: -Original Message- Date/Time:Wed Oct 9 17:27:43 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi, I need to have in UTF-8 code or Unicode of the following word MR but in superscript way, in order to use it when in our Web Site we are speaking of a Trademark(TM). MR is use the translation in Spanish for TM. Could you please assist me? Thanks, Cristina Digitization Coordinator 3M Chile -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report) -- - Tex Texin cell: +1 781 789 1898 mailto:[EMAIL PROTECTED] Xen Master http://www.i18nGuy.com XenCraft http://www.XenCraft.com Making e-Business Work Around the World -
MR in superscript - Spanish translation for TM
-Original Message- Date/Time:Wed Oct 9 17:27:43 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback Hi, I need to have in UTF-8 code or Unicode of the following word MR but in superscript way, in order to use it when in our Web Site we are speaking of a Trademark(TM). MR is use the translation in Spanish for TM. Could you please assist me? Thanks, Cristina Digitization Coordinator 3M Chile -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
unallocated Unicode character and VB
-Original Message- Date/Time:Thu Oct 3 12:24:19 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback When an unallocated Unicode character is placed into a MS Word document it appears as a square, or as a question mark in a text document. Can the value of this apparent null/blank character be determined from within a Visual Basic program? For example, if Russian 'A' is chrw(1040, what is the equivalent value for this blank character? I have looked for 20 minutes or more at your fascinating website, but cannot find the answer to this question. Regards John Harrison Moscow Narodny Bank London, UK -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
French or German Unicode Names??
-Original Message- Date/Time:Tue Sep 17 01:20:08 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: Other Question, Problem, or Feedback HI, We are trying to find a set of the unicode tables with the character labels in French, and one where they are in German. Do you have these available, or can you point us in the direction of where we might find them please? Kind regards, Maryanne Hughes Technical Writer, Pulse Data International, New Zealand -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
UTF-8 to EBCDIC
-Original Message- From: Vishweshwaraiah, Balasubramanya [mailto:[EMAIL PROTECTED]] Sent: Tuesday, July 30, 2002 2:52 PM To: Magda Danish (Unicode) Subject: RE: Web Form: General question Magda Danish, Thanks a lot for your interest in helping me by giving suggestions. I visited the site you mentioned in your reply. I didn't get any idea about how to do the conversion from UTF-8 to EBCDIC or I may be thinking in wrong direction. All I understood from that site is the equivalent ebcdic code(cp037) for each Unicode character. If I have a UTF-8 formatted file, how does this table helps me to do the conversion?. Kindly advice me if I misunderstood any thing. Thanks Balu. -Original Message- From: Magda Danish (Unicode) [mailto:[EMAIL PROTECTED]] Sent: Thursday, July 25, 2002 3:27 PM To: [EMAIL PROTECTED] Subject: FW: Web Form: General question Have you tried looking at http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/EBCDIC/ Best Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- Date/Time:Thu Jul 25 12:51:15 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: General question Hi, I need to convert UTF-8 format files to EBCDIC. Could you please suggest me any available tool that does this conversion or How to do this conversion. Thanking you lot in anticipation of quick response. With Best regards Balu. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Subscript Superscript
-Original Message- Date/Time:Tue Jul 30 12:26:40 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: FAQ Suggestion We need to know how to express a Subscript letter in Unicode. On your site, we've found in 2070-208E how to express a Superscript letter or number or a Subscript number, but there is no information about how to write a Subscript letter. We're using the XML Authoring Software Epic developed by Arbortext. We need to be able to express mathmatical formulas in XML and we're trying to use Unicode to do it. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW:Unicode in MS Excel
-Original Message- Now I have a Unicode question. In MS Excel, the worksheet function CODE gives the ASCII value of a character and CHAR gives the character corresponding to a decimal number. Thus CODE(a) gives 97 and CHAR(97) gives the letter a. These two functions do NOT work with Unicode. Is there anything that does? Thanks for any help you can find. Clopper Almon Professor of Economics University of Maryland [EMAIL PROTECTED] 301-405-4604
How to test apps for UTF-8 compliance
If anyone on the list can help, please reply directly to [EMAIL PROTECTED] Thanks, Magda. - Magda Danish Administrative Director The Unicode Consortium 650-693-3921 - -Original Message- Date/Time:Thu Jun 13 04:42:43 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- Can you advise how to test applications (Java enterprise) for UTF-8 compliance or point me in the direction of an appropriate vendor for testing tools? Thankyou Robert Crowther -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
FW: Web Form: General question
-Original Message- Date/Time:Mon May 27 20:58:22 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- When I use an editor(written by Python language) in Pc(Windows 2000), some Unicode can be shown correctly, such as square root symbol(u221A) and integration symbol(u222B), but they cannot be shown on Unix(be shown as a square box). On the contray, some Unicode can be shown on Unix, such as the Middle dot(u00B7), but cannot be shown on Pc. I don't know why, how can I solve this problem? -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Please help: problem with Netscape 6.x
Please reply directly to [EMAIL PROTECTED] Thanks. Magda -Original Message- Date/Time:Fri May 24 04:02:24 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- I'm developing an internet application for user to input multi-lingual data. It works fine using internet explorer 5.x and above. However, it does not work for Netscape even for version 6.x. 1. All my web pages are set the charset to UTF-8. head meta http-equiv=Content-Type content=text/html; charset=utf-8 ... /head 2. My database is set to accept UTF-8 or Unicode characters. 3. My browser encoding selects UTF-8. IE 5.x: View Encoding Auto-Select or Unicode (UTF-8). Netscape 6.x: View Character Coding Auto-Detect Auto-Detect (All) or Unicode (UTF-8). Problem description (in netscape): When I retrieved data (in format such as #x3B1; - my non-english data is stored in the database in this form) from my database and displayed in a page - they displayed correctly. I proceeded to edit some fields, leaving other fields as they were (even for multi-lingual data). When I posted the data, and navigate to the next page, the non-english fields appeared as '?'. I'd spent days trying to figure out the problem, but to no avail. I'm really at my wit- end and really need any help you can offer. I'd got to know your email address while surfing the net for solution and saw your articles on the net. Hope that you really can help. Thanks, Geok Hu -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
u0641 - Arabic Letter FEH
-Original Message- Date/Time:Thu May 16 08:16:41 EDT 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- Dear Sir, I am involved in developing a multi lingual package. I am using english and arabic languages in my package. I am using MS Access as database. I am using unicode key mapping for input of Arabic characters. I am using Arial Font. To store arabic character I am encoding all arabic characters into UTF-8 format and storing it in the database. While retrieving the data I am decoding it and displaying. It works properly for all characters except \u0641 - Arabic Letter FEH Could you please help me in solving this problem. Awaiting your reply, Thanks Martin Sunder Singh D.S. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
RE: Useful Resources
Rajesh, The correct links are now posted at http://www.unicode.org/unicode/onlinedat/resources.html Magda. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] Sent: Monday, May 13, 2002 3:10 AM To: Markus Scherer; Magda Danish (Unicode) Cc: unicode Subject: Re: Useful Resources - Another round of spring cleaning Magda Danish (Unicode) wrote: - Akkadian http://saturn.sron.ruu.nl/~jheise/akkadian/index.html http://www.sron.nl/~jheise/akkadian/ - Multilingual Project Gutenberg http://www.informatik.uni-hamburg.de/gutenb http://www.sharat.co.il/pg/ - USMARC to UNIVERSAL CHARACTER SET MAPPINGS http://lcweb.loc.gov/marc/marc2ucs.html Above link doesn't work. I am curious to visit the site. rajesh
RE: Rendering Problem in indian languages
You should read our FAQ page on Indic languages and scripts at http://www.unicode.org/unicode/faq/indic.html Magda Danish Administrative Director The Unicode Consortium 650-693-3921 -Original Message- From: P.S.L.Sarada Devi [mailto:[EMAIL PROTECTED]] Sent: Monday, May 06, 2002 4:09 AM To: [EMAIL PROTECTED] Subject: Rendering Problem in indian languages Dear Sir/Madam, I am presently working on Internationalization. In our project, i should display html pages in indian languages like Hindi and Telugu. Our main problem is for telugu language. The characters r not rendered properly n there are no unicode range for half chars. What is that i have to do to improve rendering? And one more thing i want is : are there any tools for storing unicodes for different languages in database?(instead of doing manually) Please mail me as early as possible. Thank u Sir/Madam. Regards, Sarada.
RE: Copyright of the generated image..
Dan the Encode Maintainer asked: I would like to use the gif-rendered glyphs that are available via http://www.unicode.org/cgi-bin/refglyph. But is the data copyrighted? If so to whom should I ask? People address questions about copyright/trademark issues to the Unicode office. See http://www.unicode.org/unicode/contacts.html Meanwhile, the answer to your question is: Yes this data is copyrighted. I will follow up in more details offline. Best Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921
Browser support
-Original Message- Date/Time:Fri Mar 15 12:22:10 EST 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- I am trying to establish text style guides for new web site and wish to avoid awkward HTML fixes such as three hyphens for an em dash. I find many recomendations not to use unicode characters for entities like em dashes trademark symbols because there is poor browser support. In my testing and research it looks like there is good browser support for em dash. I would like it to render correctly in IE and Netscape 4+ on Macs and Windows machines. Do you know of a chart for browser support of unicode by browser version. thanks, Stuart Somer -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
MS Command Prompt
I have MS Windows NT 4 installed with Service Pack 6a on several PCs. The keyboard is set to English (United States). Within all 32-bit applications ALT-0248 ø is working fine. However, within a MS Command Prompt the above ALT does not work and I get a o instead. The keyb in MS DOS is set to us 437. This means that the ALT-0248 does not work in 16-bit applications. Any help would be much appreciated. Indie Toor [EMAIL PROTECTED] NT Desktop Support European Central Bank -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- (End of Report)
Are UTF-8 and UTF-16 compatible with...
Hello, I'm a software designer, and I'm working in an application that must interchange information between diferent systems. With this pourpose, I'm using XML documents. The problem is that I'm not very sure about the character encoding most adecuate. I need to know: 1.- Are UTF-8 and UTF-16 compatible with Java, Windows NT, W2000, W95, W98 and UNIX? 2.- Is ASCII also compatible with all of them? Thank you, very much, Laura. [EMAIL PROTECTED]
General question: unicode verification test script
-Original Message- Date/Time:Thu Feb 28 18:27:50 EST 2002 Contact: [EMAIL PROTECTED] Report Type: General question -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- Magda: Are you still at the Consortium? If you are I have a quick question. Is there a unicode verification test script that has been produced based on code pages? Thanks and hope all is well. Gary. -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --
FW: list of abbreviated character names
Can someone on the list help me respond to this query. Thanks. Magda. -Original Message- From: Kevin Brown [EMAIL PROTECTED] Date: 2002-02-18 21:05:52 -0800 To: [EMAIL PROTECTED] Subject: list of abbreviated character names Hello I work professionally with type and am constantly using your website for reference. I am desperately looking for a list which gives NOT the full character description given in the Names List (eg LATIN CAPITAL S WITH CARON) but the abbreviated name (eg Scaron) This extremely useful shorthand method of naming the glyphs is used in font-editing applications such as Fontographer and FontLab. Does such an official list exist? If so, can you please direct me to where I can download it? Many thanks Regards Kevin Brown * TYPOGRAPHY * GRAPHIC DESIGN * PRINTING 147 Magill Road, Stepney S.A. AUSTRALIA 5069 Phone/Fax: +61 (0)8 8362 8664 Email: [EMAIL PROTECTED] Est.1979 *
FW: politonik greek
-Original Message- From: george lingas [mailto:[EMAIL PROTECTED]] Sent: Tuesday, February 19, 2002 2:10 AM To: Magda Danish (Unicode) Subject: RE: unicode_help Dear Magda Danish, Thank you very much for your help, but i still cant find the unicode standart for symbols such as oxia perispomeni dasia upogegrameni etc... in order to write polytonic text under windows 2000 or xp. thank you again. George lyngkas student of graphic Arts --- Magda Danish (Unicode) [EMAIL PROTECTED] wrote: George, Have you looked at the code charts on our website at http://www.unicode.org/charts/PDF/U0370.pdf I hope this is helpful. Best Regards, Magda Danish Administrative Director The Unicode Consortium 650-693-3921 From: george lingas [EMAIL PROTECTED] Date: 2002-02-13 16:00:54 -0800 To: [EMAIL PROTECTED] Subject: unicode_help dear sir or madam, I am a student from the technical institude of athens, department of graphic arts. I would like to sent me a map in order to find the correct unicode position of each politonik greek character. I want this information because i want to make a politonik font for the windows xp platform. I would be very gratefull if you can help me. thank you!! George Lygkas __ Do You Yahoo!? Send FREE Valentine eCards with Yahoo! Greetings! http://greetings.yahoo.com __ Do You Yahoo!? Yahoo! Sports - Coverage of the 2002 Olympic Games http://sports.yahoo.com
FW: 16 bit unicode
From: [EMAIL PROTECTED] Date: 2002-02-14 05:26:42 -0800 To: [EMAIL PROTECTED] Subject: 16 bit unicode Hello, I am presently investigating the use of 16bit unicode within Oracle9i and Access 2000 On searching the web I have not yet been able to successfully locate driver that supports 16 bit unicode. Do you know of any sites or have any information of drivers for 16 bit unicode for Oracle9i ? Regards Brian Hooker
FW: Yoruba characters
-Original Message- From: Rick McGowan [mailto:[EMAIL PROTECTED]] Sent: Saturday, February 16, 2002 12:22 PM To: [EMAIL PROTECTED] Subject: Yoruba characters Begin forwarded message: From: Omotola A. Awofolu [EMAIL PROTECTED] Date: 2002-02-16 11:42:03 -0800 To: [EMAIL PROTECTED] Subject: Yoruba characters I am new to using Unicode and I am trying to incorporate the Unicode characters for Yoruba inside of my JAVA and XML code. I am having two main problems: - I do not know how to use diacritics in either JAVA or XML, and for coding with Yoruba I need to be able to use diacritics for some of the characters - I do not know which encoding would be appropriate for setting my code -Programmer
FW: Using Unicode Characters in ASCII Streams
Begin forwarded message: From: [EMAIL PROTECTED] Date: 2002-02-05 10:44:20 -0800 To: [EMAIL PROTECTED] Subject: Using Unicode Characters in ASCII Streams Hallo, we are a manufacturer of time and attendance terminals which are transfering data using 8-Bit character streams (ASCII + Latin 1 which is a subset of Unicode: to 00FF). Because of the history and the future compatibilty it is not possible to change to 16-Bit characters used with Unicode. But we want to use European Latin A (0100 to 0170) and Extended Latin B (0180 to 01FF) expansions for east Europian countries. So it must be possible to add any Unicode character to the 8-Bit streams. Now we are looking for a standard method to do so. Now here is my question: Is there a method to add any Unicode character to a 8-Bit ASCII stream? Example: The Polish word wyjs´cie with character Latin Small Letter s with Acute (015B) in the middle (s´ is one character) We want to transfer something like this: wyj\u015Bcie; here we use \u015B like coding in Java (our standard development language) Now it is possible to decode the two characters \u and the following four characters as one Unicode character. But we know this is not a standard! It is our intention to use use a standard way. If there is a standard maybe defined by Unicode Organisation it would be nice to hear about it. If this is the wrong contact please forward this request. Thank you. Regards, Reiner Deking Kaba Benzing GmbH, Germany
FW: U.S.Government help
Begin forwarded message: From: sjbaker [EMAIL PROTECTED] Date: 2002-02-04 22:54:10 -0800 To: [EMAIL PROTECTED] Subject: U.S.Government help Is there anyone on the site that can answer a windows 2000 japanese font question for me ? I am an employee of the U.S.Government. You may respond to [EMAIL PROTECTED] We have an app that works great using arial ms unicode with almost every language there is except japanese is the core of the issue.
FW: Database To Web Browser
-Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] Sent: Monday, February 04, 2002 12:19 AM To: [EMAIL PROTECTED] Subject: Database To Web Browser Hi I'm sure I'm missing something very simple here, I am trying to program a refugee website in multiple languages. The simplified scenario - Switch to Russian Keyboard Enter Russian Text into memo field Establish ODBC Connection via ASP website application Draw out memo field for display on Web Page Problem : The web page text appears as ?? it does not come out as Russian text. However if I export the same DB table as HTML it works - frustrating. I feel there is a simple answer to this - but cannot suss it out Thanks for any help. Jonathan The information in this email (and any attachment) may be for the intended recipient only. If you know you are not the intended recipient, please do not use or disclose the information in any way and please delete this email (and any attachment) from your system.
RE: Questions about Unicode history
Hi Marco, I am currently working on a few web pages that talk about the Unicode history. They are not publicly accessible yet but I'm sure they hold the answers to most of your questions. I will email you the temporary url in a separate email. Regards, Magda. -Original Message- From: Marco Cimarosti [mailto:[EMAIL PROTECTED]] Sent: Wednesday, January 30, 2002 9:29 AM To: [EMAIL PROTECTED] Subject: Questions about Unicode history Hallo. I am writing a short article about Unicode, and I realized that I don't know or I am not sure of many Unicode-related facts and dates that I would like to mention. I apologize for this is a huge list of questions (and I hope that they are not all in the FAQ). Anyway, if anybody is in the mood for trivia, I thank you in advance: - When did the Unicode project start, and who started it? - Is it true Han Unification was the core of Unicode, and the idea of an universal encoding come afterwards? - Who and when invented the name Unicode? - When did the ISO 10646 project start? - When did Unicode and ISO 10646 merge? - What is the name of the GB and JIS standards that have the same repertoire as Unicode? - When did Unicode stop to be 16 bits? (I.e., when were surrogates added?) - I can't remember the version when some scripts were added: Syriac, Thaana, Sinhala, Tibetan, Myanmar, Ethiopic, Cherokee, Canadian Syllabics, Ogham, Runes, Khmer, Mongolian, Yi, Etruscan, Gothic, Deseret, CJK ext. A, CJK ext. B. - Roughly, how many ideographs are in modern use in extensions A and B? - Roughly, when will version 3.2 become official? - Roughly, when will the version 4 book be published? I also have a few non-Unicode questions: - When was ASCII first published and by whom? - What standard was current before ASCII? (BAUDOT, is it?) How many bits did it use? - Did the ASCII standard expire, and when? - When was ISO 646 published? - I think that ISO 646 expired. When? - When was ISO 8859 published? - When did the first double-byte encoding appear? - Are OpenType fonts currently implemented in any platform other than Windows? Thanks again, in advance. _ Marco
FW: Programmatic control of character set
-Original Message- From: Tomosky, Thomas [mailto:[EMAIL PROTECTED]] Sent: Tue 1/29/2002 7:47 AM To: '[EMAIL PROTECTED]' Cc: Subject: Programmatic control of character set I looked through your web site, but could not find an answer to this question: On our web site, can I force the visitor's character set to be UTF-8? Unless their browser is set up properly, they can't view the site properly. As an alternative to providing instructions to them on how to do this, I would like to programmatically set the character encoding to UTF-8. How can I do that? Thanks, Thomas K. Tomosky, Ph.D. Senior Software Engineer FreeMarkets Inc. FreeMarkets Center 210 6th Avenue -- 21st floor Pittsburgh, PA 15222 FreeMarkets Office phone: 412-297-8311 Home Office phone: 724-537-6492 [EMAIL PROTECTED] http://www.freemarkets.com
FW: Unicode Tibetan
-Original Message- From: Robin Sackmann [mailto:[EMAIL PROTECTED]] Sent: Tuesday, January 22, 2002 4:24 AM To: [EMAIL PROTECTED] Subject: Unicode Tibetan Dear Sirs, when I try to use the Tibetan characters contained in the Arial Unicode MS font in Microsoft Word 2002, the vowel signs do not appear above (or below) the base character, as they should, but in the next character position. Example: the base character BA (0F56) cannot be combined with the vowel sign O (0F7C) in the proper way. (The result should actually be a single glyph representing the syllable BO.) I would be grateful for any information on how this problem can be solved. With many thanks and best regards, Robin Sackmann ___ Robin Sackmann, M.A. Research and Teaching Assistant FREIE UNIVERSITAET BERLIN Berlin, Germany [EMAIL PROTECTED] office +49-30-838-51373 secretary +49-30-838-54011 home +49-30-621 46 49 www.germanistik.fu-berlin.de/il ___
FW: Running out of options...
-Original Message- From: Whit Gurley [mailto:[EMAIL PROTECTED]] Sent: Wednesday, January 23, 2002 8:47 AM To: [EMAIL PROTECTED] Subject: Running out of options... Hello! [...] Now, I understand how ASCII text encoding works and therefore have a pretty good understanding of how Unicode builds upon that concept in order to create additional, multi-lingual character sets, but I am having the worst time trying to figure out how to make it actually happen. I realize that it may have to do with the fact that I'm on a Mac, but the unicode.org site seems to think that the Mac can be made Unicode-ready with system software. I've tried following your installation instructions without success - I still can't see Farsi text on your Weblog sites and the sample code that I download is still a series of null characters (?) in my text editor (BBEdit 6.5). [...] So I guess I have two questions: 1. Do you know if it's possible for me, on a Mac with OS 9 and BBEdit, to create a web setup that would allow my boss to add Unicode Farsi content? What tools would he need to create that content? 2. [...] I think that's it. Thank you very much for your time! _ w h i t g u r l e y Art Director, Softeon, Inc. p :: 415 948 4028 e :: [EMAIL PROTECTED] w :: http://www.softeon.com
FW: Please help me
Title: Message -Original Message-From: King of kids [mailto:[EMAIL PROTECTED]] Sent: Saturday, January 19, 2002 1:55 AMTo: [EMAIL PROTECTED]Subject: Please help me Dear Sir/Madam, Recently, I have heard of that all the Uighur (also called Uyghur, which is more standard in Uyghur Langauage) language letters are already in theUnicode Standard 3.1. I have seen all the Uyghur letters in: 1. http://www.unicode.org/charts/PDF/U0600.pdf 2. http://www.unicode.org/charts/PDF/U0600.pdf 3. http://www.unicode.org/charts/PDF/UFE70.pdf But, I could not find some of them within any font sets of Windows98/XP/2000. Could you tell me where can I find a font set (ex:Like Lucida Sans Unicode) in which I can find The Unicode Standard 3.1's Uyghur letters?(A font that contains all codes points within The Unicode Standard 3.1.) Regards, An Uyghur in Xinjiang Uyghur's Autonomous Region, PRC 99' Graduage Student Computer Department, Xinjiang University Waris Abdukerim * I would like to remind you that some Uighur(Uyghur) letters were not available in The Unicode Standard 3.0, but I found all of them in The Unicode Standard 3.1. Thanks very much.
FW: Require Information
Title: Message -Original Message-From: Mohammed Shakeel [mailto:[EMAIL PROTECTED]] Sent: Thursday, January 17, 2002 4:36 AMTo: [EMAIL PROTECTED]Subject: Require Information sir/madam we are development firm who require to build a site that is to display information in multiple languages. (for ex: english, german and french). After doing some study and analysis we have come to know that using of unicode with SQL Server 2000 (has inbuild support for unicode) and ASP we can accomplish our requirement. What we required as information or clarification is that, is it required that we have to store the information to be displayed in theall the 3 languages or can we store the information in one language and use the same data for display in the different languages by using unicode conventions and supportive parameters like CODEPAGE directive etc. Your help in this regards will be highly appreciated. Thanking you. regards Shakeel
FW: Unicode entities and character representation
-Original Message- From: Philip Knoglinger [mailto:[EMAIL PROTECTED]] Sent: Friday, January 11, 2002 1:08 AM To: [EMAIL PROTECTED] Subject: Unicode entities and character representation Hi, I'm looking for a list of all unicode characters with SGML entitie names, character representation and unicode number if possible. Can you help me? Best regards, Phil Knoglinger
FW: glyph bitmaps
-Original Message- From: Blackburn Andrew SLUK [mailto:[EMAIL PROTECTED]] Sent: Thursday, January 03, 2002 5:27 AM To: '[EMAIL PROTECTED]' Subject: glyph bitmaps Dear Sirs I am looking for a supplier of LCD character/glyph bitmaps for the unicode character set. Ideally I am looking for a choice of character sizes as well The target will be a liquid crystal display driven by an embedded microcontroller, so as you can see, sacaleable fonts are not really suitable. I am looking for data in a format similar to the following (8 bit wide by 9 rows): /* 0x20 */ /* 0 */ 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, /* space 32*/ /* 1 */ 0x00, 0x08, 0x08, 0x08, 0x08, 0x00, 0x00, 0x08, 0x00, /* ! 33*/ /* 2 */ 0x00, 0x14, 0x14, 0x14, 0x00, 0x00, 0x00, 0x00, 0x00, /* 34*/ /* 3 */ 0x00, 0x14, 0x14, 0x3E, 0x14, 0x3E, 0x14, 0x14, 0x00, /* # 35*/ /* 4 */ 0x00, 0x08, 0x1E, 0x28, 0x1C, 0x0A, 0x3C, 0x08, 0x00, /* $ 36*/ I would appreciate any advice you may be able to offer in this matter. Yours Andrew N. Blackburn Safeline Ltd. Tel +44 (0)161 8488636 Fax +44 (0)161 8488495 [EMAIL PROTECTED] http://www.safeline.co.uk
FW: Unicode fonts
Title: Message -Original Message-From: Per Eriksson [mailto:[EMAIL PROTECTED]] Sent: Wednesday, January 02, 2002 4:58 AMTo: [EMAIL PROTECTED]Subject: Unicode fonts Could you please mention a Unicode font that comes in not only regular, but also bold, italic and bold italic styles? I already know of one: Times New Roman.Unfortunately, for most other Unicode fonts Ive seen the word processor has to producebold and italic looks by for instance leaning the font in the case of the italic style Im looking for a font that preferably consists of four different files for the four different styles. Do you know of any? Per Eriksson [EMAIL PROTECTED]
FW: Mapping ISO-IR-159 (JIS X 0212:1990) to Unicode
-Original Message- From: Folkert Tijdens [mailto:[EMAIL PROTECTED]] Sent: Wednesday, December 12, 2001 7:58 AM To: '[EMAIL PROTECTED]' Cc: Dennis Lardenoye Subject: Mapping ISO-IR-159 (JIS X 0212:1990) to Unicode Dear Unicode Consortium, In order to correctly implement a medical system we need to convert textual information such as the patient name from a medical DICOM file, coded as ISO-IR-159 (JIS X 0212:1990) into Unicode on a Microsoft Windows-NT platform. We are searching for an authoritative mapping table, in order to correctly map these two character sets onto each other. Can you help us to obtain this table? Thanks, Folkert Tijdens Folkert Tijdens, Project Manager Pie Medical Imaging, Becanusstraat 13D01 6216 BX Maastricht, The Netherlands phone: +31(43)3281325 fax: +31(43)3281329 mailto: [EMAIL PROTECTED]
FW: Question about some MS IE options
Title: Message -Original Message-From: Robert M. Gerlach [mailto:[EMAIL PROTECTED]] Sent: Monday, December 03, 2001 3:24 PMTo: [EMAIL PROTECTED]Subject: Question Hi, When saving a webpage from within Microsoft Internet Explorer, there are a few notable options... and I'm really unsure as to what the differences are, which is "better," etc. I know you're not Microsoft or technical support fTM, but I'm betting that you guys would know better thantheywould[...] Here they are: Unicode Unicode (UTF-8) Western European (ISO) Western European (Windows) Thanks a million! -Rob :)
UNICODE ISO 10646 on Palm O/S 3.5
Title: Message -Original Message-From: Giant Ho [mailto:[EMAIL PROTECTED]] Sent: Thursday, November 01, 2001 9:24 PMTo: [EMAIL PROTECTED]Subject: Fw: Hi Gentlmen/Madams, Would you mind telling me if there are any operation system that apply UNICODE ISO 10646 on Palm O/S 3.5? We are having problems of chinese character display on Palm PDA. pls help thanks Giant HoAccount ManagerSchmidt Co., (H.K.) Ltd.Unit 1101, 11/F,Oxford House, Taikoo Place,979 King`s Raod, Quarry Bay,Hong Kong.Tel : 2620 8311Fax : 2507 0921E-mail : [EMAIL PROTECTED]