Dear Pat, I have several questions here.
1. where do you maintain an ASCII list of your language tags? Should it not be supported on the IANA server and common to all the gTLDs?
2. is there a list of the permitted UNICODEs codes per languages? For example I am interested in the French and Ukrainian sets.
3. did you decide them by yourself, or did you gather a group of lingual authorities to assist you. This would be very interesting.
4. would there not be a way to register IDN in using their "xn--" version? It would simplify international management by resellers?
Thank you for your assistance.
At 20:30 15/02/2005, Kane, Pat wrote:
VeriSign does prevent domains with the Russian language tag from commingling A-Z with the Cyrillic characters. It does permit 0-9 and the dash to be used. This filter also applies to other Cyrillic based languages such as Belarusian, Ukrainian, Serbian, Macedonian and Bulgarian.
There are other languages that are listed within ISO 639-2 that today use a combination of Latin and Cyrillic as they were originally Latin based (Tajik was Arabic prior to being Latin based), migrated to Cyrillic during the Soviet era and today are migrating back to Latin. It is common to use Latin and Cyrillic characters in Tajik, from what I understand not being a native speaker. Granted there are not a lot of registrations in com net that are Tajik, but this is just the point of an IDN.
Pat Kane
-----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of "Martin v. L�wis" Sent: Tuesday, February 15, 2005 2:02 PM To: tedd Cc: [email protected]; [EMAIL PROTECTED] Subject: Re: [idn] homograph attacks
tedd wrote: > You all knew this was going to happen. > > http://www.p&1072;ypal.com
Indeed. However, I am somewhat disheartened that this could happen. IMO, Verisign should have never have registered that domain - the registrar should have provided a language for the label, that language should have been "Russian" (or else &1072; should not have been allowed), and this combination of Cyrillic and Latin letters should not be allowed for the Russian language.
Regards, Martin
