This is going to cause some fun.

I wonder, in a case like this, which of the two scripts takes precedence?

- C

On 25/11/2010, Shriramana Sharma <[email protected]> wrote:

> Hello. Here's a Telugu vs Kannada confusables list I cooked up right
> now. As this is an important security issue, I post to all the lists
> so that people may contribute. Also, some of this is probably already
> there but I'm going for completeness:

> ANUSVARA
> VISARGA
> LETTER A
...
<SNIP>
...
> DIGIT NINE

> That makes sixty two characters in all including the ones marked ?.
> Even *without* the ones marked ? (which I did because I suspected
> others may contest these cases) it comes to fifty two.

> Now to count the characters NOT common or confusable (obviously much
> lesser):
>
> LETTER U
> LETTER UU
...
<SNIP>
...
> DIGIT SEVEN

> That comes to sixteen.

> I left out the LLLA of Kannada and the fractions of Telugu which are
> *not* present (as of Unicode 6.0) in the other script because
> obviously there can be no comparison on those.

> So there are at least *thrice* (or at most *four times*) as many
> confusable characters between Kannada and Telugu than there are
> NON-confusables.

> Now can you beat that! Speaking of scripts with a common origin and
> causing potential confusion in IDNs, *I* say Kannada and Telugu takes
> the cake!

> Shriramana Sharma.

Reply via email to