ID:               36130
 Updated by:       [EMAIL PROTECTED]
 Reported By:      mbjr at mbjr dot hu
-Status:           Open
+Status:           Closed
 Bug Type:         Feature/Change Request
 Operating System: Linux
 PHP Version:      5.1.2
 New Comment:

There is PECL/transliterate for this already. See
http://pecl.php.net/translit and http://derickrethans.nl/translit.php
(You need the 'diacritical_remove' filter).


Previous Comments:
------------------------------------------------------------------------

[2006-01-23 08:20:37] mbjr at mbjr dot hu

Description:
------------
Although UTF-8 is becoming widely supported, many people in relevant
countries are placing search string w/o any accents and special
characters, as they got used to the old system.

The only way atm to produce accent-free string is manual strtr for in
every case when such character is found.

Reproduce code:
---------------
n/a

Expected result:
----------------
Árvíztűrő tükörfúrógép -> Arvizturo tukorfurogep

These all below should be converted to "o":

Ò = capital letter o with grave
Ó = capital letter o with acute
Ô = capital letter o with circumflex
Õ = capital letter o with tilde
Ö = capital letter o with diaeresis
Ō = capital letter o with macron
Ŏ = capital letter o with breve
Ő = capital letter o with double acute
Ơ = capital letter o with horn
Ǒ = capital letter o with caron
Ǫ = capital letter o with ogonek
Ǭ = capital letter o with ogonek and macron
Ȍ = capital letter o with double grave
Ȏ = capital letter o with inverted breve
Ȫ = capital letter o with diaeresis and macron
Ȭ = capital letter o with tilde and macron
Ȯ = capital letter o with dot above
Ȱ = capital letter o with dot above and macron
Ṍ = capital letter o with tilde and acute
Ṏ = capital letter o with tilde and diaeresis
Ṑ = capital letter o with macron and grave
Ṓ = capital letter o with macron and acute
Ọ = capital letter o with dot below
Ỏ = capital letter o with hook above
Ố = capital letter o with circumflex and acute
Ồ = capital letter o with circumflex and grave
Ổ = capital letter o with circumflex and hook above
Ỗ = capital letter o with circumflex and tilde
Ộ = capital letter o with circumflex and dot below
Ớ = capital letter o with horn and acute
Ờ = capital letter o with horn and grave
Ở = capital letter o with horn and hook above
Ỡ = capital letter o with horn and tilde
Ợ = capital letter o with horn and dot below

Those 34 pieces above are latin capital letters but there're another 34
pieces for their small case, which means in the extended latin script
set we have 68 matches for an "o".

Same applies to e,u,i,a

Actual result:
--------------
n/a


------------------------------------------------------------------------


-- 
Edit this bug report at http://bugs.php.net/?id=36130&edit=1

Reply via email to