On Thu, Sep 26, 2002 at 02:10:04AM +0100, Wez Furlong wrote: > All: > > I've just committed a php-style version of the ucdata package that Stig > directed me to.
Great! > Stig: > Rather than generate binary data files at configure time, based on > a bundled UnicodeData.txt file which is quite large, causes problems > for win32 builds, and has run-time thread safety and data file location > issues (for freshly built but not installed php binaries), I settled on > having ucgendat generate a header file with the ctype and case data tables > declared within it. > All that is needed is to add these files to the build and voila! it works :-) Yes, I think that's a good solution. The tables are very stable. Could you also commit the source for your modified ucgendat? Or is it there somewhere? > There are some interesting functions available in the ucdata package; some > of them might benefit mbstring, so perhaps it is worth a look? I think Unicode normalization should be useful to people. If you want to do matching (looking for "equal" Unicode strings), you really need this. But I haven't really seen anyone ask for this yet. So far I have avoided this problem since I've been using OpenLDAP to store data, and it does the necessary normalization and matching for me, if I stored Unicode in another database... If for instance you use some SQL database, you should normalize all data before storing it, and also normalize strings in queries. Stig -- PHP Development Mailing List <http://www.php.net/> To unsubscribe, visit: http://www.php.net/unsub.php