On Thursday 18 April 2013 10:00:39 Graeme Geldenhuys wrote: > On 2013-04-17 19:58, Ivanko B wrote: > > For StrPos in massive russian text, UTF-8 is approx 9 times as slower > > than UTF16/32. That's a lot-lot slower & without any compensation. > > Could I get a copy of the text and test code? > It counts the number of a known constant Russian character in a random string. In utf-16 and UCS4 this is an operation with numbers and string index, in utf-8 it is a strpos() operation, thus the big difference. But as the "utf-8 everywhere" advocates write, nobody needs this in real code. ;-)
Martin ------------------------------------------------------------------------------ Precog is a next-generation analytics platform capable of advanced analytics on semi-structured data. The platform includes APIs for building apps and a phenomenal toolset for data science. Developers can use our toolset for easy data analysis & visualization. Get a free account! http://www2.precog.com/precogplatform/slashdotnewsletter _______________________________________________ mseide-msegui-talk mailing list mseide-msegui-talk@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/mseide-msegui-talk