On Thursday 18 April 2013 10:00:39 Graeme Geldenhuys wrote:
> On 2013-04-17 19:58, Ivanko B wrote:
> > For StrPos in massive Russian text, UTF-8 is approximately 9 times
> > slower than UTF-16/32. That's a lot slower, and without any compensation.
>
> Could I get a copy of the text and test code?
>
The test counts the occurrences of a known, constant Russian character in a
random string. In UTF-16 and UCS-4 this is a numeric comparison at each
string index; in UTF-8 it is a strpos() operation, hence the big difference.
But as the "UTF-8 everywhere" advocates write, nobody needs this in real
code. ;-)
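
To make the two approaches concrete, here is a minimal sketch in Python. It
is not the original test code (that was not posted in this thread); the
function names and the sample string are assumptions chosen for illustration.
The first function compares fixed-width 16-bit code units one index at a
time, the second repeatedly searches for the multi-byte UTF-8 sequence, which
is the strpos()-style approach.

```python
import struct


def count_utf16(text: str, ch: str) -> int:
    """Count ch by comparing fixed-width 16-bit code units, one per index."""
    data = text.encode("utf-16-le")
    units = struct.unpack(f"<{len(data) // 2}H", data)
    target = ord(ch)  # assumes ch is in the BMP, which holds for Russian letters
    return sum(1 for unit in units if unit == target)


def count_utf8(text: str, ch: str) -> int:
    """Count ch by repeated substring search over the UTF-8 bytes."""
    data = text.encode("utf-8")
    needle = ch.encode("utf-8")  # a Cyrillic letter encodes to two bytes
    count = 0
    pos = data.find(needle)
    while pos != -1:
        count += 1
        # UTF-8 is self-synchronizing, so a full match is always aligned
        # on a character boundary and we can skip past it.
        pos = data.find(needle, pos + len(needle))
    return count


sample = "Привет, мир! привет"  # hypothetical sample text
print(count_utf16(sample, "и"), count_utf8(sample, "и"))
```

Both functions return the same count; the sketch only shows the difference
in the kind of work done per character and says nothing about the timing
ratio quoted above, which depends on the Pascal implementation being tested.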

Martin

_______________________________________________
mseide-msegui-talk mailing list
mseide-msegui-talk@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mseide-msegui-talk
