On 2013-04-17 18:46, Ivanko B wrote: > AFAIK, it still needs B(ytes)O(rder)M(arker) - for proper placing 2+ > bytes of characters.
No it doesn't! The BOM in UTF-8 is just to indicate that the text is UTF-8, and not some other encoding. All that the BOM marker is used for is to tell editors that the text is encoded in UTF-8, nothing to do with Endianess. UTF-16 and UTF-32 needs the BOM marker, because both of these can use BE or LE. http://www.unicode.org/faq/utf_bom.html#gen7 I recommend you read the whole page though. > But not for string manipulations & drawing, sure too. String manimulation is normally carried out in a front-to-back order. Iterating oven code points in such a way with UTF-8 encoded text is fast. Just remember that not all Unicode implementations are equally good. If you write sloppy code, it will be slow, no matter what Unicode encoding is used. Regards, - Graeme - -- fpGUI Toolkit - a cross-platform GUI toolkit using Free Pascal http://fpgui.sourceforge.net/ ------------------------------------------------------------------------------ Precog is a next-generation analytics platform capable of advanced analytics on semi-structured data. The platform includes APIs for building apps and a phenomenal toolset for data science. Developers can use our toolset for easy data analysis & visualization. Get a free account! http://www2.precog.com/precogplatform/slashdotnewsletter _______________________________________________ mseide-msegui-talk mailing list mseide-msegui-talk@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/mseide-msegui-talk