On 2013-04-17 18:46, Ivanko B wrote:
> AFAIK, it still needs B(ytes)O(rder)M(arker) - for proper placing 2+
> bytes of characters.

No it doesn't! The BOM in UTF-8 is just to indicate that the text is
UTF-8, and not some other encoding. All that the BOM marker is used for
is to tell editors that the text is encoded in UTF-8, nothing to do with
Endianess.

UTF-16 and UTF-32 needs the BOM marker, because both of these can use BE
or LE.

  http://www.unicode.org/faq/utf_bom.html#gen7

I recommend you read the whole page though.

> But not for string manipulations & drawing, sure too.

String manimulation is normally carried out in a front-to-back order.
Iterating oven code points in such a way with UTF-8 encoded text is fast.

Just remember that not all Unicode implementations are equally good. If
you write sloppy code, it will be slow, no matter what Unicode encoding
is used.


Regards,
  - Graeme -

-- 
fpGUI Toolkit - a cross-platform GUI toolkit using Free Pascal
http://fpgui.sourceforge.net/


------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
mseide-msegui-talk mailing list
mseide-msegui-talk@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/mseide-msegui-talk

Reply via email to