> > Is "é" a "character"?
> 
> Yes.  And it is a different character from "e", "ê" or "è".  

Obviously, but the character "é" can be represented by the single Unicode 
codepoint U+00E9 (aka LATIN SMALL LETTER E WITH ACUTE) or by the sequence 
U+0065 U+0301 (aka, LATIN SMALL LETTER E, COMBINING ACUTE ACCENT). My point 
with this example was to show that one "character" may require more than one 
codepoint to represent. You could store "é" in a CHAR(1) if you used the 
representation U+00E9, however you could not store it as U+0065 U+0301.

The point is that saying "n" represents a count of "characters" is misleading, 

> > What I mean is, even if you changed the connection string character
> > set to "UTF8", 0x65 0x00 0x00 0x00 represents four UTF-8 characters
> > (that is, U+0065 U+0000 U+0000 U+0000).
> 
> According to what, I wonder?  

According to the definition UTF-8. See http://en.wikipedia.org/wiki/UTF-8 if 
you need a primer.

Dean.




-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Firebird-net-provider mailing list
Firebird-net-provider@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/firebird-net-provider

Reply via email to