Re: [GENERAL] text column constraint, newbie question

Craig Ringer Mon, 23 Mar 2009 16:55:53 -0700

RebeccaJ wrote:

And I wonder why you like SQL_ASCII better than UTF8, and whether
others have any opinions about those two. (My web server's LC_CTYPE is
C, so I can use any character set.) Wouldn't UTF8 allow more
characters than SQL_ASCII?

I've had a LOT of experience dealing with apps that use 8-bit bytestrings (like SQL_ASCII `text') to store data, and I've rarely seen onethat *doesn't* have text encoding handling bugs.

If you store your text as byte streams that don't know, check, orenforce their own encoding you must keep track of the encodingseparately - either with another value stored alongside the string, orthrough your app logic.

If you start storing data with multiple different text encodings in theDB, you're most likely to land up tracking down annoying "corrupt text"bugs sooner or later.

If, on the other hand, you use UTF-8, you *know* that everything in thedatabase is well-formed UTF-8. You ensure that it is UTF-8 beforestoring it in the DB and know it'll be UTF-8 coming out. The DB takescare of encoding conversion for you if you ask it to, by settingclient_encoding - the only downside being that it'll refuse to returnstrings that can't be represented in your current client_encoding, likesay Cyrillic (Russian etc) text if you're using ISO-8859-1 (latin-1) foryour client encoding.

Even with a UTF-8 database you must still get your I/O to/from librariesand the rest of the system right, converting UTF-8 text to whatever thesystem expects or vice versa. Alternately, if you set client_encoding,you must be prepared for cases where the DB can't send you what you askfor because your encoding can't represent it.

All in all, I personally think a UTF-8 database is considerably betterfor most uses. There are certainly cases where I'd use SQL_ASCII, butnot most.


--
Craig Ringer

--
Sent via pgsql-general mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-general

Re: [GENERAL] text column constraint, newbie question

Reply via email to