[GENERAL] encoding aliases

Vivek Khera Wed, 15 Mar 2006 08:34:19 -0800

We're developing a DB that will be storing email messages. The clear winner for the DB encoding is UTF8. However, I will need to set the proper client encoding based on the encoding as defined in the email message.

Looking at the docs (http://www.postgresql.org/docs/8.1/static/multibyte.html), there are many encodings that I can use for the client. However, they do not match the canonical names used in email. For example, WINDOWS-1252 is accepted, presumably as an alias for WIN1252, though it is not listed as an alias. The commentary in utils/mb/encnames.c indicates that the dashes are irrelevant, so we know ISO-8859-1 and ISO88591 are equivalent.
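
To illustrate the dash-insensitive matching described above, here is a minimal Python sketch. It assumes the comparison simply discards non-alphanumeric characters and ignores case; the function name is hypothetical, and this is not the code in encnames.c.

```python
def normalize_encoding_name(name: str) -> str:
    """Keep only ASCII letters and digits, lowercased (assumed rule)."""
    return "".join(ch.lower() for ch in name if ch.isascii() and ch.isalnum())


def same_encoding_name(a: str, b: str) -> bool:
    """True when two spellings collapse to the same normalized key."""
    return normalize_encoding_name(a) == normalize_encoding_name(b)
```

Under that assumption, ISO-8859-1 and ISO88591 collapse to the same key, while WINDOWS-1252 and WIN1252 do not, which is why the latter pair would need an explicit alias entry.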

I've only tried a handful of encoding values found in email so far, but the only one that is not accepted is US-ASCII.

My only concern is whether a name like WINDOWS-1252 really is an alias for WIN1252. What would make this 100% clear is if "SHOW client_encoding" would report the canonical name rather than the name passed to it. The source shows that it is an alias, but the docs do not.

So, is it fair to assume that the longer form names are safe to use (i.e., should I submit a doc patch)?


And does it make sense to make US-ASCII an alias for SQL-ASCII?
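
Until such an alias exists, a client-side workaround could be sketched as below. The override table and the function name are hypothetical, and treating US-ASCII as SQL_ASCII is exactly the assumption being asked about here, not confirmed behavior.

```python
# Hypothetical table: email charset names the server rejects, mapped to a
# name it accepts. The US-ASCII entry is an assumption, not documented.
CHARSET_OVERRIDES = {"US-ASCII": "SQL_ASCII"}


def client_encoding_for(charset: str) -> str:
    """Return the name to pass to SET client_encoding for an email charset."""
    name = charset.strip().upper()
    # Names without an override are passed through unchanged (uppercased).
    return CHARSET_OVERRIDES.get(name, name)
```

Names that the server already accepts, such as WINDOWS-1252, pass through untouched, so only the rejected name is rewritten.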


