Peter Eisentraut <pete...@gmx.net> writes:
> On Sat, 2009-10-03 at 11:21 -0400, Tom Lane wrote:
>> Where exactly are you expecting the latter string to get
>> substituted in?

> The way I understand it, the \uxxxx comes from psql, mbprint.c.  So this
> may depend on exactly what locale psql, as run by pg_regress, thinks it
> is in.

[ looks at psql code ... ]  Ah, I think actually the key question is
what the client_encoding is.  It looks to me like the \u0080 is only
likely to come out if psql is working in utf8 encoding.  In particular,
in LATIN2 it is *guaranteed* to think that 0x80 is a displayable
character, because wchar.c will tell it so (look at pg_latin1_dsplen).
So plpython_unicode.out is in fact assuming UTF8 encoding is used.

The results from the _moth buildfarm machines suggest that the
prevailing locale is something Windows-ish, or maybe that's just an
artifact introduced somewhere between the actual test and the web page.

I am inclined to think that we should add another expected-file
showing the single-byte \200 result.  What that might get displayed
as on the local system isn't really our concern.

Alternatively, maybe we should change pg_latin1_dsplen so that it
reports 0x80-0x9F as control characters; but that would have
consequences far beyond this one regression test.

                        regards, tom lane

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Reply via email to