Re: [HACKERS] invalidly encoded strings

Andrew Dunstan Sun, 09 Sep 2007 10:19:15 -0700


Tom Lane wrote:

Andrew Dunstan <[EMAIL PROTECTED]> writes:

Is that going to cover data coming in via COPY? and parameters forprepared statements?


Those should be checked already --- if not, the right fix is still to
fix it there, not in per-datatype code.  I think we are OK though,
eg see "need_transcoding" logic in copy.c.



Well, a little experimentation shows that we currently are not OK:

in foo.data:
\366\66


utf8test=# \copy xx from foo.data
utf8test=# select encode(t::bytea,'hex') from xx;
encode
--------
f636
(1 row)

utf8test=# \copy xx to bb.data
utf8test=# \copy xx from bb.data
ERROR:  invalid byte sequence for encoding "UTF8": 0xf636

HINT: This error can also happen if the byte sequence does not matchthe encoding expected by the server, which is controlled by"client_encoding".

CONTEXT:  COPY xx, line 1
utf8test=#

BTW, all the foo_recv functions that call pq_getmsgtext orpq_getmsgstring are thereby calling pg_verify_mbstr already (viapg_client_to_server). So I am still not 100% convinced that doing thesame directly in the corresponding foo_in functions is such a bad idea.


cheers

andrew


---------------------------(end of broadcast)---------------------------
TIP 7: You can help support the PostgreSQL project by donating at

               http://www.postgresql.org/about/donate

Re: [HACKERS] invalidly encoded strings

Reply via email to