Ben Madin wrote:

G'day all,

I have a set of boundaries, with the locations in Thai. As best as I can tell, the encoding is ISO-8559-11 (not an official encoding, but similar to other latin/non-latin encodings)

does that mean I'm stumped when it comes to importing it using shp2pgsql. I normally set -W UTF-8, but in this case I get utf8: Illegal byte sequence. (the db is utf-8)

If I try using -W WIN874 (the only option in the postgres manual that mentions Thai) I get utf8: iconv_open: Invalid argument

Is this a hopeless case - do I need to (can I?) edit the .dbf file to remove the columns with the Thai encoding and then type them all back in!? Is there another way?

cheers

Ben

Hi Ben,

AFAICT shp2pgsql always outputs UTF-8 encoded text whenever the input encoding is specified via the -W option.

Note that shp2pgsql uses its own iconv conversion routines, and not the PostgreSQL conversion routines. Hence if you take a look at the iconv documentation here: http://www.gnu.org/software/libiconv/ you can see that ISO-8859-11 is actually supported as a -W parameter.

Once your output file has been produced, you should be able to run it directly into a UTF-8 encoded database without any problems. I do agree that the documentation on this is a little vague though (I had to refer to the source to see what was happening...)


ATB,

Mark.

--
Mark Cave-Ayland - Senior Technical Architect
PostgreSQL - PostGIS
Sirius Corporation plc - control through freedom
http://www.siriusit.co.uk
t: +44 870 608 0063

Sirius Labs: http://www.siriusit.co.uk/labs
_______________________________________________
postgis-users mailing list
postgis-users@postgis.refractions.net
http://postgis.refractions.net/mailman/listinfo/postgis-users

Reply via email to