I am receiving data for entry into a mysql database. The information comes to me in gzipped files and I'm using Compress::Zlib to uncompress the file in perl. I'm running perl 5.8.5.

The information in the file is encoded in latin1. It is mostly ASCII 7-bit with HTML entities and some characters with ordinal values > 127.

I'm using DBI to access the database and would like to enter the data into the mysql database in UTF-8 format. Everything I have read has indicated that if I can read the file using PerlIO, perl will automatically convert the data to UTF-8, but since the files are compressed and I'm uncompressing them in perl, I can't figure out how to get perl to translate the data into UTF-8. I would like for perl to translate the HTML entities to UTF-8 encoding as well, if that is possible.

Any help or suggestions would be greatly appreciated.



Carl.



_______________________________________________
Perl-Unix-Users mailing list
Perl-Unix-Users@listserv.ActiveState.com
To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs

Reply via email to