-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Роман Мартынович wrote:
> Hello!
> 
> I use wget on Windows to parse html files from the Web to my pc. I live
> in Russia and so I parse Russian sites. Sometimes parsed files happen to
> be stored in the wrong encoding - they have charset=windows-1251 in their
> <meta> tag, but I have to choose the koi-8 encoding to get them to appear
> correctly in Firefox, and in MS Notepad it's impossible to change the
> encoding. I can't find the reason why. And I also cannot process these
> files in my applications.
> 
> So I ask you to make it possible to choose encoding of html files as an
> option, or if it is a bug to fix it.

Wget doesn't transcode files; it stores each one exactly as the server
sent it. We might add a feature to do so at some point in the future,
but probably not any time soon. Eventually we would also like to add
arbitrary post-download filters, which could probably be used to
address this sort of thing.

The real problem, though, is that whoever created the files set the meta
tag incorrectly; you should contact the site to address this problem.
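
In the meantime, one possible workaround is to re-encode the downloaded
files yourself after Wget has saved them. Below is a minimal sketch in
Python, not a Wget feature. It assumes the files are really KOI8-R and
that you want them in windows-1251 to match what the <meta> tag claims;
the function name and the file names are hypothetical.

```python
def transcode_file(src_path, dst_path,
                   src_encoding="koi8-r", dst_encoding="windows-1251"):
    """Re-encode a saved HTML file from src_encoding to dst_encoding.

    Hypothetical helper: the default encodings are assumptions based on
    the symptoms described above, not something Wget reports.
    """
    # Read the raw bytes exactly as they were stored on disk.
    with open(src_path, "rb") as src:
        raw = src.read()

    # Interpret the bytes using the encoding the file is actually in.
    text = raw.decode(src_encoding)

    # Characters that the target encoding cannot represent are written
    # as numeric character references, which HTML accepts.
    with open(dst_path, "wb") as dst:
        dst.write(text.encode(dst_encoding, errors="xmlcharrefreplace"))


if __name__ == "__main__":
    # Hypothetical file names; replace them with your own.
    transcode_file("index.html", "index.cp1251.html")
```

If you would rather keep the files as they are, the opposite fix also
works: leave the bytes alone and edit the charset in the <meta> tag to
name the encoding the file is really in.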

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
Maintainer of GNU Wget and GNU Teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkmsQD4ACgkQ7M8hyUobTrG/awCbB/nh+SugovMYKUcDf5r0gTUa
a6YAn0vkyrXpGBmYRjPZ6DgugCWZQkRF
=3dvI
-----END PGP SIGNATURE-----

