Not all charsets can be converted to the ascii or latin1 charset.
windows-1251 can't be converted to latin1/ascii, at least not its Cyrillic
part.
Don't worry about windows-1252; its letters are compatible with latin1
(the two differ only in the 0x80-0x9F range).
windows-1250 can be converted to latin1/ascii without having to lose
major
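
To make the difference between the three code pages concrete, here is a minimal Python sketch. The sample strings and the helper names `fits_in` and `strip_accents` are made up for illustration, and the accent-stripping fallback is only one possible way to squeeze windows-1250 text into ascii; it is an assumption, not something stated above.

```python
import unicodedata


def fits_in(text: str, target: str) -> bool:
    """Return True if every character of text exists in the target charset."""
    try:
        text.encode(target)
        return True
    except UnicodeEncodeError:
        return False


def strip_accents(text: str) -> str:
    """Lossy fallback (assumption): decompose letters, drop combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Hypothetical sample bytes, as they might arrive from pages in each code page.
cyrillic = "Привет".encode("windows-1251")
western = "café".encode("windows-1252")
central = "Dvořák".encode("windows-1250")

# Decode with the source code page first, then test the target charset.
cyrillic_fits = fits_in(cyrillic.decode("windows-1251"), "latin-1")
western_fits = fits_in(western.decode("windows-1252"), "latin-1")
central_fits = fits_in(central.decode("windows-1250"), "latin-1")

# Accented Central European letters can often be reduced to plain ascii.
central_ascii = strip_accents(central.decode("windows-1250"))

print(cyrillic_fits, western_fits, central_fits, central_ascii)
```

With these samples the Cyrillic text does not fit into latin1, the windows-1252 text does, and the windows-1250 text only fits after the accents are dropped. Letters without a decomposition (for example the Polish Ł) pass through `strip_accents` unchanged and would still need separate handling.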
I'm outputting XML from my search engine for use in other people's websites,
and I'm having a small problem.
Some of the sites I'm indexing are made in Word [I've no control over this],
and saved as HTML.
And they're in strange character sets like windows-125{0,1,2}.
When I output the XML,