> Norman wrote: > >> This is a good question since if you read a file you may or may not >> be able to guess the encoding using some means (BOM, etc) There is a >> GuessJapaneseEncoding which will only guess what Japanese encoding >> encoding text may be. > > Right. And even that isn't completely reliable; it's just using a > series of heuristics.
Guessing UTF-8 is almost always correct. It's way over 99% reliability. As long as the UTF-8 is well formed, it's likely to be UTF-8. The longer the text of course, and the higher fraction of high bytes, the liklier it's UTF-8. It can be done using my ElfData plugin, very easily, using the .Verify function. I use this in practice, for my Encoding Master app. It guesses the encoding of text files for you amoungst other things. -- http://elfdata.com/plugin/ "String processing, done right" _______________________________________________ Unsubscribe or switch delivery mode: <http://www.realsoftware.com/support/listmanager/> Search the archives: <http://support.realsoftware.com/listarchives/lists.html>
