> Norman wrote:
>
>> This is a good question since if you read a file you may or may not
>> be able to guess the encoding using some means (BOM, etc) There is a
>> GuessJapaneseEncoding which will only guess what Japanese   encoding
>> encoding text may be.
>
> Right.  And even that isn't completely reliable; it's just using a
> series of heuristics.

Guessing UTF-8 is almost always correct. It's way over 99%  
reliability. As long as the UTF-8 is well formed, it's likely to be  
UTF-8. The longer the text of course, and the higher fraction of high  
bytes, the liklier it's UTF-8.

It can be done using my ElfData plugin, very easily, using  
the .Verify function.

I use this in practice, for my Encoding Master app. It guesses the  
encoding of text files for you amoungst other things.

--
http://elfdata.com/plugin/
"String processing, done right"


_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>

Search the archives:
<http://support.realsoftware.com/listarchives/lists.html>

Reply via email to