Status: Untriaged
Owner: ----
CC: [email protected]
Labels: Type-Bug Pri-3 OS-All Area-WebKit Size-Medium I18N

New issue 15898 by [email protected]: Improve encoding detection for a  
very short text which may be useful for query-display
http://code.google.com/p/chromium/issues/detail?id=15898

This is b/1082767.

ICU's autoencoding detector uses byte tri-grams with binary values (good or
bad) and an encoding with the most 'good' votes wins. This works reasonably
well for chunks of data reasonably long (say, 200 bytes), but it works very
poorly for a short chunk.

Firefox uses byte unigrams and bigrams with weights.

--
You received this message because you are listed in the owner
or CC fields of this issue, or because you starred this issue.
You may adjust your issue notification preferences at:
http://code.google.com/hosting/settings

--~--~---------~--~----~------------~-------~--~----~
Automated mail from issue updates at http://crbug.com/
Subscription options: http://groups.google.com/group/chromium-bugs
-~----------~----~----~----~------~----~------~--~---

Reply via email to