Features:
1. Recoding text and HTML documents using the 'recode' library
2. Detecting the charset of HTML documents from the <meta charset=..> tag
3. Detecting the charset of text or HTML documents using the dictionary
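
As an illustration of feature 2, here is a minimal sketch of <meta>-based charset detection. It is not this project's implementation: the function name detect_meta_charset, the scan_limit parameter and the regular expression are assumptions made for the example, and a real HTML parser would handle more edge cases.

```python
import re

# Hypothetical pattern (an assumption, not taken from this project): matches both
# <meta charset="..."> and the older
# <meta http-equiv="Content-Type" content="text/html; charset=..."> form.
_META_CHARSET = re.compile(
    rb"""<meta[^>]*?charset\s*=\s*["']?\s*([A-Za-z0-9_.:-]+)""",
    re.IGNORECASE,
)


def detect_meta_charset(html: bytes, scan_limit: int = 1024):
    """Return the lower-cased charset declared in a <meta> tag, or None.

    Only the first scan_limit bytes are searched, since the declaration
    is expected near the top of the document.
    """
    match = _META_CHARSET.search(html[:scan_limit])
    if match is None:
        return None
    return match.group(1).decode("ascii").lower()
```

Working on raw bytes avoids having to decode the document before its encoding is known. When the function returns None, a fallback such as the dictionary-based detection of feature 3 would be needed.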
