[issue24036] GB2312 codec is using a wrong covert table

2016-10-02 Thread Mingye Wang
Mingye Wang added the comment:
> Advice for final user:
This seems something worth adding to the codecs doc as a footnote. Perhaps something like "(deprecated) ... gb2312 is an obsolete encoding from the 1980s. Use gbk or gb18030 instead." will do.
> libiconv-1.14 is also using the wrong
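The suggestion to prefer gbk or gb18030 over gb2312 can be illustrated with a small fallback helper. This is only a sketch: `first_working_codec` is a hypothetical name, not part of Python's `codecs` module, and the order of the candidates is an assumption made for illustration.

```python
def first_working_codec(text, candidates=("gb2312", "gbk", "gb18030")):
    """Return (codec_name, encoded_bytes) for the first codec that can encode text.

    Hypothetical helper: it simply tries each standard-library codec in turn.
    """
    for name in candidates:
        try:
            return name, text.encode(name)
        except UnicodeEncodeError:
            continue  # this codec has no mapping for some character; try the next
    raise ValueError("none of the candidate codecs can encode the text")


if __name__ == "__main__":
    # U+00B7 MIDDLE DOT is the character discussed in this thread.
    for sample in ("\u4e2d\u6587", "\u00b7"):
        codec_name, data = first_working_codec(sample)
        print(repr(sample), "->", codec_name, data.hex())
```

Text that gb2312 rejects falls through to the wider encodings, which is the practical effect of the proposed footnote.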

[issue24036] GB2312 codec is using a wrong covert table

2016-01-04 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: I think we can close this issue as "won't fix". It's a bug, but one which is present in a lot of other systems as well, so we'd potentially make it impossible to write GB2312 data which is supposed to be read back by these other systems. Ma Lin: Do you

[issue24036] GB2312 codec is using a wrong covert table

2016-01-04 Thread Ma Lin
Ma Lin added the comment: I agree with you, "won't fix".

[issue24036] GB2312 codec is using a wrong covert table

2016-01-04 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Thanks, Ma Lin. -- resolution: -> wont fix; status: open -> closed

[issue24036] GB2312 codec is using a wrong covert table

2016-01-02 Thread Ezio Melotti
Changes by Ezio Melotti: -- stage: -> patch review

[issue24036] GB2312 codec is using a wrong covert table

2015-04-24 Thread Ma Lin
Ma Lin added the comment: Andre Lemburg, we don't need any modification: A844 is in GBK but not in GB2312, so there is no need to add it to GB2312. Your logic is right; it's hard to judge which one is wrong. But U+30FB (・ KATAKANA MIDDLE DOT) and U+2015 (― HORIZONTAL BAR) have no reason among these

[issue24036] GB2312 codec is using a wrong covert table

2015-04-24 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Hi Ma Lin, thank you for your investigation. In order to fix these tables, we'd need an official reference which shows that there is in fact an error. If most programming languages you have tested use the wrong version, then maybe it's not wrong after

[issue24036] GB2312 codec is using a wrong covert table

2015-04-23 Thread Ma Lin
Ma Lin added the comment: Today, I investigated these popular programming languages, all at their latest versions.
iconv-1.14: wrong version
php-5.6.8: wrong version (php is using iconv)
ActivePerl-5.20.2: wrong version
GoLang-1.4.2: no GB2312, only has

[issue24036] GB2312 codec is using a wrong covert table

2015-04-23 Thread Ma Lin
New submission from Ma Lin: While I was trying to optimize the GB2312/GBK/GB18030-2000 codecs (three encodings that are widely used in China), I found a bug. The relation between the three encodings should be: GB2312 ⊂ GBK ⊂ GB18030-2000. However, in Python's implementation: GB2312 ⊄ GBK ⊂
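One way to check the subset relation described above is to decode every two-byte code in the GB2312 range with both codecs and list the codes where the results differ. The following is a minimal sketch, assuming the standard-library `gb2312` and `gbk` codecs; `gb2312_gbk_mismatches` is a hypothetical helper name, and the scanned byte ranges are an assumption that covers the GB2312 rows.

```python
def gb2312_gbk_mismatches():
    """List (raw_bytes, gb2312_char, gbk_char) where the two codecs disagree.

    gbk_char is None when the gbk codec cannot decode the bytes at all.
    """
    mismatches = []
    for lead in range(0xA1, 0xF8):        # GB2312 lead bytes
        for trail in range(0xA1, 0xFF):   # GB2312 trail bytes
            raw = bytes([lead, trail])
            try:
                as_gb2312 = raw.decode("gb2312")
            except UnicodeDecodeError:
                continue  # code point is unassigned in GB2312; nothing to compare
            try:
                as_gbk = raw.decode("gbk")
            except UnicodeDecodeError:
                as_gbk = None
            if as_gb2312 != as_gbk:
                mismatches.append((raw, as_gb2312, as_gbk))
    return mismatches


if __name__ == "__main__":
    for raw, gb2312_char, gbk_char in gb2312_gbk_mismatches():
        gbk_text = "undecodable" if gbk_char is None else f"U+{ord(gbk_char):04X}"
        print(raw.hex().upper(), f"gb2312=U+{ord(gb2312_char):04X}", f"gbk={gbk_text}")
```

If GB2312 were a strict subset of GBK in the implementation, the list would be empty; any entry it prints is a byte sequence that the two codecs map to different characters.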

[issue24036] GB2312 codec is using a wrong covert table

2015-04-23 Thread Ma Lin
Ma Lin added the comment: Since MIDDLE DOT is a common character, this bug is annoying. Sorry, it's MIDDLE DOT, not DASH. -- Python tracker http://bugs.python.org/issue24036

[issue24036] GB2312 codec is using a wrong covert table

2015-04-23 Thread Serhiy Storchaka
Changes by Serhiy Storchaka storch...@gmail.com: -- nosy: +lemburg, loewis