https://bugzilla.wikimedia.org/show_bug.cgi?id=17794

           Summary: establish tests to catch non-Traditional Chinese
                    mainstream characters
           Product: MediaWiki
           Version: 1.14.0
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: Normal
         Component: Internationalization
        AssignedTo: wikibugs-l@lists.wikimedia.org
        ReportedBy: jida...@jidanni.org


Hello. There are a few non-Taiwan variant characters that have crept
into the Taiwan message files.

This is a different bug that just plain translating.

It involves adding another layer of caution to catch things even a good
translations would miss.

The translations are fine, it is just that a final test must be added
to catch very similar looking wrong characters.

They are not Unicode variants, no, but instead characters that have
never been seen before in Taiwan, as shown by the fact they don't make
the round trip to big5 and then back to Unicode. They are typos for
the common Taiwan version. For example, they are not simplified
Chinese 钩 present in GB2312, nor the common Taiwan version present in
big5, 鉤, but instead a third variant: 鈎.

What I am hoping you will do is add a test to make sure such
characters don't again creep in again.

The test should say "**Non Taiwan characters found in file ...;
Please pick the Taiwan versions (e.g., replace 鈎 with 鉤) before
this version of MediaWiki can be released**" die(1);

Here is the makefile I used:
d=/var/lib/mediawiki/languages/messages
v:$d/MessagesZh_hant.twdiff $d/MessagesZh_tw.twdiff
%.twdiff:%.php
        iconv -ct big5 $?|iconv -f big5|diff -U0 $? -|sed /^@@/d

Note that it is crude, in that it also catches superscript numbers
etc., though all we want to be on the lookout for is the Chinese
characters.

And here is the results. You will notice the missing characters
are the ones that didn't make the round trip to big5 and back.

No I'm not just asking you to correct those characters and forget this
bug.

I'm saying that a test needs to be added to always catch such things
before each MediaWiki release can proceed.

Also consider extending the test to MessagesZh_classical.php etc.

(Lastly, this is not a diff to be applied to anything!)

make v
iconv -ct big5 /var/lib/mediawiki/languages/messages/MessagesZh_hant.php|iconv
-f big5|diff -U0 /var/lib/mediawiki/languages/messages/MessagesZh_hant.php
-|sed /^@@/d
--- /var/lib/mediawiki/languages/messages/MessagesZh_hant.php   2009-03-01
23:31:04.000000000 +0800
+++ -   2009-03-05 08:07:19.263443332 +0800
-/** Traditional Chinese (‪中文(繁體)‬)
+/** Traditional Chinese (中文(繁體))
-'usercssjsyoucanpreview'           => "'''提示:'''
在保存前請用'顯示預覧'按鈕來測試您新的 CSS/JS 。",
+'usercssjsyoucanpreview'           => "'''提示:'''
在保存前請用'顯示預'按鈕來測試您新的 CSS/JS 。",
-'edit-hook-aborted'                => '編輯被鈎取消。
+'edit-hook-aborted'                => '編輯被取消。
-'post-expand-template-argument-category'  =>
'包含着略過模板參數的頁面',
+'post-expand-template-argument-category'  =>
'包含略過模板參數的頁面',
-'timezonetext'              =>
'¹輸入當地時間與伺服器時間(UTC)的時差。',
+'timezonetext'              =>
'輸入當地時間與伺服器時間(UTC)的時差。',
-'timezoneoffset'            => '時差¹:',
+'timezoneoffset'            => '時差:',
-Template:消歧义
-Template:消除歧义
+Template:消歧
+Template:消除歧
-'protect-cascadeon'           =>
'以下的{{PLURAL:$1|一個|多個}}頁面包含着本頁面的同時,啟動了連鎖保護,因此本頁面目前也被保護,未能編輯。您可以設定本頁面的保護級別,但這並不會對連鎖保護有所影響。',
+'protect-cascadeon'           =>
'以下的{{PLURAL:$1|一個|多個}}頁面包含本頁面的同時,啟動了連鎖保護,因此本頁面目前也被保護,未能編輯。您可以設定本頁面的保護級別,但這並不會對連鎖保護有所影響。',
-'trackbackremove'   => '([$1删除])',
+'trackbackremove'   => '([$1除])',
-'version-parserhooks'              => '語法鈎',
+'version-parserhooks'              => '語法',
-'version-hooks'                    => '鈎',
+'version-hooks'                    => '',
-'version-parser-function-hooks'    => '語法函數鈎',
+'version-parser-function-hooks'    => 語法函數',
-'version-hook-name'                => '鈎名',
+'version-hook-name'                => '名',
iconv -ct big5 /var/lib/mediawiki/languages/messages/MessagesZh_tw.php|iconv -f
big5|diff -U0 /var/lib/mediawiki/languages/messages/MessagesZh_tw.php -|sed
/^@@/d
--- /var/lib/mediawiki/languages/messages/MessagesZh_tw.php     2009-03-01
06:04:42.000000000 +0800
+++ -   2009-03-05 08:07:19.292322989 +0800
-/** Chinese (Taiwan) (‪中文(台灣)‬)
+/** Chinese (Taiwan) (中文(台灣))
- * @author לערי ריינהארט
+ * @author  
-'usercssjsyoucanpreview'    => "'''提示:'''
在保存前請用'顯示預覧'按鈕來測試您新的 CSS/JS 。",
+'usercssjsyoucanpreview'    => "'''提示:'''
在保存前請用'顯示預'按鈕來測試您新的 CSS/JS 。",
-'timezoneoffset'           => '時差¹',
+'timezoneoffset'           => '時差',
-Template:消歧义
-Template:消除歧义
+Template:消歧
+Template:消除歧
-'protect-cascadeon'           =>
'以下的{{PLURAL:$1|一個|多個}}頁面包含着本頁面的同時,啟動了連鎖保護,因此本頁面目前也被保護,未能編輯。您可以設定本頁面的保護級別,但這並不會對連鎖保護有所影響。',
+'protect-cascadeon'           =>
'以下的{{PLURAL:$1|一個|多個}}頁面包含本頁面的同時,啟動了連鎖保護,因此本頁面目前也被保護,未能編輯。您可以設定本頁面的保護級別,但這並不會對連鎖保護有所影響。',
-'trackbackremove'   => '([$1删除])',
+'trackbackremove'   => '([$1除])',


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to