On 07/09/16 10:04, I wrote:
> How about using 'backslashreplace' instead of 'replace' to encode to
> list's preferred language in Mailman/Handlers/SpamDetect.py ?
>
> Then, desirable pattern in this case seems to be
>
> ~Subject.*(\\u[0-9a-f]{4}){4}

^Subject:.*(\\u[0-9a-f]{4,5}){4}
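As a rough illustration of the idea (not the actual code in SpamDetect.py), the sketch below shows what 'backslashreplace' does to a subject line and how the pattern above then matches it. It assumes Python 3 and a made-up sample subject; the variable names are only for this example. The sample sticks to characters below U+10000, because Python 3 writes code points above U+FFFF as `\U` followed by eight hex digits, which this pattern would not match as written.

```python
import re

# Hypothetical subject line containing four consecutive CJK ideographs.
subject = "Subject: \u4e2d\u6587\u5783\u573e offer"

# Encode to an ASCII-only charset. With 'backslashreplace', characters that
# cannot be encoded become literal backslash escapes instead of '?'.
encoded = subject.encode("us-ascii", "backslashreplace").decode("us-ascii")

# With 'replace', the same characters collapse to '?', so nothing is left
# for a pattern to distinguish.
lossy = subject.encode("us-ascii", "replace").decode("us-ascii")

# The pattern from this message: four escaped characters in a row.
pattern = re.compile(r"^Subject:.*(\\u[0-9a-f]{4,5}){4}")

matched = pattern.search(encoded) is not None
lossy_matched = pattern.search(lossy) is not None
plain_matched = pattern.search("Subject: hello world") is not None

print(encoded)
print(matched, lossy_matched, plain_matched)
```

Only the 'backslashreplace' version keeps enough information for the pattern to fire; the 'replace' version and a plain ASCII subject are left alone.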
(The CJK Unified Ideographs Extension blocks from Extension B onward use
code points >= U+20000, which take five hex digits.)

Of course, you can specify the code point range more precisely, so that
the pattern does not match anything other than Chinese characters.

-- 
Yasuhito FUTATSUKI <futat...@poem.co.jp>
------------------------------------------------------
Mailman-Users mailing list Mailman-Users@python.org
https://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: https://mail.python.org/mailman/options/mailman-users/archive%40jab.org