https://bugzilla.wikimedia.org/show_bug.cgi?id=23629

--- Comment #3 from Innocenti Maresin <q...@roger.net.ru> 2011-12-24 16:30:19 
UTC ---
Let us browse exactly to the query mentioned by me in the bugzilla's "URL"
field and examine the resulting document.

% wget
'http://ru.wikipedia.org/w/index.php?title=Special:Search&fulltext=1&search=%D0%B0&ns4=1&uselang=en'
--20:09:28-- 
http://ru.wikipedia.org/w/index.php?title=Special:Search&fulltext=1&search=%D0%B0&ns4=1&uselang=en
           =>
`index.php?title=Special:Search&fulltext=1&search=а&ns4=1&uselang=en'
…
20:09:30 (124.34 KB/s) -
`index.php?title=Special:Search&fulltext=1&search=а&ns4=1&uselang=en' stored
[41804/41804]

% hexdump -C -s 0x5d90 -n 128
index.php\?title=Special:Search\&fulltext=1\&search=а\&ns4=1\&uselang=en
00005d90  d0 be d0 b2 20 7c 20 3c  73 70 61 6e 20 63 6c 61  |.... | <span cla|
00005da0  73 73 3d 27 73 65 61 72  63 68 6d 61 74 63 68 27  |ss='searchmatch'|
00005db0  3e d0 3c 2f 73 70 61 6e  3e 90 2e d0 9a d1 80 d1  |>.</span>.......|
00005dc0  8b d0 bc d0 be d0 b2 20  7c 20 32 30 30 38 2d 31  |....... | 2008-1|
00005dd0  31 2d 30 39 20 7c 20 39  37 34 35 20 7c 20 d0 9f  |1-09 | 9745 | ..|
00005de0  d0 b0 d1 82 d1 80 d1 83  d0 bb d0 b8 d1 80 d1 83  |................|
00005df0  d1 8e d1 89 d0 b8 d0 b9  2c 20 d0 be d1 82 d0 ba  |........, ......|
00005e00  d0 b0 d1 82 d1 8b d0 b2  d0 b0 d1 8e d1 89 d0 b8  |................|

Here you can see invalid byte string 0xd0 (without continuation bytes) at
offset 0x00005db1 and misplaced continuation byte 0x90 at 0x00005db9.
This is U+0410 — Cyrillic letter "А" — split to 2 portions. This is clearly
visible in a browser too, as replacement characters. Is this exercise really so
complicated or boring for MediaWiki programmers?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.
_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to