Mahir256 renamed this task from "Wikidata search suggestions do not return anything if a character containing nukta is present" to "Wikidata search suggestions do not return anything if a character whose decomposition contains nukta is present".
Mahir256 triaged this task as "Normal" priority.
Mahir256 updated the task description.

CHANGES TO TASK DESCRIPTION
Most Indic-language sites and Commons appear to handle characters such as ढ़, য়, ਖ਼, and ଡ଼ correctly when they are present in search queries, returning appropriate suggestions. (Note that these are combined, i.e. //not// already decomposed into a consonant and a nukta.) On those sites the part of each suggestion that corresponds to what was typed is not bolded, but that is a minor issue by comparison. Wikidata's search suggestions, by contrast, return nothing at all when such a character is present.

To reproduce, change your interface language to Bengali, copy the text "বিষয়শ্রেণী:" ("Category:" in Bengali), and paste it into Wikidata's search box: no category pages are suggested. Then replace the "য়" in that word with "য + ়" (after removing the two spaces and the plus sign from that quotation, so that the nukta directly follows the consonant), and category pages are suggested.
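
Because copying and pasting can silently change which form of the character ends up in the search box, the two query strings can also be built from explicit code points. The following is only a sketch: the names `composed`, `decomposed`, and `search_url` are hypothetical, and using the `wbsearchentities` API module as a stand-in for the search box suggestions is an assumption, not something confirmed in this task.

```python
from urllib.parse import urlencode

# Precomposed BENGALI LETTER YYA versus BENGALI LETTER YA followed by the nukta sign.
YYA = "\u09DF"
YA_NUKTA = "\u09AF\u09BC"

# "Category:" in Bengali, with a placeholder where the affected letter goes.
TEMPLATE = "\u09AC\u09BF\u09B7{}\u09B6\u09CD\u09B0\u09C7\u09A3\u09C0:"

composed = TEMPLATE.format(YYA)         # reported to yield no suggestions
decomposed = TEMPLATE.format(YA_NUKTA)  # reported to yield suggestions


def search_url(query, language="bn"):
    """Build a wbsearchentities request URL for the query (assumed stand-in for the search box)."""
    params = {
        "action": "wbsearchentities",
        "search": query,
        "language": language,
        "format": "json",
    }
    return "https://www.wikidata.org/w/api.php?" + urlencode(params)


for label, query in (("composed", composed), ("decomposed", decomposed)):
    print(label, [f"U+{ord(ch):04X}" for ch in query])
    print(search_url(query))
```

Fetching the two printed URLs and comparing the results would show whether the API behaves the same way as the search box for these inputs.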

This does not appear to affect every character for which a decomposition exists in the Unicode standard: searches such as "Cañada" (where the "ñ" decomposes into "n + U+0303") do return suggestions properly.
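
One difference between the two groups of characters that may be worth checking is how Unicode normalization treats them. The sketch below uses Python's standard `unicodedata` module to print the NFC and NFD forms of the four Indic letters above and of "ñ". In Unicode the four Indic letters are composition exclusions, so NFC leaves them decomposed, while "ñ" stays a single code point under NFC. Whether this has anything to do with Wikidata's behaviour is an assumption and is not established in this task; the helper names are hypothetical.

```python
import unicodedata


def codepoints(text):
    """Format a string as space-separated U+XXXX code points."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


# Precomposed characters from the description, written as escapes so that
# the editor or clipboard cannot normalize them behind our back.
SAMPLES = {
    "Devanagari RHA": "\u095D",
    "Bengali YYA": "\u09DF",
    "Gurmukhi KHHA": "\u0A59",
    "Oriya RRA": "\u0B5C",
    "Latin n with tilde": "\u00F1",
}


def normalization_report(samples=SAMPLES):
    """Return (name, original, NFC, NFD) code point strings for each sample."""
    rows = []
    for name, ch in samples.items():
        nfc = unicodedata.normalize("NFC", ch)
        nfd = unicodedata.normalize("NFD", ch)
        rows.append((name, codepoints(ch), codepoints(nfc), codepoints(nfd)))
    return rows


for name, original, nfc, nfd in normalization_report():
    print(f"{name}: original={original} NFC={nfc} NFD={nfd}")
```

If a search backend stores text in NFC but receives a query containing one of these precomposed letters without normalizing it first, the two would not match byte for byte, which would be consistent with the behaviour described here.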

TASK DETAIL
https://phabricator.wikimedia.org/T170779

To: Mahir256
Cc: Aftabuzzaman, Mahir256, Aklapper, GoranSMilovanovic, QZanden, Izno, Wikidata-bugs, aude, Mbch331