Michael added a comment.
In T327514#8632160 <https://phabricator.wikimedia.org/T327514#8632160>, @Lucas_Werkmeister_WMDE wrote: > [...] MediaWiki core’s MediaWikiTitleCodec::splitTitleString() <https://gerrit.wikimedia.org/g/mediawiki/core/+/3cc288eac4/includes/title/MediaWikiTitleCodec.php#369> hard-codes the bidi characters as forbidden: U+200E-F and U+202A-E. I guess we could do the same, and re-encode those seven while allowing the rest of the `Cf` category? [...] Could we maybe go the opposite way? Having an allow-list of characters in `Cf` that we explicitly decode? Then we could maybe start with ZWJ/ZWNJ and add further chars as needed. I imagine that this would feel safer and more understandable to me when reading the code. TASK DETAIL https://phabricator.wikimedia.org/T327514 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Lucas_Werkmeister_WMDE, Michael Cc: Michael, ItamarWMDE, Aklapper, Arian_Bozorg, Nikki, Sarai-WMDE, Astuthiodit_1, AWesterinen, karapayneWMDE, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Mahir256, QZanden, EBjune, merbst, LawExplorer, Salgo60, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Lydia_Pintscher, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org