Hi,

On Tue, 2023-01-17 at 12:03 +0100, Martin Domdey wrote:
> isn't it better to avoid invisible characters in page titles
> while creating the pages? 
> 
> Please look here, there has been problems with invisible characters
> working with it when parsing or page linking those page titles with
> invisible unicode
> characters: https://de.wikipedia.org/wiki/Benutzer_Diskussion:Wurgl#L
> iste_der_Biografien/Ci

See also
https://en.wikipedia.org/wiki/Wikipedia:Village_pump_%28technical%29#UTF-8_ZERO_WIDTH_SPACE_in_page_title

> Instead of this there will never be a problem when invisible
> characters within the page title name will be deleted when
> creating the page.
> 
> What do you think about it and what technical approaches do
> already exist? How are LTR and RTL marks dealt if creating pages with
> them?

See https://phabricator.wikimedia.org/maniphest/query/GDxAs4QdEDTG/#R
for related bugs, and a ticket about improving cleanupTitles.php.

Cheers,
andre
-- 
Andre Klapper (he/him) | Bugwrangler / Developer Advocate
https://blogs.gnome.org/aklapper/
_______________________________________________
Wikitech-l mailing list -- wikitech-l@lists.wikimedia.org
To unsubscribe send an email to wikitech-l-le...@lists.wikimedia.org
https://lists.wikimedia.org/postorius/lists/wikitech-l.lists.wikimedia.org/

Reply via email to