https://bugzilla.wikimedia.org/show_bug.cgi?id=9530

Philippe Verdy <verd...@wanadoo.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |verd...@wanadoo.fr

--- Comment #26 from Philippe Verdy <verd...@wanadoo.fr> 2010-12-05 10:47:49 
UTC ---
May be the solution would be to use Punycode encoding ? (and really, you should
avoid dots everywhere because they are not liked in CSS selectors.
Punycode (used in IDN) solves all these problems. We just don't have to
restrict dots (don't need nameprep and its internal very complex character
equivalence mappings), they can be Punycoded like the rest.

The good thing about Punycode is that it just uses letters and digits and is
case insenstive ; minus-hyphens are use to separate "words" made of cese
insensitive letters and digits, and the result can be mappedto javascript
properties (like in HTYML5 dataset).

Note that this won't make the ID's built from section headers necesarily unique
(there's still a frequent case where multiple headers for distcint sections
have identical text content; if this ever happens, some suffix should be
appended to the duplicate section headers only.

Oh, my... chaging the algorithm will force all preparsed pages to have their
HTML flushed from the server cache, and could break existing URLs that are
inserted in discussions as is. But not a major problem, as it would break these
URLS, only anchors won't be found. This already happens frequently
independantly of this bug, simply because not enough people know how to use the
MediaWiki parser functions for computing anchro links. We have utility template
on all wikis for this, this is a matter of training, but it is not critical in
discussions.

In main articles however, we occasionnly find links to other article sections:
a quelity check of these pages should really use the URL-building parser
funtions. Or the aricles should contain manually inserted (and predictable)
anchor (using <span id=""></psan>), independantly of the text used in section
headings.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to