Epidosis added a comment.

  In a case like https://www.wikidata.org/wiki/Q191118, I tend to think a 
general label is not applicable (each language has its name for "tonne"); 
having "mul-lat" and "mul-cyr", I would set "t" as "mul-lat" alias and "т" as 
"mul-cyr" alias; having only "mul", I would set both "t" and "т" as "mul" 
aliases. Anyway, it is reasonable to start applying "mul" (or similar) with 
specific types of items (I tried to list them above; starting with some of 
them, as disambiguations, could work well).
  
  The idea of an apposite code for Latin name of taxa seems very interesting to 
me.
  
  In T285156#7322423 <https://phabricator.wikimedia.org/T285156#7322423>, 
@Epidosis wrote:
  
  > There are at least 3 categories of items which strongly need this:
  >
  > 1. `persons` 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q5, as of now 
9.2M): in most cases the same label and the same aliases are repeated in 
different languages (e.g. in 
wikidata.org/w/index.php?title=Q19667413&action=history I can count 6 
same-label additions: fr, nl, sl, ca, ast, sq; many other items are similar)
  >   - in the case of people, "mul-<script>" is required: names are the same 
only considering languages with the same alphabet, I'm mostly thinking about 
Latin alphabet
  >   - in some cases there could be the following problem: one Latin-script 
language may prefer a form (e.g. "Philip L. Brown"), another Latin-language 
script another form (e.g. "Philip Larry Brown" or "Philip Brown"); while the 
group of labels and aliases is the same for all same-script languages, which is 
the label and which is the alias may vary from language to language; of course, 
this problem occurs only when there is more than one form of the name, but in 
many cases this doesn't happen
  > 2. `given names` and `family names` (https://w.wiki/3zWT, which counts 
Q202444 and Q101352 including subclasses, as of now 590k): in all cases the 
same label are repeated in different same-script languages (e.g. 
https://www.wikidata.org/wiki/Q21448867)
  > 3. `scientific articles` 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q13442814, as 
of now 37.3M): in most cases the same label is repeated in different languages 
(e.g. https://www.wikidata.org/wiki/Q27860672)
  >   - in some cases there could be articles with parallel titles in different 
languages (e.g. https://www.wikidata.org/wiki/Q59238742)
  >
  > Considering also
  >
  > 4. `asteroids` 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q3863, as of 
now 247k)
  > 5. `galaxies` 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q318, as of 
now 2.1M)
  > 6. `taxa` 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q16521, as of 
now 3.1M)
  >
  > and still leaving out disambiguation pages 
(https://www.wikidata.org/wiki/Special:Search/haswbstatement:P31=Q4167410, as 
of now 1.3M), we obtain **52.6M** items, out of 94.9M items, and probably the 
count could be further increased.
  >
  > Ideally, as I noted, "mul" and "mul-<script>" should allow exceptions in 
some way, but their necessity is very clear. I would suggest at least medium 
priority.

TASK DETAIL
  https://phabricator.wikimedia.org/T285156

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Epidosis
Cc: Esc3300, Ash_Crow, Moebeus, Lucas_Werkmeister_WMDE, So9q, Ainali, Epidosis, 
Shushugah, Manuel, Nikki, Mbch331, jhsoby, Amire80, Lydia_Pintscher, 
ChristianKl, Mahir256, Aklapper, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to