[Wikidata-bugs] [Maniphest] T162331: Provide tools for processing obfuscated Chinese geodata (GCJ-02, BD-09)

2022-08-01 Thread C933103
C933103 reopened this task as "Open".
C933103 added a comment.


  In T162331#8121512 <https://phabricator.wikimedia.org/T162331#8121512>, 
@MPhamWMF wrote:
  
  > Closing out low/est priority tasks over 6 months old with no activity 
within last 6 months in order to clean out the backlog of tickets we will not 
be addressing in the near term. Please feel free to reopen if you think a 
ticket is important, but bare in mind that given current priorities and 
resourcing, it is unlikely for the Search team to pick up these tasks for the 
indefinite future. We hope that the requested changes have either been 
addressed by or made irrelevant by work the team has done or is doing -- e.g. 
upgrading Elasticsearch to a newer version will solve various ES-related 
problems -- or will be subsumed by future work in a more generalized way.
  
  This belongs to search team?

TASK DETAIL
  https://phabricator.wikimedia.org/T162331

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: MPhamWMF, Ericliu1912, Esc3300, Nikki, PokestarFan, Pnorman, C933103, 
Aklapper, Arthur2e5, Stang, Liuxinyu970226, mrephabricator, Astuthiodit_1, 
karapayneWMDE, Jonathan5566, Invadibot, LaMagiaaa, Func, LuciferianThomas, 
RuiyuShen, maantietaja, CBogen, ItamarWMDE, Akuckartz, Alilje, Joye_Zhang, 
VulpesVulpes825, SunAfterRain, 94rain, Nandana, Hamishcn, KirkLU, Lahi, Gq86, 
BJ6123C7BTD, Looniverse, GoranSMilovanovic, lisong, Sturm, Allthingsgo, 
QZanden, EBjune, Orienteerix, LawExplorer, _jensen, rosalieper, Htchien, 
JGirault, Taiwania_Justo, Scott_WUaS, phabyogi, Susannaanas, lxbarth, Fuzheado, 
Planemad, Cwek, Wikidata-bugs, aude, zhuyifei1999, Shizhao, awight, Yurik, 
TheDJ, Mbch331, Jay8g
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T236593: Cannot enter multiple forms for the same language variant

2022-07-20 Thread C933103
C933103 added a comment.


  In T236593#8092471 <https://phabricator.wikimedia.org/T236593#8092471>, 
@LucasWerkmeister wrote:
  
  > It’s still not clear to me which problem the `-x-Q123-1` patch is trying to 
solve. Several languages have been mentioned in this task, but which of them 
would benefit from this system? I feel like for several of them, we’ve already 
reached the conclusion that separate forms are in fact the way to go.
  >
  > I’d like to extract a general rule from @Fnielsen’s comment above 
(T236593#5610903 <https://phabricator.wikimedia.org/T236593#5610903>): if you 
need separate statements, then you need separate forms or lexemes. (I think 
this is a sufficient condition, though it might not be a necessary one.) 
Pronunciation (whether pronunciation audio 
<https://www.wikidata.org/wiki/Property:P443> or IPA transcription 
<https://www.wikidata.org/wiki/Property:P898>) is probably the most significant 
kind of statement here: if a speaker would pronounce the spellings differently, 
then they should be different forms – regardless of whether the difference is a 
completely different ending as in octopuses/octopi, or just an extra schwa as 
in aft(e)nen. (I don’t find the hyphenation example as convincing… don’t you 
need a different hyphenation for every spelling variant, even for cases that 
really should just be multiple representations of one form? E.g. co‧lor/co‧lour 
– that could just be multiple statements on the same form, with different 
monolingual text language codes.)
  >
  > I suspect this rule covers the Norwegian example that originally motivated 
this task: I feel like “parametere” and “parametre” are probably pronounced 
differently, much like “aftnen” and “aftenen” are pronounced differently in 
Danish according to Finn. For Vietnamese chữ Nôm, I feel like @mxn’s comment at 
T236593#8024999 <https://phabricator.wikimedia.org/T236593#8024999> goes in a 
similar direction, though I admit I find the whole Chinese-characters part of 
this discussion hard to follow.
  >
  > For the cases where you really only want to have one form with multiple 
representations, I still agree with @daniel’s comment (T236593#5610378 
<https://phabricator.wikimedia.org/T236593#5610378>): “you make up a code for 
each of the spellings”. In practice, the only way to “make up a code” that we 
currently support is to append -x-Q//12345// to an existing, established 
language code. As far as I understand, this solution works well for Hebrew: 
e.g. ספר/סֵפֶר (L67105) <https://www.wikidata.org/wiki/Lexeme:L67105> (the 
“book” word) uses the language codes `he` and `he-x-Q21283070`, where Q21283070 
<https://www.wikidata.org/wiki/Q21283070> represents Tiberian vocalization, the 
orthography with diacritics. At some point, an editorial decision was made that 
the spelling without diacritics “deserves” the unsuffixed `he` language code 
(instead of both spellings using an -x-Q//12345// language code), which I think 
is reasonable: data reusers who don’t care about the different spellings can 
use the most standard language code (`he`) and its single representation per 
form.
  >
  > Allowing people to append an integer number to the item ID adds a second 
way to make up a code, and one that seems less useful to me: without knowing 
what the number means, how do I know which form representation to use? To me 
this runs counter to the goal of “allow[ing] the consumer to choose which 
variant they prefer”. For the languages that appear to need multiple 
representations for the same language code per form (e.g. the Indian languages 
@Mahir256 mentioned in T236593#5608530 
<https://phabricator.wikimedia.org/T236593#5608530>?), is it not possible to 
make the item ID approach work, by creating more special-purpose items? 
Wikidata editors would then make a decision which of the possible spellings 
“deserves” the standard language code, and which additional items need to be 
created (“spelling with character X”, “spelling with sequence Y”?). I 
understand that not all languages have standardized spellings where you can use 
a single item ID to refer to the spelling variants of a wide range of lexemes 
(like in Hebrew), but I think it should still be possible to describe different 
spellings using items that carry more meaning than just a number.
  
  As an English example, some religious people might refuse to write the name 
"God" out directly as it is as this would constitute idolatry. For this we can 
tag it as en-x-Q for which Q refer to religious group of people, but 
there are more than one alternative way to write "God". They can either write 
"G-d", "G*d", "G_d", "G-o-d", and so on. It would make no contextual 
differences in whether a hyphen or a underscore is being used, and the change 
in which exact symbol being used in place of original alphabet woul

[Wikidata-bugs] [Maniphest] T236593: Cannot enter multiple forms for the same language variant

2022-06-27 Thread C933103
C933103 added a comment.


  In T236593#5610378 <https://phabricator.wikimedia.org/T236593#5610378>, 
@daniel wrote:
  
  > I recall that we had long discussions about this when initially deciding on 
the data model. In technical terms, the question was whether we would allow 
only a single literal value for a spelling variant, or a list or set of words. 
Allowing a list or set would enable the kind of flexibility @jhsoby is asking 
for. But the down side is that it introduces ambiguity when listing forms (you 
would always have to list all of them, in undefined order), and when generating 
text (which one should you use)?
  >
  > If I recall correctly, we decided that we want to give the consumer of the 
data maximum control over which variant they prefer, by forcing the producer to 
provide different variant codes for all different spellings. We had discussions 
about how to encode this in the variant (language) codes, and how to represent 
it in the UI, but decided to leave that for later.
  >
  > So, the solution that we envisioned when originally discussing this about 
four years ago was: you make up a code for each of the spellings, in a way that 
allows the consumer to choose which variant they prefer. If that is done by 
encoding a region or a rhyme or a tradition or school or whatever will depend 
on the language. If it's a stylistic choice, name the style.
  >
  > The same approach can be used for historical spellings. codes could look 
something like de-x-hist-nd-15jh or something (this code is totally made up and 
probably linguistically nonsense).
  
  The underlying assumption behind this decision is that, different spelling 
forms must be associated with certain variant, or that there are some of the 
spelling being preferred over other spellings, or that some spelling is more 
commonly used for some spoken variant/sociolet/etc than others and is other 
spelling.
  
  None of these are correct assumption, when it come to non-Chinese languages 
that use Chinese characters, or even some Chinese languages that need to apply 
Chinese characters.
  
  Example of Vietnamese chu nom have already been presented above. Other 
examples includes Japanese ateji when Kanji are used for Japanese native words 
except cases where there have been full established transliteration, and its 
Korean equivalent in history, as well as in languages like Cantonese when 
non-Mandarin words need to be expressed in Chinese characters.

TASK DETAIL
  https://phabricator.wikimedia.org/T236593

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: C933103, AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, 
Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Bodhisattwa, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani

2022-05-10 Thread C933103
C933103 added a comment.


  In T180345#7916540 <https://phabricator.wikimedia.org/T180345#7916540>, 
@GerardM wrote:
  
  > Hoi,
  > We can only support scripts supported in Unicode.
  
  As mentioned in previous posts, chu nom are already supported and added into 
Unicode through CJKV unified ideographic characters extensions.
  
  > So what is the font to be
  > used?
  
  Please see this list of fonts: 
https://en.m.wikipedia.org/wiki/Template:Vi-nom/fonts.css
  
  > Thanks,
  >
  >   GerardM

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, 
jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani

2022-05-09 Thread C933103
C933103 added a comment.


  In T180345#7916094 <https://phabricator.wikimedia.org/T180345#7916094>, @mxn 
wrote:
  
  > Ken Lunde (2009) <https://books.google.com/books?id=SA92uQqTB-AC&pg=PA570> 
writes that //chữ Nôm// had a dedicated script code of `Cu` in ISO 15924:2004 
(which has since been superseded by ISO 15924:2022). Does anyone have more 
information about these two-letter codes or why //chữ Nôm// didn’t get a 
four-letter code corresponding to this two-letter code?
  
  That table is same as the table in Japanese version of the book, published in 
year 2002. <https://books.google.com/books?id=U36IQGjmfqMC&pg=PA413> So it 
would be before the official publication of the ISO 15924. Given Unicode 
maintain a list of changes to ISO 15924 since the standard's official 
publication in 2004, and the list didn't include Chu nom, 
<https://unicode.org/iso15924/codechanges.html> I would assume the table in the 
book reflect content in the standard's draft that didn't make it to the 
ultimate list.

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, 
jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani

2022-05-09 Thread C933103
C933103 added a comment.


  "Hani" simply mean "Chinese[Han] characters".
  "vi-Hani" mean "Vietnamese, written in Chinese[Han] characters".
  Chu Nho, despite widely used in Vietnam in ancient time, are written 
according to Classical Chinese grammar, and as such should classify as 
Classical Chinese text, with code "lzh", similar to comparable works from 
Japan, Korea, and other neighboring regions.
  The existence of ISO code Jpan is for the mixed use of Kana together of Kanji 
in Japanese text, which is still the common writing system for Japanese system 
nowadays.
  The existence of the ISO code Kore is for the mixed use of Hangul together 
with Hanja in Korean text. Although Hanja's role in Korean language have 
greatly diminished, it is still not unexpected to see Hanja in modern Korean 
text, hence the code "Kore" which represent Hanja+Hangul is still the default 
code for Korean language writing system, at least in South Korea.
  On the other hand, I do not think the mixed use of Han characters with other 
writing systems, say Latin alphabets, is an expected usage in Vietnam nowadays, 
hence I don't think it is necessary to apply for a new ISO 15924 code for such 
mixed use to reflect this.
  
  As for "characters common between Chinese characters as used in China vs 
characters that only exists in Chu nom", note that both Japanese and Korean 
languages also have some Han characters uniquely created by them for their 
countries, but they simply treat them as part of the Han characters in their 
language, in the same way as all other imported Han characters. And would be 
tagged with script code "Hani".
  
  But I do note that one thing that separate the Vietnamese Chu Nom from those 
unique characters from Japanese/Korean is that, there are large number of them, 
and they are formed according to some rule for many Vietnamese indigenous 
words. They can be treated as Han characters, and also followed typical ways of 
Han characters formation by combining meanings and sounds of characters, and 
thus "Hani" code is applicable. But I think it is also not impossible to apply 
for another ISO 15924 code, given how Traditional Chinese and Simplified 
Chinese which have much less different from each others still received their 
individual code. On the other hand however, the classification of Hans versus 
Hant is necessary in rendering text in two different writing systems that both 
are part of the Chinese language, but it is not really the case for Vietnamese 
when Chu Nho are coded lzh.
  
  As for situation of other ISO 15924 codes like Latg, I don't think they are 
comparable, as they represent different characters and different ways to write 
the language. I guess it would be more comparable to say Seal scripts should 
get their own ISO 15924 codes.

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, 
jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, 
karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani

2022-03-25 Thread C933103
C933103 added a comment.


  I have removed the Korean part of the ticket and focus on Vietnamese writing, 
due to problem of ambiguity of "Kore" script tag in ISO 15924, as mentioned in 
December 2020.

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, 
Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, 
Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani

2022-03-25 Thread C933103
C933103 renamed this task from "Add monolingual language code vi-hani, ko-kore" 
to "Add monolingual language code vi-hani".
C933103 updated the task description.

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, 
Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, 
Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, 
GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T215032: Add monolingual language code cmg

2022-02-19 Thread C933103
C933103 added a comment.


  So with the example from July 7 last year, why isn't this request proceeding?

TASK DETAIL
  https://phabricator.wikimedia.org/T215032

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Esc3300, Lydia_Pintscher, jhsoby, Amire80, Mbch331, Nikki, C933103, 
Liuxinyu970226, Aklapper, Popolon, karapayneWMDE, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T206426: Storing multiple sitelinks to a multilingual wiki

2022-01-16 Thread C933103
C933103 added a comment.


  Note that in addition to multilingual wiki, there are also wikiprojects like 
Wikipedia for Min Nan, where same article on same subject are being written in 
same language in multiple different article using multiple script, due to 
technical difficulty making automatic conversion between these different 
articles impossible.
  Currently, wikidata can only handle one site link to one script version of 
article on such sort of Wikipedia, and linking to another article through 
another wikidata QID entity, making article in the other script cannot be 
easily accessed through interlanguage link.

TASK DETAIL
  https://phabricator.wikimedia.org/T206426

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: C933103, Lectrician1, mxn, Nintendofan885, Koavf, Uzume, Sotiale, Xbspiro, 
Awangba_Mangang, Liuxinyu970226, Aklapper, Yurik, Invadibot, maantietaja, 
Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, 
_jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T215032: Add monolingual language code cmg

2021-07-02 Thread C933103
C933103 added a comment.


  In T215032#7193390 <https://phabricator.wikimedia.org/T215032#7193390>, 
@Esc3300 wrote:
  
  > If you use "mul", you wouldn't actually need "cmg".
  
  Are you trying to play catch 22? The data cannot be input into wikidata under 
the correct code unless it's approved and added here, but to approve a code and 
add it, it is necessary for the code to first be used inside wikidata?

TASK DETAIL
  https://phabricator.wikimedia.org/T215032

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Esc3300, Lydia_Pintscher, jhsoby, Amire80, Mbch331, Nikki, C933103, 
Liuxinyu970226, Aklapper, Popolon, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2021-07-02 Thread C933103
C933103 added a comment.


  In T180771#7193232 <https://phabricator.wikimedia.org/T180771#7193232>, 
@Esc3300 wrote:
  
  > The request isn't complete: it lacks samples.
  
  I added in comments above?

TASK DETAIL
  https://phabricator.wikimedia.org/T180771

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Mbch331, C933103
Cc: Esc3300, Addshore, Manuel, noarave, Mbch331, Nikki, Lydia_Pintscher, 
Nikerabbit, wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, 
Ladsgroup, Ab6399, Kizule, jhsoby, GerardM, Davidzdh, Yejianfei, 
Liuxinyu970226, Aklapper, C933103, Biggs657, Invadibot, Lalamarie69, 
maantietaja, Alter-paule, Beast1978, Un1tY, Akuckartz, Hook696, Iflorez, 
Kent7301, alaa_wmde, joker88john, CucyNoiD, Nandana, Gaboe420, lucamauri, 
Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, 
Mahir256, QZanden, LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, 
Scott_WUaS, Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2021-06-15 Thread C933103
C933103 added a comment.


  Autonym for nan-hani:
  
  In T180771#6932367 <https://phabricator.wikimedia.org/T180771#6932367>, 
@Amire80 wrote:
  
  > In T180771#6916776 <https://phabricator.wikimedia.org/T180771#6916776>, 
@Mbch331 wrote:
  >
  >> @Yejianfei There is no Langcom approval yet to add those languages.
  >
  > To clarify, the keyword here is "yet". I'm not against against cdo-hani in 
principle. I just wanted to make sure that when `nan-hani` is deployed, it 
works as expected. Is `nan-hani` now deployed? Does it work as expected? Can 
anyone give some examples?
  
  I have just added the nan-hani label to a few wikidata, according to either 
the hani version of article title on nan wikipedia, or hani lang template for 
title on latin character articles on the wikipedia. Examples include Q703914, 
Q127031, Q45190, Q660947, Q36778, Q2914034. I think it is working as expected.
  
  p.s. It seems like Nan wikipedia is trying to use either namespace or 
category to categorize articles written in Hani but none appears to be 
comprehensive, and due to problem in wikidata those articles are also 
undiscoverable from wikidata, making it hard to find them ...
  
  p.p.s. Should someone post about this on nan wikipedia Village pump?

TASK DETAIL
  https://phabricator.wikimedia.org/T180771

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Mbch331, C933103
Cc: Manuel, noarave, Mbch331, Nikki, Lydia_Pintscher, Nikerabbit, 
wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, Ladsgroup, 
Ab6399, Kizule, jhsoby, GerardM, Davidzdh, Yejianfei, Liuxinyu970226, Aklapper, 
C933103, Invadibot, Lalamarie69, maantietaja, Alter-paule, Beast1978, Un1tY, 
Akuckartz, Hook696, Iflorez, Kent7301, alaa_wmde, joker88john, CucyNoiD, 
Nandana, Gaboe420, lucamauri, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, 
Bsandipan, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, Lewizho99, 
Maathavan, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude
___
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org


[Wikidata-bugs] [Maniphest] T206426: Storing multiple sitelinks to a multilingual wiki

2021-02-24 Thread C933103
C933103 added a comment.


  While interlinking of "(Wikidata, Commons, OSM wiki)" using the translation 
plugin could also be a problem, they should be interlinked within each 
individual wiki already and thus I don't think they have the most pressing meed
  Instead a more problematic and oressing case that require multiple site link 
for multilingual wiki, is things like Wikipedia for Min Nan, where same article 
on same subject are being written in same language in multiple different 
article using multiple script, due to technical difficulty making automatic 
conversion between these different articles impossible.
  Currently, wikidata can only handle one site link to one script version of 
article on such sort of Wikipedia, making srticlr in the other

TASK DETAIL
  https://phabricator.wikimedia.org/T206426

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: C933103, Lectrician1, mxn, Nintendofan885, Koavf, Uzume, Sotiale, Xbspiro, 
Awangba_Mangang, Liuxinyu970226, Aklapper, Yurik, Akuckartz, Nandana, Lahi, 
Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, 
Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani, ko-kore

2020-12-23 Thread C933103
C933103 added a comment.


  Should the task be spilt to two?
  
  In T180345#6709988 <https://phabricator.wikimedia.org/T180345#6709988>, 
@Mahir256 wrote:
  
  > I'd just like to note that the Suppress-Script value for Korean according 
to the official subtag registry 
<https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry>
 is in fact Kore (meaning ko-Kore as a code is redundant in the eyes of a 
number of organizations).
  
  The thing with the "Kore" script tag is that, it indicate mixed use of Korean 
Hangul (Phonetic) and Hanja (Ideographic) characters, but it didn't indicate 
the ratio.
  Currently in regular Korean text, almost everything are being written in 
Hangul. But from time to time there can still be a few common shorthand Hanja 
characters in use in Korean text, in addition pf writing out Hanja as 
disambiguation in some situation. So I guess you can say it is a mixed script 
as well.
  On the other hand, what I originally have in mind was that, since many Korean 
term, especially proper noun, are originated or created based on Han characters 
and that they can be written as Hanja completely, it would not be possible to 
write Hanja for terms natively used in Korean or terms imported from Western 
languages in modern time. Such that, there are terms that can partially be 
written in Hanja but other parts need to be written as Hangul instead, like 
"Seoul Special City" or "Asiana Airlines", thus constitute another forn of 
mixed usage.
  I guess one can say it should use the "Hani" script tag instead since the 
intention is to show the Hanja characters of the terms, but then a problem is I 
don't think Hangul characters are to be expected for the Hani scripte value?

TASK DETAIL
  https://phabricator.wikimedia.org/T180345

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, 
Aklapper, revi, C933103, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T202602: Consider renaming language identifier (i.e. on the left of sitelinks of items) of Cantonese from "zh_yue" to "yue"

2020-05-25 Thread C933103
C933103 added a comment.


  Why "consider" instead of just requesting it this way?

TASK DETAIL
  https://phabricator.wikimedia.org/T202602

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: C933103
Cc: C933103, Addshore, daniel, Ladsgroup, deryckchan, Hello903hello, Aklapper, 
Liuxinyu970226, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, 
LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2020-02-23 Thread C933103
C933103 added a comment.


  In T180771#5627406 <https://phabricator.wikimedia.org/T180771#5627406>, 
@Davidzdh wrote:
  
  > 在T180771#4544573中,@C933103写道:
  >
  >> And then for hak... Can someone verify that "Hakka (Traditional Han 
script)" and "Hakka (Simplified Han Script)" are proper way to describe how 
Hakka speakers would write their language in Han scripts?
  >
  > Of course it is one of the correct way to write this language. Ministry of 
Education, ROC awards Literary Award of Taiwanese and Hakka (教育部閩客語文學獎, their 
website is https://www.edu.tw) every year. You can see hak-hant here.
  
  Hello, my question was NOT about whether it can be written in Chinese script 
(which I know it can), instead my question was that, whether there are 
meaningful differences between "Hakka with Simplified characters" and "Hakka 
with Traditional characters", as some previously mentioned that in certain 
other Chinese languages, characters that are currently used by the Simplified 
script have other function in the written version of that language, making it 
almost impossible to write the language using Simplified script and thus there 
are no need to distinguish Simplified - Traditional Chinese for that language. 
What I would like to know is whether Hakka also fit this situation being 
described.

TASK DETAIL
  https://phabricator.wikimedia.org/T180771

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Ab6399, C933103
Cc: Nikerabbit, wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, 
Amire80, Ladsgroup, Ab6399, Zoranzoki21, jhsoby, GerardM, Davidzdh, ztl8702, 
TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Beast1978, Un1tY, 
Chaytanya, Hook696, Daryl-TTMG, RomaAmorRoma, 0010318400, E.S.A-Sheild, 
darthmon_wmde, Meekrab2012, joker88john, Dinadineke, DannyS712, CucyNoiD, 
Nandana, NebulousIris, Kieubinhtb, Tks4Fish, Gaboe420, Mh-3110, Versusxo, 
Majesticalreaper22, Giuliamocci, tabish.shaikh91, Adrian1985, Asad_Ali_Palijo, 
Cpaulf30, Lahi, Gq86, Af420, Darkminds3113, Bsandipan, Lordiis, 
GoranSMilovanovic, Adik2382, Soteriaspace, Jayprakash12345, Th3d3v1ls, 
JakeTheDeveloper, Ramalepe, Liugev6, QZanden, merbst, LawExplorer, WSH1906, 
Lewizho99, Maathavan, _jensen, rosalieper, D3r1ck01, Scott_WUaS, 
MuhammadShuaib, Tmalhotra, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331
___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Changed Subscribers] T215032: Add monolingual language code cmg

2019-01-31 Thread C933103
C933103 added subscribers: Liuxinyu970226, C933103.C933103 added a comment.
Please consider the following email response given to @Liuxinyu970226 when they asked certain linguistic expert about their opinion on the matter: https://imgur.com/a/YT8bnzJ (I am not sure whether sufficient permissions have been obtained by the user for me to link the mail on the public internet but let's just look at it for now)TASK DETAILhttps://phabricator.wikimedia.org/T215032EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Liuxinyu970226, Aklapper, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2019-01-31 Thread C933103
C933103 added a comment.
Well, as mentioned, the code cmg previously suggested as possible alternative is actually not appropriate according to email exchanges you have conducted with professors that know more about these terminology. And given the email exchange also confirmed that the current ISO language codes for Mongolian languages doesn't really make much sense either, it would also be wrong to use individual language code for such purpose. So following the convention already used by others should be the most sensible way to represent such text string in the wiki. But then, if certain member of Langcom stand firm on their position and unwilling to change, then no amount of sensibility can force them to change.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Changed Subscribers] T137810: [Task] Add monolingual language code mn-Mong

2019-01-31 Thread C933103
C933103 added a subscriber: Liuxinyu970226.C933103 added a comment.Restricted Application removed a subscriber: Liuxinyu970226.

In T137810#4920450, @Liuxinyu970226 wrote:
@C933103 Because he said me "shame".


I am pretty sure you have misunderstood the English phrase being used. Please check the following link: https://www.macmillandictionary.com/dictionary/british/oh-what-a-shameTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Changed Subscribers] T137810: [Task] Add monolingual language code mn-Mong

2019-01-31 Thread C933103
C933103 added a subscriber: Liuxinyu970226.C933103 added a comment.Restricted Application removed a subscriber: Liuxinyu970226.

In T137810#4920436, @Liuxinyu970226 wrote:
@Badaa

ISO-639 is a total mess. What a shame!

https://www.mediawiki.org/wiki/Code_of_Conduct#Unacceptable_behavior states: 1. Personal attacks, violence, threats of violence, or deliberate intimidation.


It is inappropriate to see view point against a system as attack/threat/violence against individual person.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2018-10-04 Thread C933103
C933103 added a comment.

In T137810#4641351, @Popolon wrote:
Just a little tip, on firefox, go to about:config, then search for the browser.urlbar.decodeURLsOnCopy option and change it from false to true, this is essential for non pure-english characters  (aka any accent or other char from any other language using latin script or any other script, so the url) :

https://zh.wikipedia.org/wiki/%E8%92%99%E5%8F%A4%E8%AA%9E%E6%96%B9%E8%A8%80%E5%88%97%E8%A1%A8

Thas is unreadable (will be the same with mongolian name in both cyrillic/mongolian script) will become

https://zh.wikipedia.org/wiki/蒙古語方言列表

shorter and readable (at least if you know the script).


The previous comment said he/she cannot understand Chinese.


In T137810#4641252, @Badaa wrote:

In T137810#4638007, @Liuxinyu970226 wrote:
@C933103

however there are only one Classical Mongolian Script just like there are only one Modern Standard Arabic.

Huh? Phags'pa ≠ Mongolian?

@Badaa

Actually, the big misconception is that there doesn't exist many Mongolian languages but many scripts.

https://zh.wikipedia.org/wiki/%E8%92%99%E5%8F%A4%E8%AA%9E%E6%96%B9%E8%A8%80%E5%88%97%E8%A1%A8 I won't repeat the likely problem answers again, unless if your both are even concerning this zhwiki list article.


@Liuxinyu970226
 Is there any English or Mongolian version? I don't know Chinese.


That linked page is a list of Mongolian Dialects, which Liuxinyu970226 wanted to use it to shows there are many different dialects of Mongolian, including also Oirat and Buryat, to show that they cannot be merged into a single entity of "Mongolian" in term of language code, because each of these language variants have their own script [cyrillic, especially for those that are in Modern Russian territory] that write their language variant out directly, and that they are different from each other and thus cannot be merged

However that's very different from what we are talking about here when Oirat and Buryat aren't even considered as part of the Mongolian Macrolanguage in ISO 639

And then the list also have Kalmyk and Qaracin, however they don't even have their own ISO 639 language code from what I knowTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2018-10-04 Thread C933103
C933103 added a comment.
@Popolon I believe Monguor and all that do not/no longer use Mongolian Script in writing so that's not really relevant to the context.



@Liuxinyu970226 what about Phagspa?

Also I would remind you that in ISO 639, the macrolanguage Mongolian only cover khk/mvf but not othersTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2018-10-02 Thread C933103
C933103 added a comment.
@Popolon According to my understanding assuming they are correct understanding, using Arabic as analogy, what you propose would be like making different monolingual value for "Libyan Modern Standard Arabic", "Egyptian Modern Standard Arabic", "Tunisian Modern Standard Arabic". Yes, Libyan/Egyptian/Tunisian Arabic are all different and could be considered as different languages, however there are only one single literary standard here. Surely, there are different phonetic literary standard that more closely reflect individual languages, like the Cyrillic alphabet being used to spell different Mongolic languages, which would warrant the establishment of wiki in each of their individual languages, however there are only one Classical Mongolian Script just like there are only one Modern Standard Arabic. You can say mvf is closest to Classical Mongolian in the same way as Egyptian Arabic being closest to the standard of Modern Standard Arabic, however they are not equal.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2018-09-29 Thread C933103
C933103 added a comment.
@Liuxinyu970226 If the concern of ISO639's RA is "users of the codes understand that part 2 of the standard has a code that includes several coded languages in part 3.", then probably what can be done is ask for cancellation of the mvf code and khk code in the ISO639-3?

Also, another thing is that I have just discovered that there is an ISO639-3 language code cmg for "Classical Mongolian", would that actually be appropriate to be used to describe text written in Classical Mongolian language?TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2018-08-30 Thread C933103
C933103 added a comment.
Actually my original ticket could be a little clearer...
Like clarifying that the "example" there was meant to mean there are articles in cdo/nan/hak wikipedia that are written in alternative script and thus there should be related monolingual code that would allow recording of those article names in wikidata language field.
Thus I would like to bump the request for monolingual language code cdo-hani and nan-hani.
And then for hak... Can someone verify that "Hakka (Traditional Han script)" and "Hakka (Simplified Han Script)" are proper way to describe how Hakka speakers would write their language in Han scripts?TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: GerardM, Davidzdh, ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Kieubinhtb, tabish.shaikh91, Asad_Ali_Palijo, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, merbst, LawExplorer, D3r1ck01, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T145531: [feature request] allow calculated properties, derived properties

2018-08-20 Thread C933103
C933103 added a comment.
The wikidata property proposal https://www.wikidata.org/wiki/Wikidata:Property_proposal/coordinate_location_GCJ02 would depend on this property datatype.TASK DETAILhttps://phabricator.wikimedia.org/T145531EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Jarekt, Yair_rand, Liuxinyu970226, Aklapper, Esc3300, Lahi, Gq86, Darkminds3113, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, Jonas, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T63958: Use existing $dateFormats to format dates on Wikidata

2018-08-20 Thread C933103
C933103 added a comment.
If there's no way to fix the internationalized format now then please change the format into ISO date format as a temporary fix. There's currently no way for me to tell which day a date value actually represent without trying to edit it and see the calendar pop up.TASK DETAILhttps://phabricator.wikimedia.org/T63958EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: thiemowmde, C933103Cc: C933103, Capankajsmilyo, jeblad, PokestarFan, revi, Phreelance, KTC, Samat, deryckchan, Nikki, Snaevar, Agabi10, gerritbot, thiemowmde, RP88, Aklapper, Mbch331, Ricordisamoa, Liuxinyu970226, Wikidata-bugs, Addshore, He7d3r, PiRSquared17, Lydia_Pintscher, Stryn, Gaboe420, Versusxo, Majesticalreaper22, Giuliamocci, Adrian1985, Cpaulf30, Lahi, Gq86, Baloch007, Darkminds3113, Bsandipan, Lordiis, GoranSMilovanovic, Adik2382, Jayprakash12345, Th3d3v1ls, Ramalepe, Liugev6, QZanden, LawExplorer, Lewizho99, Maathavan, Srdjan_m, MuhammadShuaib, LNDDYL, Psychoslave, aude, Gryllida, Shizhao, Arrbee, Jay8g___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2017-12-10 Thread C933103
C933103 added a comment.
Remove Japanese Kyujitai request as might be using variant subtag instead of script subtag could be a better idea? Although there are also problems in using variant subtagTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant

2017-12-10 Thread C933103
C933103 renamed this task from "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q18165189

The language code: ja-Hant
Language name in the language itself or English: Japanese (Kyūjitai)
The used script, if not obvious: Jpan + Hant
Where and when the language was or is used: Japan, untiln Mid-20th century
The Wikidata item id: Q1147857

Usage example: Use for wikidata items like Q865

Note: I am not too sure about the code for zh-Nshu and ja-Hant, because:
- Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana
But others should have little problems.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant

2017-12-10 Thread C933103
C933103 added a comment.
Remove Nushu as use case related to the language and script can be covered by using monolingual code mis due to the lack of language code for TuhuaTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant

2017-12-10 Thread C933103
C933103 renamed this task from "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q18165189

The language code: zh-Nshu
Language name in the language itself or English: Xiangnan Tuhua (Nüshu)
The used script, if not obvious: Nshu
Where and when the language was or is used: around Jiangyong County, Hunan, during imperial era
The Wikidata item id: Q56436 (wikidata item id for the language: Q17031342)

The language code: ja-Hant...Note: I am not too sure about the code for zh-Nshu and ja-Hant, because:
- Xiangnan Tuhua have no ISO 639-3 code, despite being part of the macrolanguage group "zh"/"zho"
- Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-12-08 Thread C933103
C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q56436 (wikidata item id for the language: Q17031342)

The language code: ja-Hant...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-12-08 Thread C933103
C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...Language name in the language itself or English: ShaozhouXiangnan Tuhua (Nüshu)...Note: I am not too sure about the code for zh-Nshu and ja-Hant, because:
- It does not signify the particular varian- Xiangnan Tuhua have no ISO 639-3 code, despite being part of chinese being used to write in Nushuthe macrolanguage group "zh"/"zho"
- Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2017-12-05 Thread C933103
C933103 added a comment.

In T137810#3811581, @Liuxinyu970226 wrote:

In T137810#3771454, @C933103 wrote:
What is the rationale of macrolanguage being not usable to identify text?


If you could make a language conversion system between two writting system, Okay, otherwise my aganist to mn-Mong is still valid.


Inner Mongolian University has developed a sentence-based conversion engine (As in the engine require context from other words in the sentence to determine which candidate word should be used for transliteration) and it was claimed in another essay which I linked on somewhere else that the engine has up to 85% correct rate (Although 85% correct rate still mean about one error per every six words on average). The conversion engine was linked from the English WIkipedia's article for Mongolian Script's external link section.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-27 Thread C933103
C933103 added a comment.
hum edited task description accordinglyTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-27 Thread C933103
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTIONPlease add the following language code nan-Hants to the list of language codes supported for monolingual text values.

The language code: nan-Hant-TW
Language name in the language itself or English: Min Nan (Hanji, Taiwan)
The used script, if not obvious: Hant
Where and when the language was or is used: Taiwan, modern era
The Wikidata item id: Q15901848

The language code: nan-Hans-CNi
Language name in the language itself or English: Min Nan (Hanji, China)
The used script, if not obvious: Hansi
Where and when the language was or is used: mainland ChinaMinnan-speaking area, modern era...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-26 Thread C933103
C933103 added a comment.
Almost all the Hani text being discussed and used related to the nan.wp project now are Hant. Disregarding Hans for now and use Hani instead of Hant would probably do the job in the current setting but what about when mainland China Hans users start visiting and editing the site?TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-24 Thread C933103
C933103 added a comment.

In T180771#3785764, @Liuxinyu970226 wrote:
Why requesting separated BCP47 codes for Min nan? Why don't we just split the TWN Min nan translations instead (so we can also make Min nan labels separated).


TWN min nan? as in nan-TW? According to my understanding, there are also POJ users in Taiwan and Hani users in mainland China so that doesn't seem to solve the situation entirely, although my understanding is not necessarily complete.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-23 Thread C933103
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The language code: cdo-Hans
Language name in the language itself or English: Min Dong (Chinese character, Simplified)
The used script, if not obvious: Hans
Where and when the language was or is used: mainland China, modern era (mainly used under the name of dialect of individual specific regions, eg Fuzhou)
The Wikidata item id: Q5365165

The language code: cdo-Hanti
Language name in the language itself or English: Min Dong (Chinese character, Traditionals)
The used script, if not obvious: Hanti
Where and when the language was or is used: mainly for people on and from Matsu islandsMin-Dong-speaking people. Modern eraTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-23 Thread C933103
C933103 added a comment.
See also https://translatewiki.net/wiki/Thread:Support/Request_for_enabling_new_variants_for_Hakka_language_(2)TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant

2017-11-23 Thread C933103
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant" to "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The language code: cdo-Hans-CN
Language name in the language itself or English: Min Dong (Chinese character, ChinaSimplified)...The language code: cdo-Hant-TW
Language name in the language itself or English: Min Dong (Chinese character, TaiwanTraditional)...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant

2017-11-23 Thread C933103
C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...The language code: hak-Hans-CN
Language name in the language itself or English: Hakka (Chinese character, ChinaSimplified)...The language code: hak-Hant-TW
Language name in the language itself or English: Hakka (Chinese character, TaiwanTraditional)...Where and when the language was or is used: Taiwan, Hong Kong, etc., modern era...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2017-11-17 Thread C933103
C933103 added a comment.
What is the rationale of macrolanguage being not usable to identify text?TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T144272: new monolingual language code requests for Wikidata (tracking)

2017-11-17 Thread C933103
C933103 added a comment.
Are you implying that those monolingual language code I'm submitting does not represent anything useful? nan/cdo/hak-Hant/hans are language-script combinations being used to write wikipedia articles, and vi-hani, ko-kore, ja-Kyujitai are used to name people and things in respective countries. How do you write the name of "Ho Chi Minh City" in Vietnamese Han nom? The only place providing this info in wikidata for now is in the Japanese alias for the entry name. How about "Kim Jong-Il" in ko-Kore? Look at the Slovak alias. Is it better than having labels for each of these script variants?TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2017-11-17 Thread C933103
C933103 added a comment.
Then mon, mon is ISO 639-3TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Block] T144272: new monolingual language code requests for Wikidata (tracking)

2017-11-16 Thread C933103
C933103 created subtask T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant.
TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Created] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant

2017-11-16 Thread C933103
C933103 created this task.C933103 added a project: Wikidata.Herald added a subscriber: Aklapper.
TASK DESCRIPTIONPlease add the language code nan-Hant to the list of language codes supported for monolingual text values.

The language code: nan-Hant-TW
Language name in the language itself or English: Min Nan (Hanji, Taiwan)
The used script, if not obvious: Hant
Where and when the language was or is used: Taiwan, modern era
The Wikidata item id: Q15901848

The language code: nan-Hans-CN
Language name in the language itself or English: Min Nan (Hanji, China)
The used script, if not obvious: Hans
Where and when the language was or is used: mainland China, modern era
The Wikidata item id: Q15901848

The language code: cdo-Hans-CN
Language name in the language itself or English: Min Dong (Chinese character, China)
The used script, if not obvious: Hans
Where and when the language was or is used: mainland China, modern era (mainly used under the name of dialect of individual specific regions, eg Fuzhou)
The Wikidata item id: Q5365165

The language code: cdo-Hant-TW
Language name in the language itself or English: Min Dong (Chinese character, Taiwan)
The used script, if not obvious: Hant
Where and when the language was or is used: mainly for people on and from Matsu islands. Modern era.
The Wikidata item id: Q5365165

The language code: hak-Hans-CN
Language name in the language itself or English: Hakka (Chinese character, China)
The used script, if not obvious: Hans
Where and when the language was or is used: mainland China, modern era
The Wikidata item id: Q22827960

The language code: hak-Hant-TW
Language name in the language itself or English: Hakka (Chinese character, Taiwan)
The used script, if not obvious: Hant
Where and when the language was or is used: Taiwan, modern era
The Wikidata item id: Q18165189

The language code: zh-Nshu
Language name in the language itself or English: Shaozhou Tuhua (Nüshu)
The used script, if not obvious: Nshu
Where and when the language was or is used: around Jiangyong County, Hunan, during imperial era
The Wikidata item id: Q56436

The language code: ja-Hant
Language name in the language itself or English: Japanese (Kyūjitai)
The used script, if not obvious: Jpan + Hant
Where and when the language was or is used: Japan, untiln Mid-20th century
The Wikidata item id: Q1147857

Usage example: Use for wikidata items like Q865

Note: I am not too sure about the code for zh-Nshu and ja-Hant, because:


It does not signify the particular variant of chinese being used to write in Nushu
Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana


But others should have little problems.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Retitled] T180345: Add monolingual language code vi-hani, ko-kore

2017-11-13 Thread C933103
C933103 renamed this task from "Add monolingual language code vi-hani, ko-hani" to "Add monolingual language code vi-hani, ko-kore".C933103 updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION...*The language code: ko-HaniKore
*Language name in the language itself or English: English: Korean Hanja Mixed Script
*The used script, if not obvious: Hang + Hani mixed script (the proposed code is only Hani as there doesn't seems to be a code for mixed script in ISO 15924 database yet unlike Japanese)Kore
*Where and when the language was or is used: Korea before WWII...TASK DETAILhttps://phabricator.wikimedia.org/T180345EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, revi, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Created] T180345: Add monolingual language code vi-hani, ko-hani

2017-11-13 Thread C933103
C933103 created this task.C933103 added a project: Wikidata.Herald added subscribers: revi, Aklapper.
TASK DESCRIPTIONPlease add the language code vi-hani and ko-hani to the list of language codes supported for monolingual text values.

*The language code: vi-Hani
*Language name in the language itself or English: English: Vietnamese Chu Nom
*The used script, if not obvious: Hani
*Where and when the language was or is used: Mainly in Vietnam before 20th century
*The Wikidata item id: Q875344

*The language code: ko-Hani
*Language name in the language itself or English: English: Korean Hanja Mixed Script
*The used script, if not obvious: Hang + Hani mixed script (the proposed code is only Hani as there doesn't seems to be a code for mixed script in ISO 15924 database yet unlike Japanese)
*Where and when the language was or is used: Korea before WWII
*The Wikidata item id: Q711797

Usage example: I was going to add entry for both of them in wikidata item Q1 but get rejected.

Note that according to https://www.wikidata.org/wiki/Help:Monolingual_text_languages , although the policy was not finished yet, it was stated that a language code does not have to fulfill requirement of the language proposal policy for new wikis, and in general a code would be acceptable as long as the language code is valid.

By the way the application form for new monolingual code on this template is greatly different from the form on wikidata. Which of these fields are actually needed?TASK DETAILhttps://phabricator.wikimedia.org/T180345EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, revi, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Block] T144272: new monolingual language code requests for Wikidata (tracking)

2017-11-13 Thread C933103
C933103 created subtask T180345: Add monolingual language code vi-hani, ko-hani.
TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2017-11-11 Thread C933103
C933103 added a comment.
mvf only refer to Mongolian spoken in Central part of Inner Mongolia while mn-Mong is written by all mn users.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T162331: Provide tools for processing obfuscated Chinese geodata (GCJ-02, BD-09)

2017-04-06 Thread C933103
C933103 added a comment.
According to some pages I have read from google, it seems like in the US only the compilation of data is protected while data itself are not and the creation of databasr also need to have some creativity in order to make the database fulfil copyright law, and in the EU there is an extra protection of investment being put to collect, arrange and present data. So it seems like it should not have problem under the US law in most cases although it might be better to let a legal expert to answer the question ..TASK DETAILhttps://phabricator.wikimedia.org/T162331EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Pnorman, C933103, Aklapper, Arthur2e5, Cosine02, Liuxinyu970226, QZanden, Salgo60, D3r1ck01, Izno, Cwek, Wikidata-bugs, aude, zhuyifei1999, Shizhao, MaxSem, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T54971: Sitelinks to Incubator, OldWikisource and BetaWikiversity

2017-03-02 Thread C933103
C933103 added a comment.
Is it within the scope of this task that ordinary wikipedia with multiple page for every single concept written in multiple script cannot be linked to same wikidata concept entry?TASK DETAILhttps://phabricator.wikimedia.org/T54971EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Samwilson, Esc3300, srishakatux, C933103, Stashbot, hoo, aude, JanZerebecki, TTO, Liuxinyu970226, Accurimbono, Aklapper, Ricordisamoa, Purodha, liangent, Wikidata-bugs, Vogone, Candalua, SPQRobin, mxn, Filceolaire, jayvdb, Micru, revi, Billinghurst, Lydia_Pintscher, MF-Warburg, zhuyifei1999, Tpt, JohnLewis, D3r1ck01, Izno, Mbch331, Jay8g___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2016-10-08 Thread C933103
C933103 added a comment.
btw would it be a good idea to apply for a literary mongolian code like literary chinese if the situation about literary mongolian is so similar to literary chinese? But it seems like it would at least take at least a year for the ISO 639 RA to make any change to iso language code? And I am not familiar with Mongolian enough to say is this really a good method to handle the thing let alone submitting a proposal.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2016-10-05 Thread C933103
C933103 added a comment.
@GerardM but traditional mongolian script is like literary chinese, which is universal to every languages that were using it as their written form and thus it is invalid to say which language they belong to. Just like you can say Nihon Shoki is written in Chinese but you can't say it is written with Mandarin or Hakka. The situation with traditional Mongolian script is the same. And also, it would be incorrect [despite being a convention] to call those Mongolian text middle/classical Mongolian language just like you can't equate literary chinese to old/middle chinese, as there are still some changes being made to the written language that set the old language at that time apart from the written form continually being used.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong

2016-10-04 Thread C933103
C933103 added a comment.
@GerardM 
0. According to the "Requirements for a new language code" linked above, the WIP requirement for a new language code is a valid IETF tag not a valid ISO code


Macrolanguages in ISO 639-3 are still individual languages in ISO 639-2, and definition of macrolanguage in ISO 639-3 is "clusters of closely-related language varieties that [...] can be considered distinct individual languages, yet in certain usage contexts a single language identity for all is needed".  and thus macrolanguages should be treated as an lanuage with valid language code. And mn is a valid code and is currently used by Mongolian wikipedia, which also contain several articles written in traditional Mongolian script.
See BCP 47 section 2.1.1 for details about uppercasing. https://tools.ietf.org/html/bcp47
both khk, mvf, bua and xal can be written with Latn, Cyrl and Mong.
mn-Mong is not only used for mvf.
BCP 47 also stated that macrolanguage code can still be used instead of code for encompassed languge
you can see mn_Mong_CN is a likely subtag in http://unicode.org/repos/cldr-tmp/trunk/diff/supplemental/likely_subtags.html
You can see mn-Mong listed in IANA language subtag registry http://www.iana.org/assignments/language-subtag-registry/language-subtag-registry (listed as redundant as it have the correct form and format defined by RFC4646 and all the subtag it used are defined in the document. See RFC4645 for detail.)
TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs