[Wikidata-bugs] [Maniphest] T162331: Provide tools for processing obfuscated Chinese geodata (GCJ-02, BD-09)
C933103 reopened this task as "Open". C933103 added a comment. In T162331#8121512 <https://phabricator.wikimedia.org/T162331#8121512>, @MPhamWMF wrote: > Closing out low/est priority tasks over 6 months old with no activity within last 6 months in order to clean out the backlog of tickets we will not be addressing in the near term. Please feel free to reopen if you think a ticket is important, but bare in mind that given current priorities and resourcing, it is unlikely for the Search team to pick up these tasks for the indefinite future. We hope that the requested changes have either been addressed by or made irrelevant by work the team has done or is doing -- e.g. upgrading Elasticsearch to a newer version will solve various ES-related problems -- or will be subsumed by future work in a more generalized way. This belongs to search team? TASK DETAIL https://phabricator.wikimedia.org/T162331 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: MPhamWMF, Ericliu1912, Esc3300, Nikki, PokestarFan, Pnorman, C933103, Aklapper, Arthur2e5, Stang, Liuxinyu970226, mrephabricator, Astuthiodit_1, karapayneWMDE, Jonathan5566, Invadibot, LaMagiaaa, Func, LuciferianThomas, RuiyuShen, maantietaja, CBogen, ItamarWMDE, Akuckartz, Alilje, Joye_Zhang, VulpesVulpes825, SunAfterRain, 94rain, Nandana, Hamishcn, KirkLU, Lahi, Gq86, BJ6123C7BTD, Looniverse, GoranSMilovanovic, lisong, Sturm, Allthingsgo, QZanden, EBjune, Orienteerix, LawExplorer, _jensen, rosalieper, Htchien, JGirault, Taiwania_Justo, Scott_WUaS, phabyogi, Susannaanas, lxbarth, Fuzheado, Planemad, Cwek, Wikidata-bugs, aude, zhuyifei1999, Shizhao, awight, Yurik, TheDJ, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T236593: Cannot enter multiple forms for the same language variant
C933103 added a comment. In T236593#8092471 <https://phabricator.wikimedia.org/T236593#8092471>, @LucasWerkmeister wrote: > It’s still not clear to me which problem the `-x-Q123-1` patch is trying to solve. Several languages have been mentioned in this task, but which of them would benefit from this system? I feel like for several of them, we’ve already reached the conclusion that separate forms are in fact the way to go. > > I’d like to extract a general rule from @Fnielsen’s comment above (T236593#5610903 <https://phabricator.wikimedia.org/T236593#5610903>): if you need separate statements, then you need separate forms or lexemes. (I think this is a sufficient condition, though it might not be a necessary one.) Pronunciation (whether pronunciation audio <https://www.wikidata.org/wiki/Property:P443> or IPA transcription <https://www.wikidata.org/wiki/Property:P898>) is probably the most significant kind of statement here: if a speaker would pronounce the spellings differently, then they should be different forms – regardless of whether the difference is a completely different ending as in octopuses/octopi, or just an extra schwa as in aft(e)nen. (I don’t find the hyphenation example as convincing… don’t you need a different hyphenation for every spelling variant, even for cases that really should just be multiple representations of one form? E.g. co‧lor/co‧lour – that could just be multiple statements on the same form, with different monolingual text language codes.) > > I suspect this rule covers the Norwegian example that originally motivated this task: I feel like “parametere” and “parametre” are probably pronounced differently, much like “aftnen” and “aftenen” are pronounced differently in Danish according to Finn. For Vietnamese chữ Nôm, I feel like @mxn’s comment at T236593#8024999 <https://phabricator.wikimedia.org/T236593#8024999> goes in a similar direction, though I admit I find the whole Chinese-characters part of this discussion hard to follow. > > For the cases where you really only want to have one form with multiple representations, I still agree with @daniel’s comment (T236593#5610378 <https://phabricator.wikimedia.org/T236593#5610378>): “you make up a code for each of the spellings”. In practice, the only way to “make up a code” that we currently support is to append -x-Q//12345// to an existing, established language code. As far as I understand, this solution works well for Hebrew: e.g. ספר/סֵפֶר (L67105) <https://www.wikidata.org/wiki/Lexeme:L67105> (the “book” word) uses the language codes `he` and `he-x-Q21283070`, where Q21283070 <https://www.wikidata.org/wiki/Q21283070> represents Tiberian vocalization, the orthography with diacritics. At some point, an editorial decision was made that the spelling without diacritics “deserves” the unsuffixed `he` language code (instead of both spellings using an -x-Q//12345// language code), which I think is reasonable: data reusers who don’t care about the different spellings can use the most standard language code (`he`) and its single representation per form. > > Allowing people to append an integer number to the item ID adds a second way to make up a code, and one that seems less useful to me: without knowing what the number means, how do I know which form representation to use? To me this runs counter to the goal of “allow[ing] the consumer to choose which variant they prefer”. For the languages that appear to need multiple representations for the same language code per form (e.g. the Indian languages @Mahir256 mentioned in T236593#5608530 <https://phabricator.wikimedia.org/T236593#5608530>?), is it not possible to make the item ID approach work, by creating more special-purpose items? Wikidata editors would then make a decision which of the possible spellings “deserves” the standard language code, and which additional items need to be created (“spelling with character X”, “spelling with sequence Y”?). I understand that not all languages have standardized spellings where you can use a single item ID to refer to the spelling variants of a wide range of lexemes (like in Hebrew), but I think it should still be possible to describe different spellings using items that carry more meaning than just a number. As an English example, some religious people might refuse to write the name "God" out directly as it is as this would constitute idolatry. For this we can tag it as en-x-Q for which Q refer to religious group of people, but there are more than one alternative way to write "God". They can either write "G-d", "G*d", "G_d", "G-o-d", and so on. It would make no contextual differences in whether a hyphen or a underscore is being used, and the change in which exact symbol being used in place of original alphabet woul
[Wikidata-bugs] [Maniphest] T236593: Cannot enter multiple forms for the same language variant
C933103 added a comment. In T236593#5610378 <https://phabricator.wikimedia.org/T236593#5610378>, @daniel wrote: > I recall that we had long discussions about this when initially deciding on the data model. In technical terms, the question was whether we would allow only a single literal value for a spelling variant, or a list or set of words. Allowing a list or set would enable the kind of flexibility @jhsoby is asking for. But the down side is that it introduces ambiguity when listing forms (you would always have to list all of them, in undefined order), and when generating text (which one should you use)? > > If I recall correctly, we decided that we want to give the consumer of the data maximum control over which variant they prefer, by forcing the producer to provide different variant codes for all different spellings. We had discussions about how to encode this in the variant (language) codes, and how to represent it in the UI, but decided to leave that for later. > > So, the solution that we envisioned when originally discussing this about four years ago was: you make up a code for each of the spellings, in a way that allows the consumer to choose which variant they prefer. If that is done by encoding a region or a rhyme or a tradition or school or whatever will depend on the language. If it's a stylistic choice, name the style. > > The same approach can be used for historical spellings. codes could look something like de-x-hist-nd-15jh or something (this code is totally made up and probably linguistically nonsense). The underlying assumption behind this decision is that, different spelling forms must be associated with certain variant, or that there are some of the spelling being preferred over other spellings, or that some spelling is more commonly used for some spoken variant/sociolet/etc than others and is other spelling. None of these are correct assumption, when it come to non-Chinese languages that use Chinese characters, or even some Chinese languages that need to apply Chinese characters. Example of Vietnamese chu nom have already been presented above. Other examples includes Japanese ateji when Kanji are used for Japanese native words except cases where there have been full established transliteration, and its Korean equivalent in history, as well as in languages like Cantonese when non-Mandarin words need to be expressed in Chinese characters. TASK DETAIL https://phabricator.wikimedia.org/T236593 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: C933103, AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Bodhisattwa, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani
C933103 added a comment. In T180345#7916540 <https://phabricator.wikimedia.org/T180345#7916540>, @GerardM wrote: > Hoi, > We can only support scripts supported in Unicode. As mentioned in previous posts, chu nom are already supported and added into Unicode through CJKV unified ideographic characters extensions. > So what is the font to be > used? Please see this list of fonts: https://en.m.wikipedia.org/wiki/Template:Vi-nom/fonts.css > Thanks, > > GerardM TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani
C933103 added a comment. In T180345#7916094 <https://phabricator.wikimedia.org/T180345#7916094>, @mxn wrote: > Ken Lunde (2009) <https://books.google.com/books?id=SA92uQqTB-AC&pg=PA570> writes that //chữ Nôm// had a dedicated script code of `Cu` in ISO 15924:2004 (which has since been superseded by ISO 15924:2022). Does anyone have more information about these two-letter codes or why //chữ Nôm// didn’t get a four-letter code corresponding to this two-letter code? That table is same as the table in Japanese version of the book, published in year 2002. <https://books.google.com/books?id=U36IQGjmfqMC&pg=PA413> So it would be before the official publication of the ISO 15924. Given Unicode maintain a list of changes to ISO 15924 since the standard's official publication in 2004, and the list didn't include Chu nom, <https://unicode.org/iso15924/codechanges.html> I would assume the table in the book reflect content in the standard's draft that didn't make it to the ultimate list. TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani
C933103 added a comment. "Hani" simply mean "Chinese[Han] characters". "vi-Hani" mean "Vietnamese, written in Chinese[Han] characters". Chu Nho, despite widely used in Vietnam in ancient time, are written according to Classical Chinese grammar, and as such should classify as Classical Chinese text, with code "lzh", similar to comparable works from Japan, Korea, and other neighboring regions. The existence of ISO code Jpan is for the mixed use of Kana together of Kanji in Japanese text, which is still the common writing system for Japanese system nowadays. The existence of the ISO code Kore is for the mixed use of Hangul together with Hanja in Korean text. Although Hanja's role in Korean language have greatly diminished, it is still not unexpected to see Hanja in modern Korean text, hence the code "Kore" which represent Hanja+Hangul is still the default code for Korean language writing system, at least in South Korea. On the other hand, I do not think the mixed use of Han characters with other writing systems, say Latin alphabets, is an expected usage in Vietnam nowadays, hence I don't think it is necessary to apply for a new ISO 15924 code for such mixed use to reflect this. As for "characters common between Chinese characters as used in China vs characters that only exists in Chu nom", note that both Japanese and Korean languages also have some Han characters uniquely created by them for their countries, but they simply treat them as part of the Han characters in their language, in the same way as all other imported Han characters. And would be tagged with script code "Hani". But I do note that one thing that separate the Vietnamese Chu Nom from those unique characters from Japanese/Korean is that, there are large number of them, and they are formed according to some rule for many Vietnamese indigenous words. They can be treated as Han characters, and also followed typical ways of Han characters formation by combining meanings and sounds of characters, and thus "Hani" code is applicable. But I think it is also not impossible to apply for another ISO 15924 code, given how Traditional Chinese and Simplified Chinese which have much less different from each others still received their individual code. On the other hand however, the classification of Hans versus Hant is necessary in rendering text in two different writing systems that both are part of the Chinese language, but it is not really the case for Vietnamese when Chu Nho are coded lzh. As for situation of other ISO 15924 codes like Latg, I don't think they are comparable, as they represent different characters and different ways to write the language. I guess it would be more comparable to say Seal scripts should get their own ISO 15924 codes. TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Yellowtailshark, Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani
C933103 added a comment. I have removed the Korean part of the ticket and focus on Vietnamese writing, due to problem of ambiguity of "Kore" script tag in ISO 15924, as mentioned in December 2020. TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani
C933103 renamed this task from "Add monolingual language code vi-hani, ko-kore" to "Add monolingual language code vi-hani". C933103 updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Popolon, Esc3300, Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T215032: Add monolingual language code cmg
C933103 added a comment. So with the example from July 7 last year, why isn't this request proceeding? TASK DETAIL https://phabricator.wikimedia.org/T215032 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Esc3300, Lydia_Pintscher, jhsoby, Amire80, Mbch331, Nikki, C933103, Liuxinyu970226, Aklapper, Popolon, karapayneWMDE, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T206426: Storing multiple sitelinks to a multilingual wiki
C933103 added a comment. Note that in addition to multilingual wiki, there are also wikiprojects like Wikipedia for Min Nan, where same article on same subject are being written in same language in multiple different article using multiple script, due to technical difficulty making automatic conversion between these different articles impossible. Currently, wikidata can only handle one site link to one script version of article on such sort of Wikipedia, and linking to another article through another wikidata QID entity, making article in the other script cannot be easily accessed through interlanguage link. TASK DETAIL https://phabricator.wikimedia.org/T206426 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: C933103, Lectrician1, mxn, Nintendofan885, Koavf, Uzume, Sotiale, Xbspiro, Awangba_Mangang, Liuxinyu970226, Aklapper, Yurik, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T215032: Add monolingual language code cmg
C933103 added a comment. In T215032#7193390 <https://phabricator.wikimedia.org/T215032#7193390>, @Esc3300 wrote: > If you use "mul", you wouldn't actually need "cmg". Are you trying to play catch 22? The data cannot be input into wikidata under the correct code unless it's approved and added here, but to approve a code and add it, it is necessary for the code to first be used inside wikidata? TASK DETAIL https://phabricator.wikimedia.org/T215032 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Esc3300, Lydia_Pintscher, jhsoby, Amire80, Mbch331, Nikki, C933103, Liuxinyu970226, Aklapper, Popolon, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 added a comment. In T180771#7193232 <https://phabricator.wikimedia.org/T180771#7193232>, @Esc3300 wrote: > The request isn't complete: it lacks samples. I added in comments above? TASK DETAIL https://phabricator.wikimedia.org/T180771 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Mbch331, C933103 Cc: Esc3300, Addshore, Manuel, noarave, Mbch331, Nikki, Lydia_Pintscher, Nikerabbit, wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, Ladsgroup, Ab6399, Kizule, jhsoby, GerardM, Davidzdh, Yejianfei, Liuxinyu970226, Aklapper, C933103, Biggs657, Invadibot, Lalamarie69, maantietaja, Alter-paule, Beast1978, Un1tY, Akuckartz, Hook696, Iflorez, Kent7301, alaa_wmde, joker88john, CucyNoiD, Nandana, Gaboe420, lucamauri, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 added a comment. Autonym for nan-hani: In T180771#6932367 <https://phabricator.wikimedia.org/T180771#6932367>, @Amire80 wrote: > In T180771#6916776 <https://phabricator.wikimedia.org/T180771#6916776>, @Mbch331 wrote: > >> @Yejianfei There is no Langcom approval yet to add those languages. > > To clarify, the keyword here is "yet". I'm not against against cdo-hani in principle. I just wanted to make sure that when `nan-hani` is deployed, it works as expected. Is `nan-hani` now deployed? Does it work as expected? Can anyone give some examples? I have just added the nan-hani label to a few wikidata, according to either the hani version of article title on nan wikipedia, or hani lang template for title on latin character articles on the wikipedia. Examples include Q703914, Q127031, Q45190, Q660947, Q36778, Q2914034. I think it is working as expected. p.s. It seems like Nan wikipedia is trying to use either namespace or category to categorize articles written in Hani but none appears to be comprehensive, and due to problem in wikidata those articles are also undiscoverable from wikidata, making it hard to find them ... p.p.s. Should someone post about this on nan wikipedia Village pump? TASK DETAIL https://phabricator.wikimedia.org/T180771 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Mbch331, C933103 Cc: Manuel, noarave, Mbch331, Nikki, Lydia_Pintscher, Nikerabbit, wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, Ladsgroup, Ab6399, Kizule, jhsoby, GerardM, Davidzdh, Yejianfei, Liuxinyu970226, Aklapper, C933103, Invadibot, Lalamarie69, maantietaja, Alter-paule, Beast1978, Un1tY, Akuckartz, Hook696, Iflorez, Kent7301, alaa_wmde, joker88john, CucyNoiD, Nandana, Gaboe420, lucamauri, Giuliamocci, Cpaulf30, Lahi, Gq86, Af420, Bsandipan, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, Lewizho99, Maathavan, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T206426: Storing multiple sitelinks to a multilingual wiki
C933103 added a comment. While interlinking of "(Wikidata, Commons, OSM wiki)" using the translation plugin could also be a problem, they should be interlinked within each individual wiki already and thus I don't think they have the most pressing meed Instead a more problematic and oressing case that require multiple site link for multilingual wiki, is things like Wikipedia for Min Nan, where same article on same subject are being written in same language in multiple different article using multiple script, due to technical difficulty making automatic conversion between these different articles impossible. Currently, wikidata can only handle one site link to one script version of article on such sort of Wikipedia, making srticlr in the other TASK DETAIL https://phabricator.wikimedia.org/T206426 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: C933103, Lectrician1, mxn, Nintendofan885, Koavf, Uzume, Sotiale, Xbspiro, Awangba_Mangang, Liuxinyu970226, Aklapper, Yurik, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] T180345: Add monolingual language code vi-hani, ko-kore
C933103 added a comment. Should the task be spilt to two? In T180345#6709988 <https://phabricator.wikimedia.org/T180345#6709988>, @Mahir256 wrote: > I'd just like to note that the Suppress-Script value for Korean according to the official subtag registry <https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry> is in fact Kore (meaning ko-Kore as a code is redundant in the eyes of a number of organizations). The thing with the "Kore" script tag is that, it indicate mixed use of Korean Hangul (Phonetic) and Hanja (Ideographic) characters, but it didn't indicate the ratio. Currently in regular Korean text, almost everything are being written in Hangul. But from time to time there can still be a few common shorthand Hanja characters in use in Korean text, in addition pf writing out Hanja as disambiguation in some situation. So I guess you can say it is a mixed script as well. On the other hand, what I originally have in mind was that, since many Korean term, especially proper noun, are originated or created based on Han characters and that they can be written as Hanja completely, it would not be possible to write Hanja for terms natively used in Korean or terms imported from Western languages in modern time. Such that, there are terms that can partially be written in Hanja but other parts need to be written as Hangul instead, like "Seoul Special City" or "Asiana Airlines", thus constitute another forn of mixed usage. I guess one can say it should use the "Hani" script tag instead since the intention is to show the Hanja characters of the terms, but then a problem is I don't think Hangul characters are to be expected for the Hani scripte value? TASK DETAIL https://phabricator.wikimedia.org/T180345 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: Nikki, Mahir256, Mbch331, Amire80, jhsoby, GerardM, mxn, Liuxinyu970226, Aklapper, revi, C933103, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T202602: Consider renaming language identifier (i.e. on the left of sitelinks of items) of Cantonese from "zh_yue" to "yue"
C933103 added a comment. Why "consider" instead of just requesting it this way? TASK DETAIL https://phabricator.wikimedia.org/T202602 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: C933103 Cc: C933103, Addshore, daniel, Ladsgroup, deryckchan, Hello903hello, Aklapper, Liuxinyu970226, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 added a comment. In T180771#5627406 <https://phabricator.wikimedia.org/T180771#5627406>, @Davidzdh wrote: > 在T180771#4544573中,@C933103写道: > >> And then for hak... Can someone verify that "Hakka (Traditional Han script)" and "Hakka (Simplified Han Script)" are proper way to describe how Hakka speakers would write their language in Han scripts? > > Of course it is one of the correct way to write this language. Ministry of Education, ROC awards Literary Award of Taiwanese and Hakka (教育部閩客語文學獎, their website is https://www.edu.tw) every year. You can see hak-hant here. Hello, my question was NOT about whether it can be written in Chinese script (which I know it can), instead my question was that, whether there are meaningful differences between "Hakka with Simplified characters" and "Hakka with Traditional characters", as some previously mentioned that in certain other Chinese languages, characters that are currently used by the Simplified script have other function in the written version of that language, making it almost impossible to write the language using Simplified script and thus there are no need to distinguish Simplified - Traditional Chinese for that language. What I would like to know is whether Hakka also fit this situation being described. TASK DETAIL https://phabricator.wikimedia.org/T180771 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Ab6399, C933103 Cc: Nikerabbit, wikibugs-l-list, Nemo_bis, siebrand, liangent, bzimport, Amire80, Ladsgroup, Ab6399, Zoranzoki21, jhsoby, GerardM, Davidzdh, ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Beast1978, Un1tY, Chaytanya, Hook696, Daryl-TTMG, RomaAmorRoma, 0010318400, E.S.A-Sheild, darthmon_wmde, Meekrab2012, joker88john, Dinadineke, DannyS712, CucyNoiD, Nandana, NebulousIris, Kieubinhtb, Tks4Fish, Gaboe420, Mh-3110, Versusxo, Majesticalreaper22, Giuliamocci, tabish.shaikh91, Adrian1985, Asad_Ali_Palijo, Cpaulf30, Lahi, Gq86, Af420, Darkminds3113, Bsandipan, Lordiis, GoranSMilovanovic, Adik2382, Soteriaspace, Jayprakash12345, Th3d3v1ls, JakeTheDeveloper, Ramalepe, Liugev6, QZanden, merbst, LawExplorer, WSH1906, Lewizho99, Maathavan, _jensen, rosalieper, D3r1ck01, Scott_WUaS, MuhammadShuaib, Tmalhotra, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331 ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T215032: Add monolingual language code cmg
C933103 added subscribers: Liuxinyu970226, C933103.C933103 added a comment. Please consider the following email response given to @Liuxinyu970226 when they asked certain linguistic expert about their opinion on the matter: https://imgur.com/a/YT8bnzJ (I am not sure whether sufficient permissions have been obtained by the user for me to link the mail on the public internet but let's just look at it for now)TASK DETAILhttps://phabricator.wikimedia.org/T215032EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Liuxinyu970226, Aklapper, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. Well, as mentioned, the code cmg previously suggested as possible alternative is actually not appropriate according to email exchanges you have conducted with professors that know more about these terminology. And given the email exchange also confirmed that the current ISO language codes for Mongolian languages doesn't really make much sense either, it would also be wrong to use individual language code for such purpose. So following the convention already used by others should be the most sensible way to represent such text string in the wiki. But then, if certain member of Langcom stand firm on their position and unwilling to change, then no amount of sensibility can force them to change.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a subscriber: Liuxinyu970226.C933103 added a comment.Restricted Application removed a subscriber: Liuxinyu970226. In T137810#4920450, @Liuxinyu970226 wrote: @C933103 Because he said me "shame". I am pretty sure you have misunderstood the English phrase being used. Please check the following link: https://www.macmillandictionary.com/dictionary/british/oh-what-a-shameTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Changed Subscribers] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a subscriber: Liuxinyu970226.C933103 added a comment.Restricted Application removed a subscriber: Liuxinyu970226. In T137810#4920436, @Liuxinyu970226 wrote: @Badaa ISO-639 is a total mess. What a shame! https://www.mediawiki.org/wiki/Code_of_Conduct#Unacceptable_behavior states: 1. Personal attacks, violence, threats of violence, or deliberate intimidation. It is inappropriate to see view point against a system as attack/threat/violence against individual person.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. In T137810#4641351, @Popolon wrote: Just a little tip, on firefox, go to about:config, then search for the browser.urlbar.decodeURLsOnCopy option and change it from false to true, this is essential for non pure-english characters (aka any accent or other char from any other language using latin script or any other script, so the url) : https://zh.wikipedia.org/wiki/%E8%92%99%E5%8F%A4%E8%AA%9E%E6%96%B9%E8%A8%80%E5%88%97%E8%A1%A8 Thas is unreadable (will be the same with mongolian name in both cyrillic/mongolian script) will become https://zh.wikipedia.org/wiki/蒙古語方言列表 shorter and readable (at least if you know the script). The previous comment said he/she cannot understand Chinese. In T137810#4641252, @Badaa wrote: In T137810#4638007, @Liuxinyu970226 wrote: @C933103 however there are only one Classical Mongolian Script just like there are only one Modern Standard Arabic. Huh? Phags'pa ≠ Mongolian? @Badaa Actually, the big misconception is that there doesn't exist many Mongolian languages but many scripts. https://zh.wikipedia.org/wiki/%E8%92%99%E5%8F%A4%E8%AA%9E%E6%96%B9%E8%A8%80%E5%88%97%E8%A1%A8 I won't repeat the likely problem answers again, unless if your both are even concerning this zhwiki list article. @Liuxinyu970226 Is there any English or Mongolian version? I don't know Chinese. That linked page is a list of Mongolian Dialects, which Liuxinyu970226 wanted to use it to shows there are many different dialects of Mongolian, including also Oirat and Buryat, to show that they cannot be merged into a single entity of "Mongolian" in term of language code, because each of these language variants have their own script [cyrillic, especially for those that are in Modern Russian territory] that write their language variant out directly, and that they are different from each other and thus cannot be merged However that's very different from what we are talking about here when Oirat and Buryat aren't even considered as part of the Mongolian Macrolanguage in ISO 639 And then the list also have Kalmyk and Qaracin, however they don't even have their own ISO 639 language code from what I knowTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. @Popolon I believe Monguor and all that do not/no longer use Mongolian Script in writing so that's not really relevant to the context. @Liuxinyu970226 what about Phagspa? Also I would remind you that in ISO 639, the macrolanguage Mongolian only cover khk/mvf but not othersTASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. @Popolon According to my understanding assuming they are correct understanding, using Arabic as analogy, what you propose would be like making different monolingual value for "Libyan Modern Standard Arabic", "Egyptian Modern Standard Arabic", "Tunisian Modern Standard Arabic". Yes, Libyan/Egyptian/Tunisian Arabic are all different and could be considered as different languages, however there are only one single literary standard here. Surely, there are different phonetic literary standard that more closely reflect individual languages, like the Cyrillic alphabet being used to spell different Mongolic languages, which would warrant the establishment of wiki in each of their individual languages, however there are only one Classical Mongolian Script just like there are only one Modern Standard Arabic. You can say mvf is closest to Classical Mongolian in the same way as Egyptian Arabic being closest to the standard of Modern Standard Arabic, however they are not equal.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. @Liuxinyu970226 If the concern of ISO639's RA is "users of the codes understand that part 2 of the standard has a code that includes several coded languages in part 3.", then probably what can be done is ask for cancellation of the mvf code and khk code in the ISO639-3? Also, another thing is that I have just discovered that there is an ISO639-3 language code cmg for "Classical Mongolian", would that actually be appropriate to be used to describe text written in Classical Mongolian language?TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Badaa, ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 added a comment. Actually my original ticket could be a little clearer... Like clarifying that the "example" there was meant to mean there are articles in cdo/nan/hak wikipedia that are written in alternative script and thus there should be related monolingual code that would allow recording of those article names in wikidata language field. Thus I would like to bump the request for monolingual language code cdo-hani and nan-hani. And then for hak... Can someone verify that "Hakka (Traditional Han script)" and "Hakka (Simplified Han Script)" are proper way to describe how Hakka speakers would write their language in Han scripts?TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: GerardM, Davidzdh, ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Kieubinhtb, tabish.shaikh91, Asad_Ali_Palijo, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, merbst, LawExplorer, D3r1ck01, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T145531: [feature request] allow calculated properties, derived properties
C933103 added a comment. The wikidata property proposal https://www.wikidata.org/wiki/Wikidata:Property_proposal/coordinate_location_GCJ02 would depend on this property datatype.TASK DETAILhttps://phabricator.wikimedia.org/T145531EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Jarekt, Yair_rand, Liuxinyu970226, Aklapper, Esc3300, Lahi, Gq86, Darkminds3113, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, Avner, Gehel, Jonas, FloNight, Xmlizer, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T63958: Use existing $dateFormats to format dates on Wikidata
C933103 added a comment. If there's no way to fix the internationalized format now then please change the format into ISO date format as a temporary fix. There's currently no way for me to tell which day a date value actually represent without trying to edit it and see the calendar pop up.TASK DETAILhttps://phabricator.wikimedia.org/T63958EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: thiemowmde, C933103Cc: C933103, Capankajsmilyo, jeblad, PokestarFan, revi, Phreelance, KTC, Samat, deryckchan, Nikki, Snaevar, Agabi10, gerritbot, thiemowmde, RP88, Aklapper, Mbch331, Ricordisamoa, Liuxinyu970226, Wikidata-bugs, Addshore, He7d3r, PiRSquared17, Lydia_Pintscher, Stryn, Gaboe420, Versusxo, Majesticalreaper22, Giuliamocci, Adrian1985, Cpaulf30, Lahi, Gq86, Baloch007, Darkminds3113, Bsandipan, Lordiis, GoranSMilovanovic, Adik2382, Jayprakash12345, Th3d3v1ls, Ramalepe, Liugev6, QZanden, LawExplorer, Lewizho99, Maathavan, Srdjan_m, MuhammadShuaib, LNDDYL, Psychoslave, aude, Gryllida, Shizhao, Arrbee, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 added a comment. Remove Japanese Kyujitai request as might be using variant subtag instead of script subtag could be a better idea? Although there are also problems in using variant subtagTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant
C933103 renamed this task from "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q18165189 The language code: ja-Hant Language name in the language itself or English: Japanese (Kyūjitai) The used script, if not obvious: Jpan + Hant Where and when the language was or is used: Japan, untiln Mid-20th century The Wikidata item id: Q1147857 Usage example: Use for wikidata items like Q865 Note: I am not too sure about the code for zh-Nshu and ja-Hant, because: - Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana But others should have little problems.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant
C933103 added a comment. Remove Nushu as use case related to the language and script can be covered by using monolingual code mis due to the lack of language code for TuhuaTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant
C933103 renamed this task from "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, ja-hant".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q18165189 The language code: zh-Nshu Language name in the language itself or English: Xiangnan Tuhua (Nüshu) The used script, if not obvious: Nshu Where and when the language was or is used: around Jiangyong County, Hunan, during imperial era The Wikidata item id: Q56436 (wikidata item id for the language: Q17031342) The language code: ja-Hant...Note: I am not too sure about the code for zh-Nshu and ja-Hant, because: - Xiangnan Tuhua have no ISO 639-3 code, despite being part of the macrolanguage group "zh"/"zho" - Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The Wikidata item id: Q56436 (wikidata item id for the language: Q17031342) The language code: ja-Hant...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...Language name in the language itself or English: ShaozhouXiangnan Tuhua (Nüshu)...Note: I am not too sure about the code for zh-Nshu and ja-Hant, because: - It does not signify the particular varian- Xiangnan Tuhua have no ISO 639-3 code, despite being part of chinese being used to write in Nushuthe macrolanguage group "zh"/"zho" - Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ztl8702, TerraCodes, Yejianfei, Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, Soteriaspace, RazeSoldier, Jayprakash12345, JakeTheDeveloper, QZanden, Zoranzoki21, MuhammadShuaib, SimmeD, Wikidata-bugs, aude, TheDJ, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. In T137810#3811581, @Liuxinyu970226 wrote: In T137810#3771454, @C933103 wrote: What is the rationale of macrolanguage being not usable to identify text? If you could make a language conversion system between two writting system, Okay, otherwise my aganist to mn-Mong is still valid. Inner Mongolian University has developed a sentence-based conversion engine (As in the engine require context from other words in the sentence to determine which candidate word should be used for transliteration) and it was claimed in another essay which I linked on somewhere else that the engine has up to 85% correct rate (Although 85% correct rate still mean about one error per every six words on average). The conversion engine was linked from the English WIkipedia's article for Mongolian Script's external link section.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 added a comment. hum edited task description accordinglyTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hani, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTIONPlease add the following language code nan-Hants to the list of language codes supported for monolingual text values. The language code: nan-Hant-TW Language name in the language itself or English: Min Nan (Hanji, Taiwan) The used script, if not obvious: Hant Where and when the language was or is used: Taiwan, modern era The Wikidata item id: Q15901848 The language code: nan-Hans-CNi Language name in the language itself or English: Min Nan (Hanji, China) The used script, if not obvious: Hansi Where and when the language was or is used: mainland ChinaMinnan-speaking area, modern era...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 added a comment. Almost all the Hani text being discussed and used related to the nan.wp project now are Hant. Disregarding Hans for now and use Hani instead of Hant would probably do the job in the current setting but what about when mainland China Hans users start visiting and editing the site?TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 added a comment. In T180771#3785764, @Liuxinyu970226 wrote: Why requesting separated BCP47 codes for Min nan? Why don't we just split the TWN Min nan translations instead (so we can also make Min nan labels separated). TWN min nan? as in nan-TW? According to my understanding, there are also POJ users in Taiwan and Hani users in mainland China so that doesn't seem to solve the situation entirely, although my understanding is not necessarily complete.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Liuxinyu970226, Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant" to "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hani, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The language code: cdo-Hans Language name in the language itself or English: Min Dong (Chinese character, Simplified) The used script, if not obvious: Hans Where and when the language was or is used: mainland China, modern era (mainly used under the name of dialect of individual specific regions, eg Fuzhou) The Wikidata item id: Q5365165 The language code: cdo-Hanti Language name in the language itself or English: Min Dong (Chinese character, Traditionals) The used script, if not obvious: Hanti Where and when the language was or is used: mainly for people on and from Matsu islandsMin-Dong-speaking people. Modern eraTASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 added a comment. See also https://translatewiki.net/wiki/Thread:Support/Request_for_enabling_new_variants_for_Hakka_language_(2)TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant
C933103 renamed this task from "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant" to "Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans, cdo-hant, hak-hans, hak-hant, zh-nshu, ja-hant".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The language code: cdo-Hans-CN Language name in the language itself or English: Min Dong (Chinese character, ChinaSimplified)...The language code: cdo-Hant-TW Language name in the language itself or English: Min Dong (Chinese character, TaiwanTraditional)...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Edited] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant
C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...The language code: hak-Hans-CN Language name in the language itself or English: Hakka (Chinese character, ChinaSimplified)...The language code: hak-Hant-TW Language name in the language itself or English: Hakka (Chinese character, TaiwanTraditional)...Where and when the language was or is used: Taiwan, Hong Kong, etc., modern era...TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, Gq86, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. What is the rationale of macrolanguage being not usable to identify text?TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T144272: new monolingual language code requests for Wikidata (tracking)
C933103 added a comment. Are you implying that those monolingual language code I'm submitting does not represent anything useful? nan/cdo/hak-Hant/hans are language-script combinations being used to write wikipedia articles, and vi-hani, ko-kore, ja-Kyujitai are used to name people and things in respective countries. How do you write the name of "Ho Chi Minh City" in Vietnamese Han nom? The only place providing this info in wikidata for now is in the Japanese alias for the entry name. How about "Kim Jong-Il" in ko-Kore? Look at the Slovak alias. Is it better than having labels for each of these script variants?TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. Then mon, mon is ISO 639-3TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Block] T144272: new monolingual language code requests for Wikidata (tracking)
C933103 created subtask T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant. TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Created] T180771: Add monolingual language code nan-hant-tw, nan-hans-cn, cdo-hans-cn, cdo-hant-tw, hak-hans-cn, hak-hant-tw, zh-nshu, ja-hant
C933103 created this task.C933103 added a project: Wikidata.Herald added a subscriber: Aklapper. TASK DESCRIPTIONPlease add the language code nan-Hant to the list of language codes supported for monolingual text values. The language code: nan-Hant-TW Language name in the language itself or English: Min Nan (Hanji, Taiwan) The used script, if not obvious: Hant Where and when the language was or is used: Taiwan, modern era The Wikidata item id: Q15901848 The language code: nan-Hans-CN Language name in the language itself or English: Min Nan (Hanji, China) The used script, if not obvious: Hans Where and when the language was or is used: mainland China, modern era The Wikidata item id: Q15901848 The language code: cdo-Hans-CN Language name in the language itself or English: Min Dong (Chinese character, China) The used script, if not obvious: Hans Where and when the language was or is used: mainland China, modern era (mainly used under the name of dialect of individual specific regions, eg Fuzhou) The Wikidata item id: Q5365165 The language code: cdo-Hant-TW Language name in the language itself or English: Min Dong (Chinese character, Taiwan) The used script, if not obvious: Hant Where and when the language was or is used: mainly for people on and from Matsu islands. Modern era. The Wikidata item id: Q5365165 The language code: hak-Hans-CN Language name in the language itself or English: Hakka (Chinese character, China) The used script, if not obvious: Hans Where and when the language was or is used: mainland China, modern era The Wikidata item id: Q22827960 The language code: hak-Hant-TW Language name in the language itself or English: Hakka (Chinese character, Taiwan) The used script, if not obvious: Hant Where and when the language was or is used: Taiwan, modern era The Wikidata item id: Q18165189 The language code: zh-Nshu Language name in the language itself or English: Shaozhou Tuhua (Nüshu) The used script, if not obvious: Nshu Where and when the language was or is used: around Jiangyong County, Hunan, during imperial era The Wikidata item id: Q56436 The language code: ja-Hant Language name in the language itself or English: Japanese (Kyūjitai) The used script, if not obvious: Jpan + Hant Where and when the language was or is used: Japan, untiln Mid-20th century The Wikidata item id: Q1147857 Usage example: Use for wikidata items like Q865 Note: I am not too sure about the code for zh-Nshu and ja-Hant, because: It does not signify the particular variant of chinese being used to write in Nushu Hant is not commonly used to represent Japanese text written in Kyujitai, and Japanese text written in Kyujitai will also contain kana But others should have little problems.TASK DETAILhttps://phabricator.wikimedia.org/T180771EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Retitled] T180345: Add monolingual language code vi-hani, ko-kore
C933103 renamed this task from "Add monolingual language code vi-hani, ko-hani" to "Add monolingual language code vi-hani, ko-kore".C933103 updated the task description. (Show Details) CHANGES TO TASK DESCRIPTION...*The language code: ko-HaniKore *Language name in the language itself or English: English: Korean Hanja Mixed Script *The used script, if not obvious: Hang + Hani mixed script (the proposed code is only Hani as there doesn't seems to be a code for mixed script in ISO 15924 database yet unlike Japanese)Kore *Where and when the language was or is used: Korea before WWII...TASK DETAILhttps://phabricator.wikimedia.org/T180345EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, revi, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Created] T180345: Add monolingual language code vi-hani, ko-hani
C933103 created this task.C933103 added a project: Wikidata.Herald added subscribers: revi, Aklapper. TASK DESCRIPTIONPlease add the language code vi-hani and ko-hani to the list of language codes supported for monolingual text values. *The language code: vi-Hani *Language name in the language itself or English: English: Vietnamese Chu Nom *The used script, if not obvious: Hani *Where and when the language was or is used: Mainly in Vietnam before 20th century *The Wikidata item id: Q875344 *The language code: ko-Hani *Language name in the language itself or English: English: Korean Hanja Mixed Script *The used script, if not obvious: Hang + Hani mixed script (the proposed code is only Hani as there doesn't seems to be a code for mixed script in ISO 15924 database yet unlike Japanese) *Where and when the language was or is used: Korea before WWII *The Wikidata item id: Q711797 Usage example: I was going to add entry for both of them in wikidata item Q1 but get rejected. Note that according to https://www.wikidata.org/wiki/Help:Monolingual_text_languages , although the policy was not finished yet, it was stated that a language code does not have to fulfill requirement of the language proposal policy for new wikis, and in general a code would be acceptable as long as the language code is valid. By the way the application form for new monolingual code on this template is greatly different from the form on wikidata. Which of these fields are actually needed?TASK DETAILhttps://phabricator.wikimedia.org/T180345EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Aklapper, revi, C933103, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Block] T144272: new monolingual language code requests for Wikidata (tracking)
C933103 created subtask T180345: Add monolingual language code vi-hani, ko-hani. TASK DETAILhttps://phabricator.wikimedia.org/T144272EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Nikerabbit, Mbch331, Lydia_Pintscher, Zoranzoki21, Nikki, Amire80, jhsoby, Liuxinyu970226, GerardM, thiemowmde, Aklapper, Esc3300, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. mvf only refer to Mongolian spoken in Central part of Inner Mongolia while mn-Mong is written by all mn users.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: ChristianKl, C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, Lahi, GoranSMilovanovic, QZanden, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T162331: Provide tools for processing obfuscated Chinese geodata (GCJ-02, BD-09)
C933103 added a comment. According to some pages I have read from google, it seems like in the US only the compilation of data is protected while data itself are not and the creation of databasr also need to have some creativity in order to make the database fulfil copyright law, and in the EU there is an extra protection of investment being put to collect, arrange and present data. So it seems like it should not have problem under the US law in most cases although it might be better to let a legal expert to answer the question ..TASK DETAILhttps://phabricator.wikimedia.org/T162331EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Pnorman, C933103, Aklapper, Arthur2e5, Cosine02, Liuxinyu970226, QZanden, Salgo60, D3r1ck01, Izno, Cwek, Wikidata-bugs, aude, zhuyifei1999, Shizhao, MaxSem, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T54971: Sitelinks to Incubator, OldWikisource and BetaWikiversity
C933103 added a comment. Is it within the scope of this task that ordinary wikipedia with multiple page for every single concept written in multiple script cannot be linked to same wikidata concept entry?TASK DETAILhttps://phabricator.wikimedia.org/T54971EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: Samwilson, Esc3300, srishakatux, C933103, Stashbot, hoo, aude, JanZerebecki, TTO, Liuxinyu970226, Accurimbono, Aklapper, Ricordisamoa, Purodha, liangent, Wikidata-bugs, Vogone, Candalua, SPQRobin, mxn, Filceolaire, jayvdb, Micru, revi, Billinghurst, Lydia_Pintscher, MF-Warburg, zhuyifei1999, Tpt, JohnLewis, D3r1ck01, Izno, Mbch331, Jay8g___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. btw would it be a good idea to apply for a literary mongolian code like literary chinese if the situation about literary mongolian is so similar to literary chinese? But it seems like it would at least take at least a year for the ISO 639 RA to make any change to iso language code? And I am not familiar with Mongolian enough to say is this really a good method to handle the thing let alone submitting a proposal.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. @GerardM but traditional mongolian script is like literary chinese, which is universal to every languages that were using it as their written form and thus it is invalid to say which language they belong to. Just like you can say Nihon Shoki is written in Chinese but you can't say it is written with Mandarin or Hakka. The situation with traditional Mongolian script is the same. And also, it would be incorrect [despite being a convention] to call those Mongolian text middle/classical Mongolian language just like you can't equate literary chinese to old/middle chinese, as there are still some changes being made to the written language that set the old language at that time apart from the written form continually being used.TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Commented On] T137810: [Task] Add monolingual language code mn-Mong
C933103 added a comment. @GerardM 0. According to the "Requirements for a new language code" linked above, the WIP requirement for a new language code is a valid IETF tag not a valid ISO code Macrolanguages in ISO 639-3 are still individual languages in ISO 639-2, and definition of macrolanguage in ISO 639-3 is "clusters of closely-related language varieties that [...] can be considered distinct individual languages, yet in certain usage contexts a single language identity for all is needed". and thus macrolanguages should be treated as an lanuage with valid language code. And mn is a valid code and is currently used by Mongolian wikipedia, which also contain several articles written in traditional Mongolian script. See BCP 47 section 2.1.1 for details about uppercasing. https://tools.ietf.org/html/bcp47 both khk, mvf, bua and xal can be written with Latn, Cyrl and Mong. mn-Mong is not only used for mvf. BCP 47 also stated that macrolanguage code can still be used instead of code for encompassed languge you can see mn_Mong_CN is a likely subtag in http://unicode.org/repos/cldr-tmp/trunk/diff/supplemental/likely_subtags.html You can see mn-Mong listed in IANA language subtag registry http://www.iana.org/assignments/language-subtag-registry/language-subtag-registry (listed as redundant as it have the correct form and format defined by RFC4646 and all the subtag it used are defined in the document. See RFC4645 for detail.) TASK DETAILhttps://phabricator.wikimedia.org/T137810EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: C933103Cc: C933103, jhsoby, thiemowmde, Liuxinyu970226, Lydia_Pintscher, GerardM, Aklapper, Zppix, Popolon, D3r1ck01, Izno, Wikidata-bugs, aude, Mbch331___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs