[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Lydia_Pintscher added a project: Wikidata-Sprint-2015-08-18. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Lydia_Pintscher Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, JanZerebecki, Malyacko ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
JanZerebecki added a project: Wikidata-Sprint-2015-08-11. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, JanZerebecki Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, JanZerebecki, Malyacko ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Tobi_WMDE_SW set Story Points to 1. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Tobi_WMDE_SW Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
JanZerebecki added a project: Wikidata-Sprint-2015-06-30. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, JanZerebecki Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Fomafix added a comment. Please do not quote and comment parts of the sentence without the context. I created https://phabricator.wikimedia.org/T106367 for some special not BCP 47 conform language codes. Please add more codes there if you find some more. I implemented this in `wfBCP47()` (https://gerrit.wikimedia.org/r/226040). So this would also solve the problem in the RDF export for this languages when `wfBCP47()` is used. Please comment the patch. Maybe there is a better position for such a mapping. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Fomafix Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Fomafix added a comment. In https://phabricator.wikimedia.org/T105430#1466282, @Smalyshev wrote: This language code should be used for the sitelinks. In HTML and in the RDF export That would lead to the situation where links to Simple English wiki and to English wiki are indistinguishable. Which is not good. The sitelinks are distinguishable by the URL (https://simple.wikipedia.org/) and by the siteid (`simplewiki`). Show me a place where this is not enough. If you want to change this to en-x-simple then create a separate task. This is that task. `en` is also a standard language code. If you which to change //this// code to an other value this is a separate task. Describe your use cases for a separate language code for the Simple projects in https://phabricator.wikimedia.org/T27591. Such a change should be done consistently in all places, not only in the RDF export. This must be changed everywhere where a HTML attribute lang is generated. That has no relation to RDF export and thus outside of the scope of this task. When you change the language codes at the right position the RDF export gets automatically the correct language codes. When https://phabricator.wikimedia.org/T43723 is fixed most of your patch for the RDF export is superfluous. When it would be fixed, we can consider revisiting this code and if the fix allows to remove the special cases then they will be removed. However, since that ticket seems to be open since 2012, I'd rather fix the RDF export now (which otherwise will be confusing for third party users - the main audience of the export) than wait for https://phabricator.wikimedia.org/T43723. I think it is bad programming style to make several workarounds instead of fixing the core problem. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Fomafix Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Fomafix added a subscriber: Fomafix. Fomafix added a comment. Converting the MediaWiki internal language codes to BCP 47 conform language codes is a valid task. Especially for the RDF and the HTML output. This should be done in all MediaWiki projects and extensions, not only in Wikidata. Here some notes to some special languages - //simple// is primary a separate additional Wikipedia project in language English. It is not a separate user interface language, because `uselang=simple` gets converted to `wgUserLanguage` = `en`. simple https://simple.wikipedia.org/ has `wgContentLanguage` = `en`. Wikidata has currently no content with the language code `simple` and as far as I know this is unwanted. Introducing a new language code `en-x-simple` may be possible, when there is an use case and a community consensus. - //de-formal// and //nl-informal// are primary user interface languages. It describes a variant of the languages used in user communication and normally not in the content. It may be used in content for content generated user interface texts like usage descriptions. Switching the user interface language to these codes should not impact the using/editing the content. This is requested in https://phabricator.wikimedia.org/T51024. Changing this codes to //de-x-formal// and //nl-x-informal// may be possible when this is necessary to be conform to BCP 47. Wikidata delivers currently not the language code of the `wgContentLanguage` of the projects. I described this in https://phabricator.wikimedia.org/T59706. This is still not fixed. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Fomafix Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Fomafix added a comment. Not only the RDF export must have BCP 47 conform language codes, the HTML attribute lang also must have a BCP 47 conform language code. Simple https://simple.wikipedia.org/ already uses the correct language code ´en´ for the content (`wgContentLanguage`). This language code should be used for the sitelinks. In HTML and in the RDF export. When https://phabricator.wikimedia.org/T43723 is fixed the language code for sitelinks to simple is `en`. If you want to change this to `en-x-simple` then create a separate task. When `de-formal` and `nl-informal` are not conform to BCP 47 this should be changed. But not only for the RDF export. This must be changed everywhere where a HTML attribute lang is generated. `de-formal` and `nl-informal` are only used in the user interface. They are not wanted as separate language for the label, description and alias in Wikidata. Therefore these language code should never occur in a RDF export. When https://phabricator.wikimedia.org/T43723 is fixed your patch for the RDF export is superfluous. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, Fomafix Cc: Fomafix, gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
Smalyshev added a comment. See also: https://phabricator.wikimedia.org/T44396 TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev Cc: Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
[Wikidata-bugs] [Maniphest] [Updated] T105430: Ensure that language tags generated in RDF output are standard language names
gerritbot added a project: Patch-For-Review. TASK DETAIL https://phabricator.wikimedia.org/T105430 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Smalyshev, gerritbot Cc: gerritbot, Smalyshev, Aklapper, daniel, mkroetzsch, jkroll, Wikidata-bugs, Jdouglas, aude, Manybubbles, JanZerebecki, Malyacko, P.Copp ___ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs