Re: [Wikidata] Language codes 'mul' and 'mis' not recognized
Hello Le lun. 22 juin 2020 à 18:19, Mbch331 a écrit : > Monolingual codes aren't available for labels/aliases, only for properties > of the type monolingual text. > Indeed, thank youu. What confused me is that I cannot enter such a monolingual text value with language code "mul" or "mis" from the GUI, for a statement like "title". Thomas > Mbch331 > Op 22-6-2020 om 15:52 schreef Thomas Francart: > > Hello > > Le lun. 22 juin 2020 à 15:33, Federico Leva (Nemo) a > écrit : > >> Thomas Francart, 22/06/20 16:21: >> > I also cannot use these 2 codes when editing through the human >> interface. >> >> Language code validation on Wikidata is sometimes confusing. See also: >> https://phabricator.wikimedia.org/T39459 >> >> > Thanks, I read the page but am not sure to understand how it relates to > the original question. Can you decode ? Does it explain why creating > labels/aliases with code 'mul' or 'mis' fails using the API ? > > >> Why "mis" instead of "und"? See also >> https://phabricator.wikimedia.org/T230833#6103004 >> >> > Sorry, I wrote "when the language value is unknown", but this was not > precise : the original data we want to synch already contains data using > code "mis" (as a valid ISO639-2 iso code, see > https://www.loc.gov/standards/iso639-2/php/code_list.php), and as I read > it was supported by Wikidata, I attempted to use that code directly through > the API, with no luck. Similarly, original data uses code 'mul' (also a > valid ISO639-2 code). > > Thomas > > >> Federico >> > > > -- > > *Thomas Francart* -* SPARNA* > Web de *données* | Architecture de l'*information* | Accès aux > *connaissances* > blog : blog.sparna.fr, site : sparna.fr, linkedin : > fr.linkedin.com/in/thomasfrancart > tel : +33 (0)6.71.11.25.97, skype : francartthomas > > ___ > Wikidata mailing > listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata > > > > <https://www.avast.com/sig-email?utm_medium=email_source=link_campaign=sig-email_content=emailclient> > Virusvrij. > www.avast.com > <https://www.avast.com/sig-email?utm_medium=email_source=link_campaign=sig-email_content=emailclient> > <#m_637642425612966372_DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2> > ___ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata > -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] Language codes 'mul' and 'mis' not recognized
Hello Le lun. 22 juin 2020 à 15:33, Federico Leva (Nemo) a écrit : > Thomas Francart, 22/06/20 16:21: > > I also cannot use these 2 codes when editing through the human interface. > > Language code validation on Wikidata is sometimes confusing. See also: > https://phabricator.wikimedia.org/T39459 > > Thanks, I read the page but am not sure to understand how it relates to the original question. Can you decode ? Does it explain why creating labels/aliases with code 'mul' or 'mis' fails using the API ? > Why "mis" instead of "und"? See also > https://phabricator.wikimedia.org/T230833#6103004 > > Sorry, I wrote "when the language value is unknown", but this was not precise : the original data we want to synch already contains data using code "mis" (as a valid ISO639-2 iso code, see https://www.loc.gov/standards/iso639-2/php/code_list.php), and as I read it was supported by Wikidata, I attempted to use that code directly through the API, with no luck. Similarly, original data uses code 'mul' (also a valid ISO639-2 code). Thomas > Federico > -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
[Wikidata] Language codes 'mul' and 'mis' not recognized
Hello (Please direct me somewhere else is this is not the place to ask.) I am developing a bot to synch ISSN data [1] using WDTK. Creating a label/alias with the language code 'mul' (to indicate a multilingual value), or 'mis' (to indicate the language is unknown) does not work and returns an Exception from the wikimedia API ("[not-recognized-language] The supplied language code was not recognized"). These 2 codes are however documented at https://www.wikidata.org/wiki/Help:Monolingual_text_languages and listed at https://www.wikidata.org/wiki/Help:Wikimedia_language_codes/lists/all I also cannot use these 2 codes when editing through the human interface. I do however find items in Wikidata with 'mul' language codes. Am I doing something wrong, did I misunderstand something, or is there a true problem with these codes ? Also opened ticket in WDTK issue tracker : https://github.com/Wikidata/Wikidata-Toolkit/issues/509 Thanks Thomas [1] : ISSN Bot : https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Periodicals#Data_donation_from_ISSN_Register_(Followup) -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
[Wikidata] Data donation from ISSN Internation Center - feedback welcome
Hello The ISSN International Center, which maintains the ISSN Register (*the most complete reference source for the identification of serial publications*) at http://portal.issn.org wishes to *contribute a subset of its bibliographic database to Wikidata*. This consists of (approximately) *1.100.000 serial publications notices* (only current/not discontinued publications), with 10% having 2 or more ISSNs (typically, paper and electronic version). Currently, Wikidata contains approximately 94000 ISSN values. Each serial notice is described with metadata from the set of metadata already freely available from the ISSN Portal : - ISSN identifier - ISSN-L identifier (the ISSN-L identifies the serial publication independantly of its medium, while distinct ISSNs are assigned to each medium) - Proper (original) title of the serial - Key (unambiguous) title of the serial - Country of publication - Distribution format (of a given ISSN) - URL (for online publications) - Language of the serial *(Note that the publisher is not part of the free data).* More information can be read in this presentation. <https://docs.google.com/presentation/d/1vJrfj3zOKTw-29QYp2RyBxkJUlHVk_VKWznhsoZE9FA/edit#slide=id.p1> Preliminary analysis has been conducted and a data model proposal to capture this set of metadata has been submitted at https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Periodicals#Data_donation_from_ISSN_Register_-_Feedback_welcome (Wikidata Periodicals and Wiki Source Projects have been notified) Feedback on this is welcome until 4th of december (although not a formal deadline), either here or on wiki at the link above. We also welcome general advice on how to proceed with an ingest of this size (the plan is currently to use OpenRefine to reconcile and import). Best Regards Thomas -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
[Wikidata] SPARQL examples page not displaying properly latest examples ?
Hello I recently contributed to an example SPARQL query in the example page ( https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples#Display_the_class_tree_under_a_known_class_(subclass_of) ) It is not displayed properly and it seems that this page is broken as none of the latest examples in the page is displayed, starting with this one : https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples#Human_settlements_without_an_article_in_any_language_version_of_Wikipedia Hypothesis : I suspect this is due to the high number of the same "Template:SPARQL" used in that same page. Is this a known issue ? any way to fix this ? Also, I can't find this query example in the query interface, when searching for it. How/when is the query interface updated with the example page ? is it related to the display problem in the page itself ? Cheers Thomas -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
Re: [Wikidata] Full-text / autocomplete search on labels
Ettore, Lucas, thanks a lof for your answers, that's what I was looking for. Searching for an entity based on a label + language (and optionally additionnal criteria like instance of) looks like a reasonable use-case, this could be "packaged" inside magic triples to make this feature directly available in SPARQL (as other triplestores with full-text/index-based search are doing). Cheers Thomas Le ven. 4 oct. 2019 à 14:55, Lucas Werkmeister a écrit : > If you use regular search rather than entity search, you can use > WikibaseCirrusSearch > <https://www.mediawiki.org/wiki/Special:MyLanguage/Help:Extension:WikibaseCirrusSearch> > search keywords like "haswbstatement:P31=Q5". Note that this uses a > different kind of search, though – I think it only matches full words > (after stemming and other transformations?), not prefixes. Query Link > <https://query.wikidata.org/#SELECT%20%3Fperson%20%3FpersonLabel%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Amwapi%20%7B%0A%20%20%20%20bd%3AserviceParam%20wikibase%3Aapi%20%22Search%22%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20wikibase%3Aendpoint%20%22www.wikidata.org%22%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20mwapi%3Asrsearch%20%22einst%20haswbstatement%3AP31%3DQ5%22.%0A%20%20%20%20%3Fperson%20wikibase%3AapiOutputItem%20mwapi%3Atitle.%0A%20%20%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%7D> > > Cheers, > Lucas > On 04.10.19 10:22, Ettore RIZZA wrote: > > Forgot to mention: you can do the same search using the API only: > https://www.wikidata.org/w/api.php?action=wbsearchentities=einst=en=json > > But not sure you can easily filter the results by "instance of". > > Ettore Rizza > > > On Fri, 4 Oct 2019 at 10:15, Ettore RIZZA wrote: > >> Hello Thomas, >> >> You can perform a full text search with the API, but not yet with SPARQL >> AFAIK. However, it is possible to call the API in a SPARQL query. For >> example, here is a query >> <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fperson%20%3FpersonLabel%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Amwapi%20%7B%0A%20%20%20%20%20%20bd%3AserviceParam%20wikibase%3Aapi%20%22EntitySearch%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20wikibase%3Aendpoint%20%22www.wikidata.org%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20mwapi%3Asearch%20%22einst%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20mwapi%3Alanguage%20%22en%22%20.%0A%20%20%20%20%20%20%3Fperson%20wikibase%3AapiOutputItem%20mwapi%3Aitem%20.%0A%20%20%7D%0A%20%20%3Fperson%20wdt%3AP31%20wd%3AQ5.%0A%20%20%0A%20%20%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20%0A%20%20%0A%7D%20ORDER%20BY%20DESC(%3Fperson)%20LIMIT%2020> >> that looks for "human (Q5)" whose label contains the string "einst". >> >> Hope this helps, >> >> Ettore Rizza >> >> >> On Fri, 4 Oct 2019 at 09:58, Thomas Francart >> wrote: >> >>> Hello >>> >>> I understand the wikidata SPARQL label service only fetches the labels, >>> but does not allow to search/filter on them; labels are also available in >>> regulare rdfs:label on which a FILTER can be made. >>> However I would like to do full-text search over labels, to e.g. feed an >>> autocomplete search field, actually just like the usual top-right wikidata >>> search field does. I would also be interested to combine this with a >>> criteria on "instance of", to search only on instances of a given class. >>> >>> Can I do that efficiently using the Wikidata SPARQL service ? or is >>> there a separate API I could use ? (exemple welcome) >>> >>> Thanks >>> Thomas >>> >>> -- >>> >>> *Thomas Francart* -* SPARNA* >>> Web de *données* | Architecture de l'*information* | Accès aux >>> *connaissances* >>> blog : blog.sparna.fr, site : sparna.fr, linkedin : >>> fr.linkedin.com/in/thomasfrancart >>> tel : +33 (0)6.71.11.25.97, skype : francartthomas >>> _______ >>> Wikidata mailing list >>> Wikidata@lists.wikimedia.org >>> https://lists.wikimedia.org/mailman/listinfo/wikidata >>> >> > ___ > Wikidata mailing > listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata > > ___ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata > -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata
[Wikidata] Full-text / autocomplete search on labels
Hello I understand the wikidata SPARQL label service only fetches the labels, but does not allow to search/filter on them; labels are also available in regulare rdfs:label on which a FILTER can be made. However I would like to do full-text search over labels, to e.g. feed an autocomplete search field, actually just like the usual top-right wikidata search field does. I would also be interested to combine this with a criteria on "instance of", to search only on instances of a given class. Can I do that efficiently using the Wikidata SPARQL service ? or is there a separate API I could use ? (exemple welcome) Thanks Thomas -- *Thomas Francart* -* SPARNA* Web de *données* | Architecture de l'*information* | Accès aux *connaissances* blog : blog.sparna.fr, site : sparna.fr, linkedin : fr.linkedin.com/in/thomasfrancart tel : +33 (0)6.71.11.25.97, skype : francartthomas ___ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata