Re: [Wikidata] Language codes 'mul' and 'mis' not recognized

2020-06-22 Thread Thomas Francart
Hello

Le lun. 22 juin 2020 à 18:19, Mbch331  a
écrit :

> Monolingual codes aren't available for labels/aliases, only for properties
> of the type monolingual text.
>
Indeed, thank youu.
What confused me is that I cannot enter such a monolingual text value with
language code "mul" or "mis" from the GUI, for a statement like "title".

Thomas


> Mbch331
> Op 22-6-2020 om 15:52 schreef Thomas Francart:
>
> Hello
>
> Le lun. 22 juin 2020 à 15:33, Federico Leva (Nemo)  a
> écrit :
>
>> Thomas Francart, 22/06/20 16:21:
>> > I also cannot use these 2 codes when editing through the human
>> interface.
>>
>> Language code validation on Wikidata is sometimes confusing. See also:
>> https://phabricator.wikimedia.org/T39459
>>
>>
> Thanks, I read the page but am not sure to understand how it relates to
> the original question. Can you decode ? Does it explain why creating
> labels/aliases with code 'mul' or 'mis' fails using the API ?
>
>
>> Why "mis" instead of "und"? See also
>> https://phabricator.wikimedia.org/T230833#6103004
>>
>>
> Sorry, I wrote "when the language value is unknown", but this was not
> precise : the original data we want to synch already contains data using
> code "mis" (as a valid ISO639-2 iso code, see
> https://www.loc.gov/standards/iso639-2/php/code_list.php), and as I read
> it was supported by Wikidata, I attempted to use that code directly through
> the API, with no luck. Similarly, original data uses code 'mul' (also a
> valid ISO639-2 code).
>
> Thomas
>
>
>> Federico
>>
>
>
> --
>
> *Thomas Francart* -* SPARNA*
> Web de *données* | Architecture de l'*information* | Accès aux
> *connaissances*
> blog : blog.sparna.fr, site : sparna.fr, linkedin :
> fr.linkedin.com/in/thomasfrancart
> tel :  +33 (0)6.71.11.25.97, skype : francartthomas
>
> ___
> Wikidata mailing 
> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>
>
>
> <https://www.avast.com/sig-email?utm_medium=email_source=link_campaign=sig-email_content=emailclient>
>  Virusvrij.
> www.avast.com
> <https://www.avast.com/sig-email?utm_medium=email_source=link_campaign=sig-email_content=emailclient>
> <#m_637642425612966372_DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>


-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Language codes 'mul' and 'mis' not recognized

2020-06-22 Thread Thomas Francart
Hello

Le lun. 22 juin 2020 à 15:33, Federico Leva (Nemo)  a
écrit :

> Thomas Francart, 22/06/20 16:21:
> > I also cannot use these 2 codes when editing through the human interface.
>
> Language code validation on Wikidata is sometimes confusing. See also:
> https://phabricator.wikimedia.org/T39459
>
>
Thanks, I read the page but am not sure to understand how it relates to the
original question. Can you decode ? Does it explain why creating
labels/aliases with code 'mul' or 'mis' fails using the API ?


> Why "mis" instead of "und"? See also
> https://phabricator.wikimedia.org/T230833#6103004
>
>
Sorry, I wrote "when the language value is unknown", but this was not
precise : the original data we want to synch already contains data using
code "mis" (as a valid ISO639-2 iso code, see
https://www.loc.gov/standards/iso639-2/php/code_list.php), and as I read it
was supported by Wikidata, I attempted to use that code directly through
the API, with no luck. Similarly, original data uses code 'mul' (also a
valid ISO639-2 code).

Thomas


> Federico
>


-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] Language codes 'mul' and 'mis' not recognized

2020-06-22 Thread Thomas Francart
Hello

(Please direct me somewhere else is this is not the place to ask.)

I am developing a bot to synch ISSN data [1] using WDTK. Creating a
label/alias with the language code 'mul' (to indicate a multilingual
value), or 'mis' (to indicate the language is unknown) does not work and
returns an Exception from the wikimedia API ("[not-recognized-language] The
supplied language code was not recognized"). These 2 codes are however
documented at https://www.wikidata.org/wiki/Help:Monolingual_text_languages
and listed at
https://www.wikidata.org/wiki/Help:Wikimedia_language_codes/lists/all

I also cannot use these 2 codes when editing through the human interface.
I do however find items in Wikidata with 'mul' language codes.

Am I doing something wrong, did I misunderstand something, or is there a
true problem with these codes ?

Also opened ticket in WDTK issue tracker :
https://github.com/Wikidata/Wikidata-Toolkit/issues/509

Thanks
Thomas

[1] : ISSN Bot :
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Periodicals#Data_donation_from_ISSN_Register_(Followup)

-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] Data donation from ISSN Internation Center - feedback welcome

2019-11-04 Thread Thomas Francart
Hello

The ISSN International Center, which maintains the ISSN Register (*the most
complete reference source for the identification of serial publications*)
at http://portal.issn.org wishes to *contribute a subset of its
bibliographic database to Wikidata*.
This consists of (approximately) *1.100.000 serial publications notices*
(only current/not discontinued publications), with 10% having 2 or more
ISSNs (typically, paper and electronic version). Currently, Wikidata
contains approximately 94000 ISSN values.

Each serial notice is described with metadata from the set of metadata
already freely available from the ISSN Portal :

   - ISSN identifier
   - ISSN-L identifier (the ISSN-L identifies the serial publication
   independantly of its medium, while distinct ISSNs are assigned to each
   medium)
   - Proper (original) title of the serial
   - Key (unambiguous) title of the serial
   - Country of publication
   - Distribution format (of a given ISSN)
   - URL (for online publications)
   - Language of the serial

*(Note that the publisher is not part of the free data).*

More information can be read in this presentation.
<https://docs.google.com/presentation/d/1vJrfj3zOKTw-29QYp2RyBxkJUlHVk_VKWznhsoZE9FA/edit#slide=id.p1>

Preliminary analysis has been conducted and a data model proposal to
capture this set of metadata has been submitted at
https://www.wikidata.org/wiki/Wikidata_talk:WikiProject_Periodicals#Data_donation_from_ISSN_Register_-_Feedback_welcome
(Wikidata Periodicals and Wiki Source Projects have been notified)

Feedback on this is welcome until 4th of december (although not a formal
deadline), either here or on wiki at the link above.

We also welcome general advice on how to proceed with an ingest of this
size (the plan is currently to use OpenRefine to reconcile and import).

Best Regards
Thomas

-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] SPARQL examples page not displaying properly latest examples ?

2019-10-08 Thread Thomas Francart
Hello

I recently contributed to an example SPARQL query in the example page (
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples#Display_the_class_tree_under_a_known_class_(subclass_of)
)

It is not displayed properly and it seems that this page is broken as none
of the latest examples in the page is displayed, starting with this one :
https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/queries/examples#Human_settlements_without_an_article_in_any_language_version_of_Wikipedia

Hypothesis : I suspect this is due to the high number of the same
"Template:SPARQL" used in that same page.

Is this a known issue ? any way to fix this ?

Also, I can't find this query example in the query interface, when
searching for it. How/when is the query interface updated with the example
page ? is it related to the display problem in the page itself ?

Cheers
Thomas

-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


Re: [Wikidata] Full-text / autocomplete search on labels

2019-10-04 Thread Thomas Francart
Ettore, Lucas, thanks a lof for your answers, that's what I was looking for.
Searching for an entity based on a label + language (and optionally
additionnal criteria like instance of) looks like a reasonable use-case,
this could be "packaged" inside magic triples to make this feature directly
available in SPARQL (as other triplestores with full-text/index-based
search are doing).

Cheers
Thomas

Le ven. 4 oct. 2019 à 14:55, Lucas Werkmeister  a
écrit :

> If you use regular search rather than entity search, you can use
> WikibaseCirrusSearch
> <https://www.mediawiki.org/wiki/Special:MyLanguage/Help:Extension:WikibaseCirrusSearch>
> search keywords like "haswbstatement:P31=Q5". Note that this uses a
> different kind of search, though – I think it only matches full words
> (after stemming and other transformations?), not prefixes. Query Link
> <https://query.wikidata.org/#SELECT%20%3Fperson%20%3FpersonLabel%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Amwapi%20%7B%0A%20%20%20%20bd%3AserviceParam%20wikibase%3Aapi%20%22Search%22%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20wikibase%3Aendpoint%20%22www.wikidata.org%22%3B%0A%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20%20mwapi%3Asrsearch%20%22einst%20haswbstatement%3AP31%3DQ5%22.%0A%20%20%20%20%3Fperson%20wikibase%3AapiOutputItem%20mwapi%3Atitle.%0A%20%20%7D%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%7D>
>
> Cheers,
> Lucas
> On 04.10.19 10:22, Ettore RIZZA wrote:
>
> Forgot to mention: you can do the same search using the API only:
> https://www.wikidata.org/w/api.php?action=wbsearchentities=einst=en=json
>
> But not sure you can easily filter the results by "instance of".
>
> Ettore Rizza
>
>
> On Fri, 4 Oct 2019 at 10:15, Ettore RIZZA  wrote:
>
>> Hello Thomas,
>>
>> You can perform a full text search with the API, but not yet with SPARQL
>> AFAIK. However, it is possible to call the API in a SPARQL query. For
>> example, here is a query
>> <https://query.wikidata.org/#SELECT%20DISTINCT%20%3Fperson%20%3FpersonLabel%20WHERE%20%7B%0A%20%20SERVICE%20wikibase%3Amwapi%20%7B%0A%20%20%20%20%20%20bd%3AserviceParam%20wikibase%3Aapi%20%22EntitySearch%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20wikibase%3Aendpoint%20%22www.wikidata.org%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20mwapi%3Asearch%20%22einst%22%20.%0A%20%20%20%20%20%20bd%3AserviceParam%20mwapi%3Alanguage%20%22en%22%20.%0A%20%20%20%20%20%20%3Fperson%20wikibase%3AapiOutputItem%20mwapi%3Aitem%20.%0A%20%20%7D%0A%20%20%3Fperson%20wdt%3AP31%20wd%3AQ5.%0A%20%20%0A%20%20%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22%5BAUTO_LANGUAGE%5D%2Cen%22.%20%7D%0A%20%20%0A%20%20%0A%7D%20ORDER%20BY%20DESC(%3Fperson)%20LIMIT%2020>
>> that looks for "human (Q5)"  whose label contains the string "einst".
>>
>> Hope this helps,
>>
>> Ettore Rizza
>>
>>
>> On Fri, 4 Oct 2019 at 09:58, Thomas Francart 
>> wrote:
>>
>>> Hello
>>>
>>> I understand the wikidata SPARQL label service only fetches the labels,
>>> but does not allow to search/filter on them; labels are also available in
>>> regulare rdfs:label on which a FILTER can be made.
>>> However I would like to do full-text search over labels, to e.g. feed an
>>> autocomplete search field, actually just like the usual top-right wikidata
>>> search field does. I would also be interested to combine this with a
>>> criteria on "instance of", to search only on instances of a given class.
>>>
>>> Can I do that efficiently using the Wikidata SPARQL service ? or is
>>> there a separate API I could use ? (exemple welcome)
>>>
>>> Thanks
>>> Thomas
>>>
>>> --
>>>
>>> *Thomas Francart* -* SPARNA*
>>> Web de *données* | Architecture de l'*information* | Accès aux
>>> *connaissances*
>>> blog : blog.sparna.fr, site : sparna.fr, linkedin :
>>> fr.linkedin.com/in/thomasfrancart
>>> tel :  +33 (0)6.71.11.25.97, skype : francartthomas
>>> _______
>>> Wikidata mailing list
>>> Wikidata@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikidata
>>>
>>
> ___
> Wikidata mailing 
> listWikidata@lists.wikimedia.orghttps://lists.wikimedia.org/mailman/listinfo/wikidata
>
> ___
> Wikidata mailing list
> Wikidata@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikidata
>


-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata


[Wikidata] Full-text / autocomplete search on labels

2019-10-04 Thread Thomas Francart
Hello

I understand the wikidata SPARQL label service only fetches the labels, but
does not allow to search/filter on them; labels are also available in
regulare rdfs:label on which a FILTER can be made.
However I would like to do full-text search over labels, to e.g. feed an
autocomplete search field, actually just like the usual top-right wikidata
search field does. I would also be interested to combine this with a
criteria on "instance of", to search only on instances of a given class.

Can I do that efficiently using the Wikidata SPARQL service ? or is there a
separate API I could use ? (exemple welcome)

Thanks
Thomas

-- 

*Thomas Francart* -* SPARNA*
Web de *données* | Architecture de l'*information* | Accès aux
*connaissances*
blog : blog.sparna.fr, site : sparna.fr, linkedin :
fr.linkedin.com/in/thomasfrancart
tel :  +33 (0)6.71.11.25.97, skype : francartthomas
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata