date:20170614

[Wikidata] External ID URL format

2017-06-14 Thread David Lowe

NYPL's Photographers' Identities Catalog (P2750) has a new data view that
would be preferable for use in WD (the IDs are, of course, unchanged).  The
current formatted URL is

http://pic.nypl.org/map/?DisplayName=$1

But I'd rather it point to:

http://pic.nypl.org/constituents/$1

I can't seem to edit it (perhaps because I'm on my phone, or more likely
it's restricted for security purposes). How can we get this switched?
Many thanks in advance!

David

-- 


*David Lowe | The New York Public Library**Specialist II, Photography
Collection*

*Photographers' Identities Catalog *
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

[Wikidata] New satellite event for the Celtic Knot: Wikipedia Language Conference - Booking closes 27 June

2017-06-14 Thread MCANDREW Ewan

Dear colleagues,

How can technology support language communities?

Join us at the Celtic Knot: Wikipedia Language 
conference taking 
place Thursday 6 July 2017 at the University of Edinburgh Business School to 
find out. Booking closes 27 June so don’t delay!
The main objective for Celtic Knot 2017 is the coming together of those working 
to support Celtic and Indigenous Languages in the same room at same time; 
strengthening the bonds into a 'knot' and leading into action. We welcome 
diverse attendees ranging from Wikimedians, linguists, educators, researchers, 
information professionals, media professionals, translators, learning 
technologists and more coming together to share good practice and find fruitful 
new collaborations to support language communities as a result of the event.

New satellite event: Introduction to Wikidata and the Wikidata Query 
Service
 – this event is now scheduled 11am to 1pm on Tuesday 4 July presented by Léa 
Lacroix (Project Manager Community Communication for Wikidata, Wikimedia 
Deutschland). Léa will also be joining us at the Celtic Knot as part of the 
Wikidata workshop on 6 July too. Just one of the many great 
speakers 
and 
presentations
 at the Celtic Knot:

Keynote speakers

  *   Professor Antonella 
Sorace - 
Professor of Developmental Linguistics at the University of 
Edinburgh and founding director of 
Bilingualism Matters will be speaking 
on ‘Bilingualism in minority languages: a resource and an opportunity’.
  *   Jason 
Evans - 
Wikimedian in Residence at the National Library of 
Wales
 will discuss his strategy for working with Wikimedia UK and the Welsh 
Government to develop the Welsh Wicipedia using a combination of community 
engagement, data manipulation and the implementation of Open Access policies.


Confirmed speakers also include:

  *   Susan Ross – Gaelic Wikipedian in 
Residence
 at the National Library of 
Scotland.
  *   Dr. Sharon 
Arbuthnot
 - Research Fellow, Queen's University, Belfast. Presenting on the AHRC-funded 
eDIL project (Irish Language dictionary) on Wednesday 5th 
July.
  *   Gareth Morlais – the Welsh Language Unit, Welsh Government.  Gareth will 
speak about how mapping how much importance major companies (Google, Twitter, 
Apple) attach to creative activity on Wikipedia led to the Welsh Government 
helping to fund two Welsh-language Wikipedia initiatives.
  *   Delyth Prys – Head of the Language Technologies Unit, Bangor University, 
will speak on Welsh/Celtic speech technology and why text-to-speech and speech 
recognition are becoming increasingly important in our digital world.
  *   Àlex Hinojo – Executive Director, 
Amical Wikimedia on the 
Catalan language project.
  *   Iñaki Lopez de Luzuriaga – Developing the Basque Wikipedia: From corpus 
expansion to outreach.
  *   Astrid Carlsen – Executive Director, Wikimedia Norge speaking on 
Norwegian Bokmål, Norwegian Nynorsk and building a project to revitalize the 
Northern Sami Wikipedia.
  *   Robin Owain – Wales Manager, Wikimedia UK, speaking on recent 
developments supporting the Welsh language community.
  *   Mina Theofilatou  presenting on ‘The Kefalonian Dialect in Wiktionary and 
how Wikitherapy addresses social equality in open-source language projects’.
  *   Duncan Brown - Llên Natur, 
presenting on ‘Y BYWIADUR: the dictionary of life’.
  *   Rémy Gerbet - Wikimedia France; presenting on the Lingua Libre 
project for massive open audio recording.
  *   Käbi Suvi - Wikimedia Estonia on the ‘Miljon+’ project as part of 
Estonia’s 100th anniversary.
  *   Ilario Valdelli - Wikimedia Switzerland, speaking on the Digital Library 
in Romansch and the new initiatives to map the archeological sites connected 
with Celtic culture in the Alps.
  *   Wikipedia’s new Content Translation tool and how it has been successfully 
employed in Higher Education to

Re: [Wikidata] Multilingual and synonym support for M'n'm / was: Mix'n'Match with existing (indirect) mappings

2017-06-14 Thread Neubert, Joachim

Hi Magnus,

the idea was not to search for all labels/synonyms separately, but to
concatenate everything in one large search string, and let the fulltext search
do the magic.

E.g., for STW descriptor “CGE model”, search for “CGE model, CGE-Modell, ORANI
model, MONASH model, Dynamic CGE model, Computable general equilibrium model,
CGE analysis, Applied general equilibrium model”

When, as in Fuseki, the fulltext search tries to match every word in the
string, it may return long lists of results. However: When these can be sorted
by a score value, they can be limited to the best matching 10 or whatever
results.

An according example query, which works on a GND endpoint, is here:
http://zbw.eu/beta/sparql-lab/?endpoint=http://zbw.eu/beta/sparql/gnd/query&queryRef=https://api.github.com/repos/zbw/sparql-queries/contents/gnd/search_subject.rq
I’m pretty sure, that would work as well on our currently unavailable internal
WD endpoint on Fuseki. Unfortunately, MWAPI fulltext search seems to work
differently.

Another pattern, which I have applied with a query which looks up person names
and their name variants from GND, and then searches in the above mentioned
custom WD instance, is here:
https://github.com/zbw/sparql-queries/blob/master/wikidata/search_person_by_gnd_names.rq.

For, e.g., “John H. Dunning” (http://d-nb.info/gnd/119094665) all name variants
are bound in a fulltext search expression, and a sum of scores is computed to
rank the total result
(http://zbw.eu/beta/sparql-lab/result?resultRef=https://api.github.com/repos/zbw/sparql-queries/contents/wikidata/results/search_person_by_gnd_names.wikidata_2016-11-07.gnd_2016-09.json).

I have experimented a bit, but neither of these patterns seems to work with the
current MWAPI implementation. Since my understanding is very poor here, and the
implementation is in an early stage, I cc Stas, who perhaps can contribute
ideas.

Cheers, Joachim

Von: Wikidata [mailto:wikidata-boun...@lists.wikimedia.org] Im Auftrag von
Magnus Manske
Gesendet: Mittwoch, 14. Juni 2017 09:33
An: Discussion list for the Wikidata project.
Betreff: Re: [Wikidata] Multilingual and synonym support for M'n'm / was:
Mix'n'Match with existing (indirect) mappings

On Tue, Jun 13, 2017 at 6:25 PM Neubert, Joachim
mailto:j.neub...@zbw.eu>> wrote:
Hi Magnus, Osma,

I suppose the scenario Osma pointed out is quite common for knowledge
organization systems and in particular thesauri: Matching could take advantage
of multilingual labels and also of synonyms, which are defined in the KOS.

For the populating STW Thesaurus for Economics ID (P3911), my preliminary plan
was to match with all multilingual labels and synonyms as search string in a
custom WD endpoint (Fuseki, with full text search support), and display in the
ranked SPARQL results of the search with a column with a valid insert statement
that can be copied and pasted into QuickStatements2.

Since Stas just announced an extension for WDQS with fulltext search (if I
haven’t misunderstood his mail of 2017-06-12), it is perhaps now possible to do
this kind of matching in WDQS.

It would be great if such an extended matching could be integrated into M’n’m.
To clarify, Mix'n'match already searches language-neutral, e.g. for automatch.

Storing multiple labels per entry in the Mix'n'match database, and then
checking all-against-all, would require some large-scale rewiring.
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Re: [Wikidata] Running queries on a schedule / How does Constraint Violations Reporting work?

2017-06-14 Thread Tony Bowden

On 13 June 2017 at 19:11, Jonas Kress  wrote:
> For using your own SPARQL queries and creating violation lists you could use
> Magnus' tool listera [3]

I'd like to echo this one — we recently resuscitated the Heads of
State and Government wikiproject[1], and as part of that made loads of
Listeria based tables to check that data is consistent, e.g.
https://www.wikidata.org/wiki/Wikidata:EveryPolitician/Contrast_Report:Head_of_Government

The version of that to list obvious problems is now empty, as we got
it all cleaned it all up[2], but if some people keep that page on
their Watchlist then hopefully any new problems can be spotted and
corrected quite quickly. (Hint, a second bookmark to a Watchlist of
only pages in the 'Wikidata' namespace (i.e. not changes to items),
and with bot edits allowed, even if you've those hidden in your main
watchlist, can be handy for this)

Tony

[1] 
https://www.wikidata.org/wiki/Wikidata:WikiProject_Heads_of_state_and_government
[2] compare what it was like a week previously :
https://www.wikidata.org/w/index.php?title=Wikidata:EveryPolitician/Contrast_Report:Head_of_Government&oldid=490812995

___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

Re: [Wikidata] Multilingual and synonym support for M'n'm / was: Mix'n'Match with existing (indirect) mappings

2017-06-14 Thread Magnus Manske

On Tue, Jun 13, 2017 at 6:25 PM Neubert, Joachim  wrote:

> Hi Magnus, Osma,
>
>
>
> I suppose the scenario Osma pointed out is quite common for knowledge
> organization systems and in particular thesauri: Matching could take
> advantage of multilingual labels and also of synonyms, which are defined in
> the KOS.
>
>
>
> For the populating STW Thesaurus for Economics ID (P3911), my preliminary
> plan was to match with all multilingual labels and synonyms as search
> string in a custom WD endpoint (Fuseki, with full text search support), and
> display in the ranked SPARQL results of the search with a column with a
> valid insert statement that can be copied and pasted into QuickStatements2.
>
>
>
> Since Stas just announced an extension for WDQS with fulltext search (if I
> haven’t misunderstood his mail of 2017-06-12), it is perhaps now possible
> to do this kind of matching in WDQS.
>
>
>
> It would be great if such an extended matching could be integrated into
> M’n’m.
>
To clarify, Mix'n'match already searches language-neutral, e.g. for
automatch.

Storing multiple labels per entry in the Mix'n'match database, and then
checking all-against-all, would require some large-scale rewiring.
___
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata

[Wikidata] External ID URL format

[Wikidata] New satellite event for the Celtic Knot: Wikipedia Language Conference - Booking closes 27 June

Re: [Wikidata] Multilingual and synonym support for M'n'm / was: Mix'n'Match with existing (indirect) mappings

Re: [Wikidata] Running queries on a schedule / How does Constraint Violations Reporting work?

Re: [Wikidata] Multilingual and synonym support for M'n'm / was: Mix'n'Match with existing (indirect) mappings

5 matches

Site Navigation

Mail list logo

Footer information