[Wikidata-l] annotating red links

2015-02-11 Thread Amir E. Aharoni
Hi,

TL;DR: How can a red link be annotated in a semantic way with a foreign
article title or a Wikidata Q item number?

Imagine: I'm writing a Wikipedia article in Russian. There's a red link in
it. I don't have time to write the target article for that link now, but
I'm sure that it should exist. In fact, that article does exist in the
English Wikipedia.

I want the link to be red (fr the usual wiki reasons), but until the
Russian article is written, I want to give the software a hint about which
topic it is supposed to be about. Telling it the English article name would
be one way to do it. Giving it the Wikidata Q item number would be an even
better way to do it.

Unfortunately, MediaWiki does not currently have true syntax to do either.
(Correct me if I'm wrong.)

Some Wikipedias may have templates that do something like this (e.g.
Russian: https://ru.wikipedia.org/wiki/Template:En ). But there's nothing
that is uniform to all projects.

*Why* is it useful to give the software this hint in the first place? Most
simplistically, it's useful to the reader - in case that reader knows
English, she can at least read something.

But there's something bigger. When the ContentTranslation extension
translates links, it automatically adapts links that can be found. What to
do about those that can't be auto-adapted? It frequently happens when
Wikipedians translate articles that many links in the created articles turn
out to be red. We'd love to get ContentTranslation to help the translators
make those articles by writing relevant articles with as few clicks as
possible, and that is only possible by annotating the red links with the
topics to which they belong.

So, any ideas?
What do other Wikipedias for such annotation?
Is it imaginable to add wiki syntax for such a thing?
Can anybody think of a hack that reuses the current [[link]] syntax to add
such annotation?

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Jane Darnell
Hi Amir,
We created a template in the English Wikipedia for this and I used it here:
https://en.wikipedia.org/wiki/Koekkoek

I also just stumbled across this, which is also acceptable here:
https://en.wikipedia.org/wiki/Wauters

The first method keeps the enwiki link red, which is what you want, but the
wrapper leads you to Wikidata or to Reasonator.

The second method leads you to an article in the language that you may
recognize by the prefix, but the color difference from blue is too subtle
to notice. It would be nice if we had a gadget that made these orange, the
way I have a gadget now that makes redirects look green.

Jane

On Wed, Feb 11, 2015 at 8:26 PM, Amir E. Aharoni 
amir.ahar...@mail.huji.ac.il wrote:

 Hi,

 TL;DR: How can a red link be annotated in a semantic way with a foreign
 article title or a Wikidata Q item number?

 Imagine: I'm writing a Wikipedia article in Russian. There's a red link in
 it. I don't have time to write the target article for that link now, but
 I'm sure that it should exist. In fact, that article does exist in the
 English Wikipedia.

 I want the link to be red (fr the usual wiki reasons), but until the
 Russian article is written, I want to give the software a hint about which
 topic it is supposed to be about. Telling it the English article name would
 be one way to do it. Giving it the Wikidata Q item number would be an even
 better way to do it.

 Unfortunately, MediaWiki does not currently have true syntax to do either.
 (Correct me if I'm wrong.)

 Some Wikipedias may have templates that do something like this (e.g.
 Russian: https://ru.wikipedia.org/wiki/Template:En ). But there's nothing
 that is uniform to all projects.

 *Why* is it useful to give the software this hint in the first place? Most
 simplistically, it's useful to the reader - in case that reader knows
 English, she can at least read something.

 But there's something bigger. When the ContentTranslation extension
 translates links, it automatically adapts links that can be found. What to
 do about those that can't be auto-adapted? It frequently happens when
 Wikipedians translate articles that many links in the created articles turn
 out to be red. We'd love to get ContentTranslation to help the translators
 make those articles by writing relevant articles with as few clicks as
 possible, and that is only possible by annotating the red links with the
 topics to which they belong.

 So, any ideas?
 What do other Wikipedias for such annotation?
 Is it imaginable to add wiki syntax for such a thing?
 Can anybody think of a hack that reuses the current [[link]] syntax to add
 such annotation?

 --
 Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
 http://aharoni.wordpress.com
 ‪“We're living in pieces,
 I want to live in peace.” – T. Moore‬

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l


___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Amir E. Aharoni
2015-02-11 22:14 GMT+02:00 Ricordisamoa ricordisa...@openmailbox.org:
 Adding non-existing pages to Wikidata items?
 Using a syntax like [Q42[notexistingpagetitle]]?

Is this a suggestion for possible syntax or something that actually works
somewhere?

But yeah, something like this - something that includes the title of a page
that doesn't exist in this wiki, but may some to exist some day, and the Q
number.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Amir E. Aharoni
Yes, Gerard and Jane - this looks like what I'm talking about.

If I may dream for a moment, this should be something that can be used in
all Wikipedias, and without copying this template everywhere, but built
into the site's software :)


--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬

2015-02-11 22:47 GMT+02:00 Gerard Meijssen gerard.meijs...@gmail.com:

 Hoi,
 Have a look at this article ...
 https://en.wikipedia.org/wiki/Herman_Skolnik_Award
 Thanks to Magnus for a blog post I am still to write.
 Thanks,
  GerardM

 On 11 February 2015 at 20:26, Amir E. Aharoni 
 amir.ahar...@mail.huji.ac.il wrote:

 Hi,

 TL;DR: How can a red link be annotated in a semantic way with a foreign
 article title or a Wikidata Q item number?

 Imagine: I'm writing a Wikipedia article in Russian. There's a red link
 in it. I don't have time to write the target article for that link now, but
 I'm sure that it should exist. In fact, that article does exist in the
 English Wikipedia.

 I want the link to be red (fr the usual wiki reasons), but until the
 Russian article is written, I want to give the software a hint about which
 topic it is supposed to be about. Telling it the English article name would
 be one way to do it. Giving it the Wikidata Q item number would be an even
 better way to do it.

 Unfortunately, MediaWiki does not currently have true syntax to do
 either. (Correct me if I'm wrong.)

 Some Wikipedias may have templates that do something like this (e.g.
 Russian: https://ru.wikipedia.org/wiki/Template:En ). But there's
 nothing that is uniform to all projects.

 *Why* is it useful to give the software this hint in the first place?
 Most simplistically, it's useful to the reader - in case that reader knows
 English, she can at least read something.

 But there's something bigger. When the ContentTranslation extension
 translates links, it automatically adapts links that can be found. What to
 do about those that can't be auto-adapted? It frequently happens when
 Wikipedians translate articles that many links in the created articles turn
 out to be red. We'd love to get ContentTranslation to help the translators
 make those articles by writing relevant articles with as few clicks as
 possible, and that is only possible by annotating the red links with the
 topics to which they belong.

 So, any ideas?
 What do other Wikipedias for such annotation?
 Is it imaginable to add wiki syntax for such a thing?
 Can anybody think of a hack that reuses the current [[link]] syntax to
 add such annotation?

 --
 Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
 http://aharoni.wordpress.com
 ‪“We're living in pieces,
 I want to live in peace.” – T. Moore‬

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l


___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Ricordisamoa

Il 11/02/2015 20:26, Amir E. Aharoni ha scritto:

Hi,

TL;DR: How can a red link be annotated in a semantic way with a 
foreign article title or a Wikidata Q item number?


Imagine: I'm writing a Wikipedia article in Russian. There's a red 
link in it. I don't have time to write the target article for that 
link now, but I'm sure that it should exist. In fact, that article 
does exist in the English Wikipedia.


I want the link to be red (fr the usual wiki reasons), but until the 
Russian article is written, I want to give the software a hint about 
which topic it is supposed to be about. Telling it the English article 
name would be one way to do it. Giving it the Wikidata Q item number 
would be an even better way to do it.


Unfortunately, MediaWiki does not currently have true syntax to do 
either. (Correct me if I'm wrong.)


Some Wikipedias may have templates that do something like this (e.g. 
Russian: https://ru.wikipedia.org/wiki/Template:En ). But there's 
nothing that is uniform to all projects.


*Why* is it useful to give the software this hint in the first place? 
Most simplistically, it's useful to the reader - in case that reader 
knows English, she can at least read something.


But there's something bigger. When the ContentTranslation extension 
translates links, it automatically adapts links that can be found. 
What to do about those that can't be auto-adapted? It frequently 
happens when Wikipedians translate articles that many links in the 
created articles turn out to be red. We'd love to get 
ContentTranslation to help the translators make those articles by 
writing relevant articles with as few clicks as possible, and that is 
only possible by annotating the red links with the topics to which 
they belong.


So, any ideas?
What do other Wikipedias for such annotation?
Is it imaginable to add wiki syntax for such a thing?
Can anybody think of a hack that reuses the current [[link]] syntax to 
add such annotation?


--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬


Adding non-existing pages to Wikidata items?
Using a syntax like [Q42[notexistingpagetitle]]?

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Amir E. Aharoni
Yeah, looking into labels is certainly something that I considered, but
that is by definition only a guess and not as bulletproof as Q numbers.

We considered doing stuff like:
* [[not-yet-written article about Douglas Adams|Douglas Adams]]!-- wd: Q42
--
* [[not-yet-written article about Douglas Adams#Q42|Douglas Adams]]

... and this would kinda work, but would be leave a lot of mess to the
community editors to clean up. The template way, suggested by Gerard, is
similar and seems slightly less messy to me. But only slightly.

(If anybody cares, the relevant task in ContentTranslation is
https://phabricator.wikimedia.org/T88580 .)


--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬

2015-02-12 7:51 GMT+02:00 Maarten Dammers maar...@mdammers.nl:

 Hi Amir,

 Amir E. Aharoni schreef op 11-2-2015 om 13:12:

 If I may dream for a moment, this should be something that can be used in
 all Wikipedias, and without copying this template everywhere, but built
 into the site's software :)

 Exactly, the template based approach doesn't scale at all. You have to
 somehow make it automatic. One thing I thought about is adding suggested
 sitelinks to Wikidata. The software would encounter a red link and would
 look in Wikidata if it can find an item with a suggested sitelink of the
 same title. Huge software overhaul so I don't see that happening.

 Another approach that is probably already possible right now:
 * Take an article with a red link
 * Look at the links in the article in other languages.
 * If you find a link that points to another article which has the same
 label as the red link in the same language, link to it

 I wonder how many good results that would give.

 Maarten


 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] annotating red links

2015-02-11 Thread Maarten Dammers

Hi Amir,

Amir E. Aharoni schreef op 11-2-2015 om 13:12:
If I may dream for a moment, this should be something that can be used 
in all Wikipedias, and without copying this template everywhere, but 
built into the site's software :)
Exactly, the template based approach doesn't scale at all. You have to 
somehow make it automatic. One thing I thought about is adding suggested 
sitelinks to Wikidata. The software would encounter a red link and would 
look in Wikidata if it can find an item with a suggested sitelink of the 
same title. Huge software overhaul so I don't see that happening.


Another approach that is probably already possible right now:
* Take an article with a red link
* Look at the links in the article in other languages.
* If you find a link that points to another article which has the same 
label as the red link in the same language, link to it


I wonder how many good results that would give.

Maarten

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] descriptions in mobile app

2015-02-11 Thread Gerard Meijssen
Hoi,
It is pointless to include automated descriptions when they are then saved
in a fixed form. The point of automated descriptions is exactly that they
change as new statements are made. This is one reason why they are superior
to manual descriptions. The other is that when one label is added in a
language, it immediately affects all items that include the associated item.

When the argument is that external users need the best descriptions
available at whatever time, it is best to have the automated descriptions
separate. We have enough experience of the disruption caused by failing
dumps. Given that there is a need for descriptions for off line usage, it
makes sense to consider caching such a file and removing the content that
is changed and have it regenerated in a batch process. When a description
is needed it can always be generated there and then. These can be used
interactively as well.
Thanks,
  GerardM

On 9 February 2015 at 13:21, Markus Kroetzsch 
markus.kroetz...@tu-dresden.de wrote:

 Hi Magnus, hi Daniel,

 I don't think file size should be our primary concern here. What may seem
 big today will be negligible in a few years. Having all data in one place
 is just easier to work with. I am happy to wait for another 30min for a
 download if it saves me from implementing another Web service connector in
 my own code. Compute time is cheap, disk space is cheap, human labour is
 expensive.

 Maybe the whole size discussion is a bit of a red herring here anyway. If
 we are worried about file size, there would maybe be better ways of
 reducing it. We can split the contents into several smaller dump files, not
 just for descriptions. We are already doing this when creating RDF dumps,
 and it would be easy for us to do the same for JSON. We could do this
 immediately if someone needs it (just let me know and we will set it up for
 you). However, if we want to provide smaller files, a more effective method
 would be to split by language rather than by term type: all labels in all
 languages would still be much bigger than labels+descriptions+aliases in
 English only, and many applications will not need labels in 300 languages.

 Anyway, as I said, I do not mind whether the auto-descriptions are stored
 like normal descriptions or whether they are added to the dump files last
 minute when generating them. I just need the descriptions in the dumps.

 Cheers,

 Markus

 On 09.02.2015 12:28, Daniel Kinzler wrote:

 Am 09.02.2015 um 12:25 schrieb Magnus Manske:

 But wouldn't it be better to keep the dump as it is, for those who don't
 want
 triple size (just inventing a number here), and have one separate, or
 even
 per-language, dump with just the automated descriptions, for those who
 want that?


 Possibly. Depends on how much more data this would actually be. Which also
 depends on whether we would omit descriptions in languages that can
 easily be
 covered by language fallback (e.g. no separate descriptions in de-ch and
 de-at).




 --
 Markus Kroetzsch
 Faculty of Computer Science
 Technische Universität Dresden
 +49 351 463 38486
 http://korrekt.org/

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l