[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2017-01-21 Thread gerritbot
gerritbot added a comment.
Change 327905 merged by jenkins-bot:
Add plain-text link name to sitelinks, for easier display.

https://gerrit.wikimedia.org/r/327905TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: gerritbotCc: daniel, gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, merbst, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2017-01-21 Thread daniel
daniel added a comment.

In T148923#2888185, @Smalyshev wrote:
Yes, but I'm not sure we can safely claim that every article on certain wiki is a string in a language of that wiki. It's easy to add it, I'm just not sure it's right to do it.


It's not always technically correct, but a valid assumption that should be right in 99% of the cases, and will produce usable results in like 99.99% or something. Good enough.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: danielCc: daniel, gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, merbst, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2017-01-20 Thread Smalyshev
Smalyshev added a comment.
I changed it to schema:name and added the language. See the updated patch in gerrit.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, merbst, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-19 Thread Smalyshev
Smalyshev added a comment.
Yes, but I'm not sure we can safely claim that every article on certain wiki is a string in a language of that wiki. It's easy to add it, I'm just not sure it's right to do it.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread Esc3300
Esc3300 added a comment.
rdfs:label is used for dublin core title .. it seems suitable for WP article titles.

BTW we do have  "ARPANET"@ru at Wikidata:  ARPANETTASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Esc3300Cc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread Smalyshev
Smalyshev added a comment.
Shouldn't it be "San Francisco"@en as well? (matching label datatype).

Well, the thing is you don't know. You's say "it is the language of the wiki" - but there's no guarantee of that! Consider https://ru.wikipedia.org/wiki/ARPANET - it's a Russian-language wiki, but ARPANET is not a Russian word. There's also https://he.wikipedia.org/wiki/ARPANET. There could be words in different language as wiki titles. I don't think language tag there would be of any use, especially if we know it can be wrong. If you need wiki language, you have inLanguage triple, but that does not guarantee the title is actually a word in that language.

only items have triples with that predicate, and queries that rely on this assumption might break

This is not true, properties have labels too. Can you specify a query that would break? I'd say if a query relies on an assumption only items have labels, it's already broken. But maybe I am missing some use case, let's see the query.

why do we have to stick to URL and its requirements

Because otherwise many tools will be unable to consume those. These are not just abstract strings, these actually represent articles in Wikipedia and other wikis. If they will be in the form from which you can't go to an article, that would be defeating the purpose of a sitelink - i.e. link to a site.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread Base
Base added a comment.
@Smalyshev why do we have to stick to URL and its requirements. RDF seems to be all about IRI rather than URL, and those I believe allow Unicode.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: BaseCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread WikidataFacts
WikidataFacts added a comment.
I don’t really like the choice of rdfs:label as predicate. Currently, as far as I’m aware, only items have triples with that predicate, and queries that rely on this assumption might break (and the straightforward fix, ?item a wikibase:Item, isn’t available on WDQS). There’s also the datatype issue that @Esc3300 mentioned, but I don’t think it would be correct to claim that every title on enwiki is in English, either (random examples: Q300 is just some identifier, Sposalizio is Italian, …).TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: WikidataFactsCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread Esc3300
Esc3300 added a comment.
To compare with labels, spaces would be better.

Shouldn't it be "San Francisco"@en  as well? (matching label datatype).TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Esc3300Cc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-17 Thread Nikki
Nikki added a comment.
Spaces would be better for all three use cases I listed, so I would prefer spaces.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: NikkiCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, Th3d3v1ls, Ramalepe, Liugev6, EBjune, mschwarzer, Avner, Lewizho99, Maathavan, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-12-16 Thread gerritbot
gerritbot added a comment.
Change 327905 had a related patch set uploaded (by Smalyshev):
[WIP] Add plain-text link name to sitelinks, for easier display.

https://gerrit.wikimedia.org/r/327905TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: gerritbotCc: gerritbot, Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, EBjune, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-11-01 Thread Smalyshev
Smalyshev added a comment.
@Esc3300 works for English ones, Russian or Chinese ones would be a bit more tricky.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-11-01 Thread Esc3300
Esc3300 added a comment.
Workaround:

(REPLACE(REPLACE(REPLACE(strafter(str(?article),"/wiki/"),"%20"," "),"%28","("),"%29",")") as ?title)

;)TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: Esc3300Cc: Esc3300, Smalyshev, WikidataFacts, Aklapper, Base, Nikki, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-10-23 Thread Smalyshev
Smalyshev added a comment.
We can't have unencoded links, because there are rules about which characters can appear in the URL. We can have unencoded strings (strings can have nearly any (sane) character inside) but I'm not sure how you propose to distinguish names from different wikis.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: SmalyshevCc: Smalyshev, WikidataFacts, Aklapper, Base, Nikki, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Commented On] T148923: Provide a way to access unencoded page names for sitelinks

2016-10-23 Thread WikidataFacts
WikidataFacts added a comment.
I would suggest a triple with the predicate schema:name, or perhaps schema:headline.TASK DETAILhttps://phabricator.wikimedia.org/T148923EMAIL PREFERENCEShttps://phabricator.wikimedia.org/settings/panel/emailpreferences/To: WikidataFactsCc: WikidataFacts, Aklapper, Base, Nikki, mschwarzer, Avner, debt, Gehel, D3r1ck01, Jonas, FloNight, Xmlizer, Izno, jkroll, Smalyshev, Wikidata-bugs, Jdouglas, aude, Deskana, Manybubbles, Mbch331___
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs