[Wikidata-bugs] [Maniphest] [Commented On] T194273: Clean up indexes of wb_terms table

2018-05-21 Thread Marostegui
Marostegui added a comment.
@Ladsgroup can you confirm which indexes we have to drop? There are two different patches and the syntax doesn't match what we have in production:

These are the patches:

DROP INDEX /*i*/term_search_key ON /*_*/wb_terms;
DROP INDEX /*i*/term_search_full ON /*_*/wb_terms;

DROP INDEX /*i*/term_search ON /*_*/wb_terms;
DROP INDEX /*i*/term_entity ON /*_*/wb_terms;

These are the ones we have in core.

->   KEY `wb_terms_entity_id` (`term_entity_id`),
->   KEY `wb_terms_text` (`term_text`),
->   KEY `wb_terms_search_key` (`term_search_key`),
->   KEY `tmp1` (`term_language`,`term_type`,`term_entity_type`,`term_search_key`),
->   KEY `term_full_entity` (`term_full_entity_id`),
->   KEY `term_search_full` (`term_language`,`term_full_entity_id`,`term_type`,`term_search_key`(16))
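
The mismatch can be checked mechanically. The following is a minimal sketch, not part of the original thread: it compares the index names from the two patches with the index names in the table definition above using plain Python sets. The variable names are illustrative, and the sketch assumes the /*i*/ marker in the patches expands to an empty prefix.

```python
# Index names as written in the two patches (assumption: /*i*/ expands to nothing).
patch_indexes = {
    "term_search_key",
    "term_search_full",
    "term_search",
    "term_entity",
}

# Index names taken from the KEY lines of the table definition quoted above.
production_indexes = {
    "wb_terms_entity_id",
    "wb_terms_text",
    "wb_terms_search_key",
    "tmp1",
    "term_full_entity",
    "term_search_full",
}

# Names that appear in both places, so a DROP INDEX would find them as written.
exact_matches = patch_indexes & production_indexes

# Names in the patches with no index of that exact name in the quoted definition.
missing_in_production = patch_indexes - production_indexes

print("exact matches:", sorted(exact_matches))
print("not found under that name:", sorted(missing_in_production))
```

Only an exact name comparison is done here. Whether a differently named index (for example wb_terms_search_key) corresponds to one of the patch names is exactly the question asked in this comment, so the sketch does not try to guess it.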

Can you confirm which ones we have to delete as part of this task? (I know we are handling tmp1 on a different task).

TASK DETAIL
https://phabricator.wikimedia.org/T194273

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Marostegui
Cc: gerritbot, Marostegui, Aklapper, Lucas_Werkmeister_WMDE, Jonas, jcrespo, Ladsgroup, Versusxo, Majesticalreaper22, Giuliamocci, Adrian1985, Cpaulf30, Lahi, Gq86, Baloch007, Darkminds3113, Bsandipan, Lordiis, GoranSMilovanovic, Adik2382, Th3d3v1ls, Ramalepe, Liugev6, QZanden, LawExplorer, Lewizho99, Minhnv-2809, Maathavan, Luke081515, Wikidata-bugs, aude, Mbch331, Jay8g, Krenair

Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs


[Wikidata-bugs] [Maniphest] [Claimed] T194273: Clean up indexes of wb_terms table

2018-05-21 Thread Marostegui
Marostegui claimed this task.
Marostegui moved this task from Next to In progress on the DBA board.
Marostegui added a comment.
I need to deploy other schema changes on s8, so I will include this as it is a pretty straightforward one and has been merged already.

TASK DETAIL
https://phabricator.wikimedia.org/T194273

WORKBOARD
https://phabricator.wikimedia.org/project/board/1060/


[Wikidata-bugs] [Maniphest] [Claimed] T194270: Drop 'tmp1' index from wb_terms table in production

2018-05-21 Thread Marostegui
Marostegui claimed this task.
Marostegui moved this task from Next to In progress on the DBA board.
Marostegui added a comment.
I need to deploy other schema changes on s8, so I will include this as it is a pretty straightforward one.

TASK DETAIL
https://phabricator.wikimedia.org/T194270

WORKBOARD
https://phabricator.wikimedia.org/project/board/1060/


[Wikidata-bugs] [Maniphest] [Commented On] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread abian
abian added a comment.
I thought this was because of the ;, but it seems to be more complex: sometimes it happens and sometimes it doesn't.

TASK DETAIL
https://phabricator.wikimedia.org/T195258


[Wikidata-bugs] [Maniphest] [Commented On] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread abian
abian added a comment.
I've just seen that this issue arises when two triples condensed with ; are written on the same line, and does not happen when they are written on different lines. The same can be reproduced with this other query.
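
For illustration only, the two layouts might look as follows. The query text is borrowed from the curl command posted later in this thread, and the exact line split is an assumption about what "different lines" means here. The sketch only checks that both layouts contain the same tokens once whitespace is normalized, which suggests that any difference in behaviour comes from how the query text is handled and not from the query itself.

```python
# Layout 1: both predicate-object pairs on one line, joined with ";".
same_line = """SELECT ?q ?qLabel WHERE {
  ?q wdt:P31 wd:Q3917681 ; wdt:P17 wd:Q142 .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "fr,en". }
}"""

# Layout 2 (assumed): the second pair moved to its own line.
separate_lines = """SELECT ?q ?qLabel WHERE {
  ?q wdt:P31 wd:Q3917681 ;
     wdt:P17 wd:Q142 .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "fr,en". }
}"""


def normalize(query: str) -> str:
    """Collapse every run of whitespace (including newlines) to a single space."""
    return " ".join(query.split())


# True when the two layouts differ only in whitespace.
same_tokens = normalize(same_line) == normalize(separate_lines)
print(same_tokens)
```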

If we were dealing with private data, we would talk about a security breach... but this data is free! 😃

TASK DETAIL
https://phabricator.wikimedia.org/T195258


[Wikidata-bugs] [Maniphest] [Commented On] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread abian
abian added a comment.
The language in which I get the online query results is correct (the one requested with SERVICE wikibase:label), while the language of the dump is the one configured for the Wikidata Query UI.

From the command line, the output is also correct.

$ curl 'https://query.wikidata.org/sparql?query=SELECT%20%3Fq%20%3FqLabel%20WHERE%20%7B%0A%20%20%3Fq%20wdt%3AP31%20wd%3AQ3917681%20%3B%20wdt%3AP17%20wd%3AQ142%20.%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22fr%2Cen%22.%20%7D%0A%7D%0A'

TASK DETAIL
https://phabricator.wikimedia.org/T195258
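
The percent-encoded URL above is hard to read. As a small sketch (not part of the original comment), the SPARQL text can be recovered offline with Python's standard urllib.parse module. The variable names are illustrative and no request is sent to the endpoint.

```python
from urllib.parse import parse_qs, urlsplit

# The URL exactly as passed to curl in the comment above.
url = (
    "https://query.wikidata.org/sparql?query=SELECT%20%3Fq%20%3FqLabel%20WHERE%20%7B"
    "%0A%20%20%3Fq%20wdt%3AP31%20wd%3AQ3917681%20%3B%20wdt%3AP17%20wd%3AQ142%20."
    "%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20"
    "wikibase%3Alanguage%20%22fr%2Cen%22.%20%7D%0A%7D%0A"
)

# Split off the query string and percent-decode the "query" parameter.
params = parse_qs(urlsplit(url).query)
sparql = params["query"][0]

print(sparql)
```

The decoded text shows the two triples joined with ; on one line and the label service requesting the languages "fr,en".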


[Wikidata-bugs] [Maniphest] [Edited] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread VIGNERON
VIGNERON updated the task description. (Show Details)
CHANGES TO TASK DESCRIPTION
...
Even weirder, I tested and the bug appears on Firefox (60.0.1, 64-bit) but not on Chrome or Edge Oo

TASK DETAIL
https://phabricator.wikimedia.org/T195258


[Wikidata-bugs] [Maniphest] [Commented On] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread Lucas_Werkmeister_WMDE
Lucas_Werkmeister_WMDE added a comment.
Huh, I can reproduce this on my work laptop, but not on my private PC… (both using Firefox)

TASK DETAIL
https://phabricator.wikimedia.org/T195258


[Wikidata-bugs] [Maniphest] [Updated] T195258: [weird] Wikidata SPARQL query results not the same when exported

2018-05-21 Thread Lucas_Werkmeister_WMDE
Lucas_Werkmeister_WMDE added projects: Wikidata, Wikidata Query UI.

TASK DETAIL
https://phabricator.wikimedia.org/T195258


[Wikidata-bugs] [Maniphest] [Commented On] T187060: reserved Lexeme IDs

2018-05-21 Thread Theklan
Theklan added a comment.
The first written Basque word was izioki (actually jçioq, but in current orthography it would be spelled like that). Would it be possible to have an Easter egg with this and reserve L171081 for this word?

TASK DETAIL
https://phabricator.wikimedia.org/T187060


[Wikidata-bugs] [Maniphest] [Commented On] T193728: Solve legal uncertainty of Wikidata

2018-05-21 Thread Cirdan
Cirdan added a comment.

In T193728#4213806, @Micru wrote:
since I hold the rights to that text

The concept of "rights" is quite flexible, as Wikipedia shows. The Wikipedias are based on texts that have copyrights, but they have been paraphrased so that the copyright no longer applies. The same goes for data mining: in a way, it is paraphrasing a text into a machine-readable format.


I'm again sorry to say that, but your comments show a deeply flawed understanding of copyright. Copyright of texts is by no means "flexible". It seems you are confusing plagiarism and copyright violation, which are completely separate categories (the former is a concept in the context of academic scholarship, the latter a concept in the context of law). What we are discussing here is whether data collections licensed under CC-BY-SA or other non-CC-0 licenses (like OSM) can be imported to Wikidata. The licenses of these collections do not simply vanish because one alters some words or uses a computer program to extract the information.


In T193728#4218415, @Micru wrote:
In my opinion, a CC license that would allow for data mining as CC0 would be most helpful, and not only for the Wikimedia movement.


There is already a license which allows data mining under CC-0: CC-0 itself. There cannot be any other license which allows re-use of content under CC-0 and is not effectively identical to CC-0. If there are cases where copyright law permits the extraction of information from copyrighted texts, then this applies to CC-BY-SA licensed texts as well, so there is no need to change CC-BY-SA to extract information from Wikipedia.

But as said, I would be rather troubled by such an approach.

Do you care to explain why it bothers you to clarify the license?

It's not a "clarification"; it would constitute a retroactive conversion of CC-BY-SA into a license which is effectively CC-0. As we have explained to you multiple times now, that is not possible without the consent of every single contributor of a copyright-protected text to Wikipedia, and it is highly doubtful that a majority of Wikipedians (or the WMF) is interested in converting Wikipedia to CC-0.

(I can only urge you again to carefully read the explanations people in this discussion have given you and perhaps also look into copyright law (a Wikipedia article will do) and the CC-BY-SA and CC-0 license texts to understand the fundamental issues we are discussing here.)

TASK DETAIL
https://phabricator.wikimedia.org/T193728