Manuel added a subscriber: AndrewTavis_WMDE. Manuel added a comment.
I have moved this comment from T336361#9134372 <https://phabricator.wikimedia.org/T336361#9134372>. In T336361#9134372 <https://phabricator.wikimedia.org/T336361#9134372>, @AndrewTavis_WMDE wrote: > @Manuel, I think we'll need to abondon the idea of creating an edits subset of `wmf.webrequests`, sadly :( The following would be where we'd find the various actions that we'd need to collect to define as edits fully: https://www.wikidata.org/w/api.php. We know at the very least that we'd want `uri_query LIKE '?action=edit%'` and similarly `wbsetclaim` and `wbeditentity`, but figuring out what else needs to be added seems to be prohibitive given the discrepancy: > > > SELECT > COUNT(*) AS total_edits > > FROM > wmf.webrequest > > WHERE > year = 2023 > AND month = 7 > AND day = 31 > AND uri_host IN ('www.wikidata.org', 'm.wikidata.org') > AND ( > uri_query LIKE '?action=edit%' > OR uri_query LIKE '?action=wbsetclaim%' > OR uri_query LIKE '?action=wbeditentity%' > ) > > ... gives us `25,374` (`26,329` if we do `%wbsetclaim%` and `%wbeditentity%`), and the following: > > SELECT > COUNT(*) AS total_edits > > FROM > wmf_raw.mediawiki_private_cu_changes > > WHERE > wiki_db = 'wikidatawiki' > AND month = '2023-07' > AND '20230731' <= cuc_timestamp > AND cuc_timestamp < '20230801' > > ... gives us `657,347`, with `25374/657347` being `3.86%`. This way of doing this is gonna get really messy as well as there are apparently combinations of actions and some actions also have `edit` in their names. > > There definitely should be a combination of those actions that gets us a similar number, but this would be something that we'd need to loop WMF into, and the easiest route would likely be to talk to them about getting an edits subset similar to `pageview_actor` as a table in the Data Lake. TASK DETAIL https://phabricator.wikimedia.org/T344052 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Manuel Cc: AndrewTavis_WMDE, Aklapper, Manuel, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org