Ghuron added a comment.
Look at the query that is used to get missing articles for "List of articles
every Wikipedia should have" https://quarry.wmflabs.org/query/26700 There are 2
joins:
- For iwlinks/wb_items_per_site join I have to do either CONCAT ('Q',
ips_item_id) or TRIM('Q'
Ladsgroup added a comment.
The query @Multichill is running is complex on its own and even if we resolve this task, it'll be still a very slow query, what I would recommend is to make a temporary table in labs for exactly this tool (which happens all the time, for example Magnus does it too) and
daniel added a comment.
We should also consider a solution that will only put the full ids on labs. We don't actually need it in production. Could be a separate table, kept up to day by a trigger.TASK DETAILhttps://phabricator.wikimedia.org/T114904EMAIL
hoo added a comment.
Giving the size of the table, changing this shouldn't be overly horrible. It's a fair bit of migration work… but I assume doing this for maintenance queries and consistency is worth it.TASK DETAILhttps://phabricator.wikimedia.org/T114904EMAIL
daniel added a comment.
@Multichill wb_items_per_site is *always* items. So to get the full item ID, just use concat('Q', ips_item_id).
That's not very nice, but viable I think. We could even make a virtual column ips_full_entity_id on the labs view...
I'm dropping the prio back to normal,