Marostegui added a comment.
In T179106#3750329, @daniel wrote:
> In T179106#3750118, @Marostegui wrote:
>> I will definitely try to optimize the table, as it is getting out of hand
>
> Now that we use Elastic for prefix search, we could drop all rows with term_type = "alias" from this table. Needs a little bit of code to make this optional, but should save a lot of space. If you think that would help, ask @hoo and @Lydia_Pintscher about it.
The percentage of rows with that type isn't huge:
root@db2086.codfw.wmnet[wikidatawiki]> select count(*) from wb_terms where term_type="alias";
+----------+
| count(*) |
+----------+
| 16570605 |
+----------+
1 row in set (11 min 24.28 sec)

root@db2086.codfw.wmnet[wikidatawiki]> select count(*) from wb_terms;
+------------+
| count(*)   |
+------------+
| 1268234185 |
+------------+
1 row in set (6 min 4.49 sec)
So that makes it around 1.3% of the table (16570605 out of 1268234185 rows). Not huge, but it would still be good to clean those rows up if they are not needed.
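For anyone who wants to double-check the figure, the share follows directly from the two counts above. A quick sketch in Python (the variable names are only for illustration; the numbers are copied from the query output):

```python
# Row counts copied from the two queries above (db2086, wikidatawiki).
alias_rows = 16_570_605      # rows with term_type = "alias"
total_rows = 1_268_234_185   # all rows in wb_terms

# Share of wb_terms rows that are aliases, as a percentage.
alias_share_pct = alias_rows / total_rows * 100

print(f"{alias_share_pct:.2f}%")  # prints 1.31%
```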
@hoo @Lydia_Pintscher do you want me to create a ticket for this, or would you rather do it yourselves?
To: Marostegui
Cc: Lydia_Pintscher, Stashbot, Ladsgroup, daniel, Marostegui, Aklapper, hoo, Lahi, GoranSMilovanovic, QZanden, Minhnv-2809, Luke081515, Wikidata-bugs, aude, Mbch331, Jay8g, Krenair
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs