Hannah_Bast added a comment.
@DD063520: You find some details at https://github.com/ad-freiburg/qlever/blob/master/docs/quickstart.md . For the current Wikidata, indexing takes around 24 hours on a AMD Ryzen 9 5900X with 128 GB of RAM and cheap HDDs. Our goal is an indexing time of at most 1 hour / 1 billion triples and we are not far from that. The latest version of Wikidata (including the lexeme data) has around 16B triples. The current index size is 750 GB, but this will soon be cut in half by some further compression. With efficient support only for queries where each predicates has a fixed value and is no variable (which is the case for most queries), the index size is 250 GB. Being super space-efficient was not among our high-priority goals so far, since space is relatively cheap. TASK DETAIL https://phabricator.wikimedia.org/T291903 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Hannah_Bast Cc: DD063520, Justin0x2004, Hannah_Bast, So9q, Aklapper, Invadibot, MPhamWMF, maantietaja, CBogen, Akuckartz, Nandana, Namenlos314, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, QZanden, EBjune, merbst, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Xmlizer, jkroll, Wikidata-bugs, Jdouglas, aude, Tobias1984, Manybubbles, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org