Ah, and I think I found a bug in your command: by grepping for |Q[1-9][0-9]*+*|, you’re excluding single-digit item IDs. I’m going to speculate that if you fix that, Q5 will comfortably beat all other items :)
On 8/23/18 11:31 PM, Lucas Werkmeister wrote: > > The top result freaks me out, to be honest. Are /that many/ people > running the first query from the SPARQL tutorial > <https://www.wikidata.org/wiki/Wikidata:SPARQL_tutorial>, or is there > some other reason why Bach might be so overwhelmingly popular? > > On 8/23/18 9:57 PM, Daniel Mietchen wrote: >> I just ran Max' one-liner over one of the dump files, and it worked >> smoothly. Not sure where the best place would be to store such things, >> so I simply put it in my sandbox for now: >> https://www.wikidata.org/w/index.php?title=User:Daniel_Mietchen/sandbox&oldid=732396160 >> . >> d. >> On Tue, Aug 7, 2018 at 6:06 PM David Cuenca Tudela <dacu...@gmail.com> wrote: >>> If someone could post the 10 (or 50!) more popular items, I would really >>> appreciate it :-) >>> >>> Cheers, >>> Micru >>> >>> On Tue, Aug 7, 2018 at 5:59 PM Maximilian Marx >>> <maximilian.m...@tu-dresden.de> wrote: >>>> Hi, >>>> >>>> On Tue, 7 Aug 2018 17:37:34 +0200, Markus Kroetzsch >>>> <markus.kroetz...@tu-dresden.de> said: >>>>> If you want a sorted list of "most popular" items, this is a bit more >>>>> work and would require at least some Python script, or some less >>>>> obvious combination of sed (extracting all URLs of entities), and >>>>> sort. >>>> zgrep -Eoe '%3Chttp%3A%2F%2Fwww.wikidata.org%2Fentity%2FQ[1-9][0-9]+%3E' >>>> dump.gz | cut -d 'Q' -f 2 | cut -d '%' -f 1 | sort | uniq -c | sort -nr >>>> >>>> should do the trick. >>>> >>>> Best, >>>> >>>> Maximilian >>>> -- >>>> Dipl.-Math. Maximilian Marx >>>> Knowledge-Based Systems Group >>>> Faculty of Computer Science >>>> TU Dresden >>>> +49 351 463 43510 >>>> https://kbs.inf.tu-dresden.de/max >>>> >>>> _______________________________________________ >>>> Wikidata mailing list >>>> Wikidata@lists.wikimedia.org >>>> https://lists.wikimedia.org/mailman/listinfo/wikidata >>> -- >>> Etiamsi omnes, ego non >>> _______________________________________________ >>> Wikidata mailing list >>> Wikidata@lists.wikimedia.org >>> https://lists.wikimedia.org/mailman/listinfo/wikidata >> _______________________________________________ >> Wikidata mailing list >> Wikidata@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wikidata > > _______________________________________________ > Wikidata mailing list > Wikidata@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wikidata
_______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata