> For now, I'm doing this in a personal PAWS notebook and I plan to run it
on around 800 enwiki and 200 eswiki articles. But I would like to share the
notebook with others in the future, so they can use it for their own list
of articles, and I may try to make it into a Toolforge tool eventually.

While accessing via cloudelastic replicas would certainly be more
performant, for a set of 1k articles sequentially requesting them through
the public mediawiki api's should be doable.  A query such as this will
return the weighted tags:
https://en.wikipedia.org/wiki/Special:ApiSandbox#action=query&format=json&prop=cirrusdoc&titles=Email&formatversion=2&cdincludes=weighted_tags

That API does report that it is an internal format and subject to change,
but that internal format is the exact same thing we would see talking to
cloudelastic directly.
_______________________________________________
Discovery mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to