Am 05.01.26 um 19:53 schrieb Jim:
Other scraping solutions, e.g. via wget, are blocked ATM by Anubis.
(AI bots killing servers otherwise.)
Maybe we could allow the IPs of the mirror servers.
Even then, would that help the Command pages issue?
Yes, because wget would just take the HTML as is shown to the browser
while mwoffliner calls the Mediawiki API which might lead to a better
structured download (not sure, I didn’t really look into results yet).
Hraban
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the
Wiki!
maillist : [email protected] /
https://mailman.ntg.nl/mailman3/lists/ntg-context.ntg.nl
webpage : https://www.pragma-ade.nl / https://context.aanhet.net (mirror)
archive : https://github.com/contextgarden/context
wiki : https://wiki.contextgarden.net
___________________________________________________________________________________