On Sat, Jun 12, 2021 at 8:15 PM John E Petersen wrote: > If I find it is possible to simply download the entire collection, without > having to host a mirror, I may very well go that route.
That is definitely possible, there are two sides to every Debian mirror: 1) downloading Debian 2) making the files available on the web. The second part is definitely optional and many Debian folks do just the first part in order to serve their personal machines with Debian packages. > If I continue the scraping route, would adding wait time in my loop between > downloads make my repeated access less of a problem? I would like to let it > run until it is finished. It is tedious to restart my scrape periodically. Please use the ftpmirror method recommended by Étienne, it is more likely to produce a correct result than scraping and much less likely to get blocked. The Debian archive is only updated every six hours, so it would be a waste of bandwidth to update more often than that. -- bye, pabs https://wiki.debian.org/PaulWise