Greetings, Rich. Hope it helps.
A few notes:

- I modified the notebook and ran it again to capture more data
- I added headers to the notebook so that there is a minimal Table of Contents to make jumping around easier
- the notebook mirrored the site in just under 5 minutes in the CoLab environment
- there are a total of 1609 files
- total size is just under 15 GB
- two files suffered a 503 error
- I added a few code cells to find those errors and reattempt a download, if desired
- a full tree, long listing, and short listing of files are included in the notebook at the end

Good luck and I'm curious to hear what you do next with the data.

Regards,
- Robert

On Sat, Nov 18, 2023 at 6:38 AM Rich Shepard <[email protected]> wrote:

> On Sat, 18 Nov 2023, Robert Citek wrote:
>
> > I was able to mirror the site in Google's Colab. Here's a gist with a
> > notebook describing what I did and its output:
> >
> > https://gist.github.com/rwcitek/8d3035f6d2931d80f0569d3964fa6e28
> >
> > In the notebook, you can click on the "Open in Colab" button to run the
> > commands in your own Colab environment.
>
> Robert,
>
> Thank you.
>
> Regards,
>
> Rich
