Hi, SO I managed to download the whole html subdir (http://www.w3schools.com/html). It worked mostly OK, I have 90% of what I wanted. However there is one thing that would make me happier, namely "try it yourself" examples they have at:
http://www.w3schools.com/html/html_examples.asp To invoke html examples they use calls like (just the first example): http://www.w3schools.com/html/tryit.asp?filename=tryhtml_basic Wget saves a file and a directory with this very name, but it gets stuck at this one: http://www.w3schools.com/html/tryit_edit.asp?filename=tryhtml_basic&referer=http://www.w3schools.com/html/html_examples.asp Instead of a file it saves a directory with a truncated name: /tryit_edit.asp?filename=tryhtml_basic&referer=http: with a subdir www.w3schools.com/html/ and with the CORRECT content in it, but with a WRONG name (html_primary.asp) SO ALL THE NECCESARRY INFORMATION IS DOWNLOADED but useless for viewing due to wrong naming. Is there a way out somehow? (to protect filenames against : ? / etc.) thanks a lot dominique Thomas Lussnig wrote: > Dominique wrote: > >>> Posting the URL of the problem site would be a good idea. >> >> well, I have quite a few. let's start with this: >> >> http://www.w3schools.com/html/default.asp >> >> or just anything from such a page page. I hacked around for a while >> with no apparent success. > > > Try this and it works !!! > wget -U "Mozilla/5.0 (compatible; MSIE 6.0; Windows NT 5.1)" > http://www.w3schools.com/html/default.asp > > Problem is that these site Block wget !!!! > > Cu Thomas Lußnig