Hi,

SO I managed to download the whole html subdir 
(http://www.w3schools.com/html). It worked mostly OK, I have 90% of what 
I wanted. However there is one thing that would make me happier, namely 
"try it yourself" examples they have at:

http://www.w3schools.com/html/html_examples.asp

To invoke html examples they use calls like (just the first example):

http://www.w3schools.com/html/tryit.asp?filename=tryhtml_basic

Wget saves a file and a directory with this very name, but it gets stuck 
at this one:

http://www.w3schools.com/html/tryit_edit.asp?filename=tryhtml_basic&referer=http://www.w3schools.com/html/html_examples.asp

Instead of a file it saves a directory with a truncated name:

/tryit_edit.asp?filename=tryhtml_basic&referer=http:

with a subdir www.w3schools.com/html/ and with the CORRECT content in 
it, but with a WRONG name (html_primary.asp)

SO ALL THE NECCESARRY INFORMATION IS DOWNLOADED but useless for viewing 
due to wrong naming.

Is there a way out somehow? (to protect filenames against : ? / etc.)

thanks a lot
dominique

Thomas Lussnig wrote:

> Dominique wrote:
>
>>> Posting the URL of the problem site would be a good idea.
>>
>> well, I have quite a few. let's start with this:
>>
>> http://www.w3schools.com/html/default.asp
>>
>> or just anything from such a page page. I hacked around for a while 
>> with no apparent success.
>
>
> Try this and it works !!!
> wget -U "Mozilla/5.0 (compatible; MSIE 6.0; Windows NT 5.1)" 
> http://www.w3schools.com/html/default.asp
>
> Problem is that these site Block wget !!!!
>
> Cu Thomas Lußnig


Reply via email to