Hi!
Wget 1.5.3 uses /robots.txt to skip some parts of a web site, but it
doesn't use the robots META tag, which serves the same purpose.
I believe that Wget must also parse and use such tags.
WBR
Stas mailto:[EMAIL PROTECTED]
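
To make the request above a bit more concrete, here is a rough sketch of what honoring such a tag could look like. It assumes the tag meant is the robots META tag (`<META NAME="ROBOTS" CONTENT="...">`), and it is written in Python with the standard `html.parser` module, not taken from Wget's own source. The names `RobotsMetaParser` and `may_follow_links` are made up for this illustration.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives found in <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names for us.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        for token in attr.get("content", "").split(","):
            token = token.strip().lower()
            if token:
                self.directives.add(token)


def may_follow_links(html_text):
    """Return False if the page asks robots not to follow its links."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    # "none" is shorthand for "noindex, nofollow".
    return not ({"nofollow", "none"} & parser.directives)


if __name__ == "__main__":
    page = '<html><head><META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW"></head></html>'
    print(may_follow_links(page))
```

A recursive downloader could run a check like this on each HTML page it fetches and skip link extraction when the function returns False, in addition to the /robots.txt check it already performs.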
Volker Kuhlmann wrote:
> Tried to download these 2 files. From konqueror it succeeds:
>
>GET /%7Egaultney/gentium/files/Gentium%20RU%20Specimen.pdf HTTP/1.1
>
> but from wget I get
>
>HEAD /~gaultney/gentium/files/Gentium RU Specimen.pdf HTTP/1.0
>
> The server barfs on spaces in the HEAD l
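
The difference between the two request lines quoted above is percent-encoding of the path: the space becomes %20, and Konqueror also sends ~ as %7E, which is equivalent. As an illustration only (this is Python's `urllib.parse`, not what either client does internally, and `encode_request_path` is a hypothetical helper name), the encoding step might look like this:

```python
from urllib.parse import quote, unquote


def encode_request_path(path):
    """Percent-encode a URL path, leaving '/' and '~' untouched."""
    return quote(path, safe="/~")


raw_path = "/~gaultney/gentium/files/Gentium RU Specimen.pdf"
encoded_path = encode_request_path(raw_path)

# The request line a server can parse unambiguously: no raw spaces in the path.
print("HEAD " + encoded_path + " HTTP/1.0")

# Decoding the path Konqueror sent gives back the same raw path.
konqueror_path = "/%7Egaultney/gentium/files/Gentium%20RU%20Specimen.pdf"
print(unquote(konqueror_path) == raw_path)
```

A raw space in the request line is a problem because the space is also the separator between the method, the path, and the HTTP version.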
Go to
my site, get wget if you don't have it already, and get the documentation. From the
Mini Howto section get the archive with the example batch files and take a look at
the wget options used in getlinks. Compare them with the
manual.

You will use something like

    wget -vkKrHp -l1

Use them on
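
For comparing the bundled short options with the manual, it may help to see them spelled out. The sketch below lists the long-form names as given in current GNU Wget manuals (older releases may not support all of them) and builds the equivalent command line. The function name `build_wget_command` and the URL `http://example.com/` are placeholders for illustration, not taken from the message.

```python
import shlex

# Long-form equivalents of "-vkKrHp -l1", per the GNU Wget manual.
LONG_OPTIONS = [
    "--verbose",           # -v  print detailed progress information
    "--convert-links",     # -k  rewrite links for local viewing
    "--backup-converted",  # -K  keep the original file as .orig before converting
    "--recursive",         # -r  follow links recursively
    "--span-hosts",        # -H  allow recursion onto other hosts
    "--page-requisites",   # -p  also fetch images, stylesheets, etc.
    "--level=1",           # -l1 limit recursion depth to one level
]


def build_wget_command(url):
    """Return the argument list for wget with the options spelled out."""
    return ["wget"] + LONG_OPTIONS + [url]


if __name__ == "__main__":
    # Placeholder URL; substitute the page you actually want to fetch.
    command = build_wget_command("http://example.com/")
    print(" ".join(shlex.quote(arg) for arg in command))
```

The list form can be passed directly to `subprocess.run` if you prefer to drive wget from a script instead of a batch file.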
When something like this happens you need more information. Sometimes -v and
the logfile are enough; sometimes a look at the HTML helps.
In this case, opening that page in a browser showed me links pointing to
another host, so after a bit of tweaking this seems to work fine (currently
running):