Hi there,

I am not sure if it is a bug or I just do not understand how wget
works (generally known as 'I am stupid').

I use a script to mirror some directories recursively on a CentOS 8
with GNU Wget 1.19.5.

> export WGETOPTIONS='--limit-rate=2m --mirror --timestamp --recursive 
> --no-parent -R "index.html*" --no-host-directories --cut-dirs=1 --no-verbose'
> wget $WGETOPTIONS 'https://example.com/Dir-1/Dir-1.1/' -P 
> /srv/fileserver/docs/example.com/Dir-1

Which throws the following output:
> 2019-11-25 14:42:24 URL:https://example.com/Dir-1/Dir-1.1/ [14650] -> 
> "/srv/fileserver/docs/example.com/Dir-1/Dir-1.1/index.html" [1]
> https://example.com/robots.txt:
> 2019-11-25 14:42:24 ERROR 404: Not Found.
> /srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.1: Is a 
> directory
> Cannot write to 
> ‘/srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.1’ (Is a 
> directory).
> /srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.2: Is a 
> directory
> Cannot write to 
> ‘/srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.2’ (Is a 
> directory).
> /srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.3: Is a 
> directory
> Cannot write to 
> ‘/srv/fileserver/docs/example.com/Dir-1/SubDir-1.1/Sub-SubDir-1.1.3’ (Is a 
> directory).

If I remove a directory from my local file server, wget mirrors the
directory correctly.

I tried a lot of other options, but no matter what I tried I run into
this error.
It also does download the index.html even if I reject it.

Maybe you guys can help me a little bit.

Cheers
 Boris

Reply via email to