Stefan Melbinger ha scritto:
Hello everyone,

I'm having troubles with the newest trunk version of wget (revision 2187).

Command-line arguments:

wget
  --recursive
  --spider
  --no-parent
  --no-directories
  --follow-ftp
  --retr-symlinks
  --no-verbose
  --level='2'
  --span-hosts
  --domains='www.example.com,a.example.com,b.example.com'
  --user-agent='Example'
  --output-file='example.log'
  'www.euroskop.cz'

Results in:

wget: url.c:1934: getchar_from_escaped_string: Assertion `str && *str' failed.
Aborted

Can somebody reproduce this problem? Am I using illegal combinations of arguments? Any ideas?

(Worked before the newest patch.)

it's really weird. with this command:

wget -d --verbose --recursive --spider --no-parent --no-directories --follow-ftp --retr-symlinks --level='2' --span-hosts --user-agent='Mozilla/5.001 (windows; U; NT4.0; en-us) Gecko/25250101' --domains='www.example.com,a.example.com,b.example.com' http://www.euroskop.cz/

i get:

---response begin---
HTTP/1.0 200 OK
Date: Mon, 28 Aug 2006 14:35:14 GMT
Content-Type: text/html
Expires: Mon, 28 Aug 2006 14:35:14 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Server: Apache/1.3.26 (Unix) Debian GNU/Linux CSacek/2.1.9 PHP/4.1.2
X-Powered-By: PHP/4.1.2
Pragma: no-cache
Set-Cookie: PHPSESSID=b8af8e220f5f1f7321b86ce0524f88b2; expires=Tue, 29-Aug-06 14:35:14 GMT; path=/
Via: 1.1 proxy (NetCache NetApp/5.6.2R1)

---response end---
200 OK

Stored cookie www.euroskop.cz -1 (ANY) / <permanent> <insecure> [expiry 2006-08-29 16:35:14] PHPSESSID b8af8e220f5f1f7321b86ce0524f88b2
Length: unspecified [text/html]
Closed fd 3
200 OK

index.html: No such file or directory

FINISHED --16:37:42--
Downloaded: 0 bytes in 0 files


it seems there is a weird interaction between cookies and the recursive spider algorithm that makes wget bail out. i'll have to investigate this.


PS: Just FYI, when I compile I get the following warnings:

http.c: In function `http_loop':
http.c:2425: warning: implicit declaration of function `nonexisting_url'

main.c: In function `main':
main.c:1009: warning: implicit declaration of function `print_broken_links'

recur.c: In function `retrieve_tree':
recur.c:279: warning: implicit declaration of function `visited_url'

fixed, thanks.

--
Aequam memento rebus in arduis servare mentem...

Mauro Tortonesi                          http://www.tortonesi.com

University of Ferrara - Dept. of Eng.    http://www.ing.unife.it
GNU Wget - HTTP/FTP file retrieval tool  http://www.gnu.org/software/wget
Deep Space 6 - IPv6 for Linux            http://www.deepspace6.net
Ferrara Linux User Group                 http://www.ferrara.linux.it

Reply via email to