I'm using GNU's "wget" to crawl my local web server
which is mostly CGI based with variables.  For
example, http://localhost/item.cgi?id=1 (id=2, id=3,
etc.).  I'm using wget instead of htdig because the
cookie jar and cookie handler of wget works, and htdig
doesn't seem to.

Anyhow, my plan is to use htdig with local_urls and
local_urls_only -- but it seems to have problems
pulling up documents that are named like this:
item.cgi?id=1 or item.cgi?id=10&cata=denim

I get something like this:

pick: http://www.foo.com, # servers = 1
0:2:0:http://www.foo.com/index.cgi?id=1: Trying local
files
  found existing file
/home/jes/www.foo.com/item.cgi?id=1
Local retrieval failed, trying HTTP
 host not found

--

So htdig finds the file, but then it says local
retrieval failed.  Are there any known issues with
pulling files with CGI variables in the names?

__________________________________________________
Do you Yahoo!?
Yahoo! Mail Plus - Powerful. Affordable. Sign up now.
http://mailplus.yahoo.com


-------------------------------------------------------
This SF.NET email is sponsored by:
SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See!
http://www.vasoftware.com
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to