On Wed, 10 Jun 2004, Graeme Nichols wrote:

> [EMAIL PROTECTED] graeme]$ sudo rundig -vvv
> ht://dig Start Time: Thu Jun 10 13:13:17 2004
>         1:1:http://localhost/home/graeme/gramps/web/
> New server: localhost, 80
>  - Persistent connections: enabled
>  - HEAD before GET: disabled
>  - Timeout: 30
>  - Connection space: 0
>  - Max Documents: -1
>  - TCP retries: 1
>  - TCP wait time: 5
>  - Accept-Language:
> Trying to retrieve robots.txt file
> Making HTTP request on http://localhost/robots.txt
> Unable to establish the connection with host: localhost (port 80)

Do you have a web server running on port 80? If you open up a browser and
enter http://localhost/robots.txt in the address box, do you get a
response, or does your browser just timeout? If you do have a web server
setup to run on port 80, are you certain it was running when you tried to
index the site?

Are you sure that http://localhost/home/graeme/gramps/web/ is a valid
start_url? Such a URL would seem to imply that your web browser is
configured to use / as the document root, which would generally be
considered a bad thing and is certainly not typical. If you type this URL
into your browser, what happens?

If you are just trying to index things locally, you might want to take a
look at the local_urls and local_urls_only attributes.

  http://www.htdig.org/attrs.html#local_urls
  http://www.htdig.org/attrs.html#local_urls_only


Jim


-------------------------------------------------------
This SF.Net email is sponsored by: GNOME Foundation
Hackers Unite!  GUADEC: The world's #1 Open Source Desktop Event.
GNOME Users and Developers European Conference, 28-30th June in Norway
http://2004/guadec.org
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to