[htdig] problem indexing a site
Hi,

I am having a problem indexing with htdig. I am trying to index one site. I am running with -vvv and - output, but the output does not indicate any errors. It looks as follows:

1:0:http://www.site.net/
New server: www.site.net, 80

It just sits there for a long while.

Thanks in advance.

To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] You will receive a message to confirm this.
List archives: http://www.htdig.org/mail/menu.html
FAQ: http://www.htdig.org/FAQ.html
Re: [htdig] problem indexing a site
On Wed, 17 Jan 2001, Elsa Chan wrote:

> not indicate any errors. It looks like as follows:
> 1:0:http://www.site.net/
> New server: www.site.net, 80
> It just sits there for a long while.

The first thing you should check is whether you can contact this site with another client, e.g. lynx, Netscape, etc. The first thing htdig must do is retrieve the robots.txt file from the server. So if you cannot connect to the server by other means, htdig will not be able to either, and you will have to look at networking issues.

That said, it should not just "hang", since there is a timeout set in the connection code and the 3.1.5 version should be good about killing connections if they time out. How long is "a long while"?

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
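The check suggested above (can anything other than htdig reach the server and its robots.txt?) can also be scripted. What follows is only a minimal sketch in Python, not how htdig itself performs the fetch. The helper names `robots_url` and `probe_robots` and the 10-second timeout are assumptions made for illustration; www.site.net is the placeholder host from the original post.

```python
import socket
import urllib.error
import urllib.request
from urllib.parse import urlsplit


def robots_url(start_url):
    """Build the robots.txt URL for the server named in start_url."""
    parts = urlsplit(start_url)
    return f"{parts.scheme}://{parts.netloc}/robots.txt"


def probe_robots(start_url, timeout=10.0):
    """Try to fetch robots.txt and return a short status string."""
    url = robots_url(start_url)
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return f"{url}: HTTP {resp.status}"
    except urllib.error.HTTPError as err:
        # The server answered, so the network path works (a 404 is fine here).
        return f"{url}: HTTP {err.code}"
    except (urllib.error.URLError, socket.timeout, OSError) as err:
        # No answer at all: look at DNS, routing, firewalls, or the port.
        return f"{url}: no response ({err})"


if __name__ == "__main__":
    # Placeholder host taken from the original post.
    print(probe_robots("http://www.site.net/"))
```

If this reports "no response" or takes the full timeout, the problem is most likely on the network side and not in the htdig configuration.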
RE: [htdig] problem indexing a site
If you aren't using port 80, you will need to set this in the start_url, e.g.:

start_url: http://www.foo.com:81/

Cheers,

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

On Wed, 17 Jan 2001, Elsa Chan wrote:

> It just hangs for 10 to 15 minutes. If port 80 is not what we use, do I go
> and change this in the robots.txt file? Where is this file?
> Thanks
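To illustrate why the port belongs in the URL itself: a URL without an explicit port implies the scheme's default, which matches the "www.site.net, 80" seen in the verbose output. The sketch below is plain Python and not htdig code; `server_and_port` and `DEFAULT_PORTS` are hypothetical names introduced for this example.

```python
from urllib.parse import urlsplit

# Default ports per scheme (an assumption covering only http and https).
DEFAULT_PORTS = {"http": 80, "https": 443}


def server_and_port(url):
    """Return (hostname, port), using the scheme default when no port is given."""
    parts = urlsplit(url)
    port = parts.port if parts.port is not None else DEFAULT_PORTS.get(parts.scheme)
    return parts.hostname, port


if __name__ == "__main__":
    print(server_and_port("http://www.site.net/"))     # no port in the URL
    print(server_and_port("http://www.foo.com:81/"))   # explicit port 81
```

If the verbose output still shows port 80 after editing start_url, the edited value is probably not the one being read.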
RE: [htdig] problem indexing a site
I tried that, but I still get the same message:

1:0:http://www.site.com
New Server www.site.com , 80

And it hangs there. I also tried putting the URL in quotes in the config file.

Thanks

-----Original Message-----
From: Geoff Hutchison [EMAIL PROTECTED]
To: Elsa Chan [EMAIL PROTECTED]
CC: [EMAIL PROTECTED] [EMAIL PROTECTED]
Sent: Wed Jan 17 11:42:00 2001
Subject: RE: [htdig] problem indexing a site
RE: [htdig] problem indexing a site
On Wed, 17 Jan 2001, Elsa Chan wrote:

> 1:0:http://www.site.com
> New Server www.site.com , 80

I think we need to see your config file. If you did change your htdig.conf, then you have done it in a manner that htdig does not recognize.

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
Re: [htdig] problem indexing a site - no errors but nothing is
On Fri, 10 Sep 1999, Jay Tsao wrote:

> sites within our intranet. I am running with -v output but the output does
> not indicate any errors. It looks like as follows:
> New server: site1.hp.com, 80
> New server: site2.hp.com, 80
> 0:0:0:http://site2.hp.com/: *+*+++--++-+++--+---+-+-- size = 17070

You'll probably see what's going on better with -vvv or -. This will show the connection status, any HTTP headers, and the results of the robots.txt file.

-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/

To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] containing the single word unsubscribe in the SUBJECT of the message.
Re: htdig: problem indexing secure site
> Tried setting the variable local_urls, so
> http://bla.bla.com/=/directory/path/to/DocumentRoot

By secure, I believe you mean that you use SSL. But shouldn't the site you want to index be https://?

--
Denis Bazinet, Systems Administrator
Online Strategy Development
Bell Emergis
[EMAIL PROTECTED]
(613) 781-3974

To unsubscribe from the htdig mailing list, send a message to [EMAIL PROTECTED] containing the single word "unsubscribe" in the body of the message.
htdig: problem indexing secure site
Hello,

I am having problems getting htdig to access my secure server site.

I tried setting the variable local_urls, like so:

http://bla.bla.com/=/directory/path/to/DocumentRoot

If it is set in conjunction with start_url, htdig hangs and does not create the db files. If it is used by itself, then www.htdig.org is used, as if my directory were not accessible to htdig, although it is.

Any ideas?

Thanks
Rosemary

--
[EMAIL PROTECTED]
Cybersource, Unix Administrator
Ext: 6152 Or 408.516.1470
Epage: [EMAIL PROTECTED]
http://latte.cybersource.com/docs/chad/cgi/vote.cgi