Hi,
My systems are RH7
with htdig-3.2.0b6 installed. I'm trying to get htdig to
index the site running on these servers. The URL i'm
trying to index is a virtual IP that goes to 2 servers. What this means is
that if I type in this URL, it will go to one of the server with less
traffic on it. I'm trying to build the dbase in just one of the
servers. The ports 80 and 443 are open.
This is the alarm I get.
I tried to locate the robots.txt and that file is only available on
/opt/fedex/htdig/htdig-3.2.0b6/test/htdocs/robots.txt not on the Document Root.
If I type in the URL https://dsscos.prod.fedex.com/robots.txt,
it says page cannot be found.
httpd is running and and
if I go to the site https://dsscos.prod.fedex.com/, I can
see data which means this is a valid URL.
+++
[EMAIL PROTECTED] bin]# ./rundig
-vvv
ht://dig Start Time: Tue Apr 4 10:32:47 2006
1:1:https://dsscos.prod.fedex.com/
New server: dsscos.prod.fedex.com, 443
- Persistent connections: enabled
- HEAD before GET: enabled
- Timeout: 30
- Connection space: 0
- Max Documents: -1
- TCP retries: 1
- TCP wait time: 5
- Accept-Language:
Trying to retrieve robots.txt file
Making HTTP request on https://dsscos.prod.fedex.com/robots.txt
Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
.Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
. pushed
pick: dsscos.prod.fedex.com, # servers = 1
> dsscos.prod.fedex.com with a traditional HTTP connection
ht://dig End Time: Tue Apr 4 10:34:32 2006
htpurge: Database is empty!
ht://dig Start Time: Tue Apr 4 10:32:47 2006
1:1:https://dsscos.prod.fedex.com/
New server: dsscos.prod.fedex.com, 443
- Persistent connections: enabled
- HEAD before GET: enabled
- Timeout: 30
- Connection space: 0
- Max Documents: -1
- TCP retries: 1
- TCP wait time: 5
- Accept-Language:
Trying to retrieve robots.txt file
Making HTTP request on https://dsscos.prod.fedex.com/robots.txt
Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
.Unable to establish the connection with host: dsscos.prod.fedex.com (port 443)
Request time: 35 secs
. pushed
pick: dsscos.prod.fedex.com, # servers = 1
> dsscos.prod.fedex.com with a traditional HTTP connection
ht://dig End Time: Tue Apr 4 10:34:32 2006
htpurge: Database is empty!
Preamble
text:
+++++
+++++
Just to simplify the process, I
just followed the procedure:
The standard GNU installation process works for
ht://Dig../configure
--prefix=/usr/localmakemake
installvi
/usr/local/conf/htdig.conf/usr/local/bin/rundig
(The
final three commands must be issued as root.)
If I changed the "start_url"
into a non-SSL site, rundig runs well.
Any help would be greatly
appreciated.
Junie Ablay III

