Hi 

I have a webserver on port 8080 (Zope) which htdig will not search.
Another server listes on port 80, and htdig tries to get relative links
from this one.


I have tried to search the mail archives, but I cannot find a solution.
Any ideas?

>From htdig.conf:
  limit_urls_to:          http://www2.spacetec.no:8080
  start_url: http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/


This is the output from "htdig -i -vvvv":

ht://dig Start Time: Mon Nov  4 08:44:23 2002
        1:1:http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/
New server: www2.spacetec.no, 8080
 - Persistent connections: enabled
 - HEAD before GET: disabled
 - Timeout: 30
 - Connection space: 0
 - Max Documents: -1
 - TCP retries: 1
 - TCP wait time: 5
Trying to retrieve robots.txt file
Creating an HtHTTPBasic object
Making HTTP request on http://www2.spacetec.no:8080/robots.txt
Try to get through to host www2.spacetec.no (port 8080)
    1 - Open of the connection ok
        Assigned the remote host www2.spacetec.no
        Assigned the port 8080
Header line: HTTP/1.1 404 Not Found
Header line: Server: Zope/(Zope 2.5.0 (binary release, python 2.1,
linux2-x86), python 2.1.2, linux2) ZServer/1.1b1
Header line: Date: Mon, 04 Nov 2002 08:43:51 GMT
Header line: Bobo-Exception-File:
/usr/local/zope/2-5-0/lib/python/ZPublisher/HTTPResponse.py
Discarded header line: Bobo-Exception-File:
/usr/local/zope/2-5-0/lib/python/ZPublisher/HTTPResponse.py
Header line: Content-Type: text/html
Header line: Bobo-Exception-Type: NotFound
Discarded header line: Bobo-Exception-Type: NotFound
Header line: Bobo-Exception-Value: bobo exception
Discarded header line: Bobo-Exception-Value: bobo exception
Header line: Etag: 
Discarded header line: Etag: 
Header line: Content-Length: 1849
Header line: Bobo-Exception-Line: 470
Discarded header line: Bobo-Exception-Line: 470
No modification time returned: assuming now
Retrieving document /robots.txt on host: www2.spacetec.no:8080
Http version      : HTTP/1.1
Server            : HTTP/1.1
Status Code       : 404
Reason            : Not Found
Access Time       : Mon, 04 Nov 2002 08:43:51 GMT
Modification Time : Mon, 04 Nov 2002 07:44:24 GMT
Content-type      : text/html
Persistent connection: would be accepted
Body not retrieved
Connection stays up ... (Persistent connection)
Request time: 1 secs
 pushed
pick: www2.spacetec.no, # servers = 1
> www2.spacetec.no supports HTTP persistent connections (infinite)
0:2:0:http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/:
Creating an HtHTTPBasic object
Making HTTP request on
http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/
Try to get through to host www2.spacetec.no (port 8080)
    2 - Open of the connection ok
        Assigned the remote host www2.spacetec.no
        Assigned the port 8080
Header line: HTTP/1.1 200 OK
Header line: Server: Zope/(Zope 2.5.0 (binary release, python 2.1,
linux2-x86), python 2.1.2, linux2) ZServer/1.1b1
Header line: Date: Mon, 04 Nov 2002 08:43:51 GMT
Header line: Content-Type: text/html
Header line: Etag: 
Discarded header line: Etag: 
Header line: Content-Length: 1531
No modification time returned: assuming now
Retrieving document /www2/docs/Rutiner/Adm-rutiner/ on host:
www2.spacetec.no:8080
Http version      : HTTP/1.1
Server            : HTTP/1.1
Status Code       : 200
Reason            : OK
Access Time       : Mon, 04 Nov 2002 08:43:51 GMT
Modification Time : Mon, 04 Nov 2002 07:44:24 GMT
Content-type      : text/html
Persistent connection: would be accepted
Reading the body of the response
Connection stays up ... (Persistent connection)
Request time: 0 secs
Tag: html, matched -1
Tag: head, matched -1
Tag: base href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/";
/, matched 23
Tag: title, matched 0
word: Administrative@1
word: rutiner@2
Tag: /title, matched 1

title: Administrative rutiner
Tag: /head, matched -1
Tag: body BGCOLOR="#FFFFFF", matched -1
Tag: a href="..", matched 2
word: Rutiner@3
Tag: /a, matched 3
href: http://www2.spacetec.no/www2/docs/Rutiner/ (Rutiner)

   Rejected: URL not in the limits!
url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/
Tag: a href="javascript:window.history.go(-1);", matched 2
word: Tilbake@4
Tag: /a, matched 3
href:  (Tilbake)

   Rejected: URL not in the limits!
url rejected: (level 1)
Tag: a href="javascript:window.history.go(1);", matched 2
word: Frem@5
Tag: /a, matched 3
href:  (Frem)

   Rejected: URL not in the limits!
url rejected: (level 1)
Tag: a href="printable_html" target="print", matched 2
word: Utskriftsvennlig@6
word: versjon@7
Tag: /a, matched 3
href:
http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html
(Utskriftsvennlig versjon)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html
Tag: p, matched -1
Tag: h1, matched 4
word: DOKUMENTASJON@8
word: ADMINISTRATIVE@9
word: RUTINER@10
Tag: /h1, matched 10
Tag: li, matched 19
Tag: a
href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop";,
matched 2
word: Innkj@11
Tag: /a, matched 3
href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop
(Innkj�p)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop
Tag: br, matched -1
Tag: /li, matched -1
Tag: li, matched 19
Tag: a
href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp";, matched 
2
word: Prosjektrapport@12
word: med@13
word: automatisk@14
word: oppdatering@15
Tag: /a, matched 3
href:
http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp
(Prosjektrapport med automatisk oppdatering)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp
Tag: br, matched -1
Tag: /li, matched -1
Tag: li, matched 19
Tag: a
href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer";, matched 2
word: Sekret@16
word: rer@17
word: hvem@18
word: hva@19
Tag: /a, matched 3
href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer
(Sekret�rer - hvem gj�r hva)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer
Tag: br, matched -1
Tag: /li, matched -1
Tag: li, matched 19
Tag: a
href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning";, matched 2
word: Utfylling@20
word: reiseregning@21
Tag: /a, matched 3
href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning
(Utfylling av reiseregning)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning
Tag: br, matched -1
Tag: /li, matched -1
Tag: p, matched -1
Tag: hr, matched -1
Tag: a href="..", matched 2
word: Rutiner@22
Tag: /a, matched 3
href: http://www2.spacetec.no/www2/docs/Rutiner/ (Rutiner)

   Rejected: URL not in the limits!
url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/
Tag: a href="javascript:window.history.go(-1);", matched 2
word: Tilbake@23
Tag: /a, matched 3
href:  (Tilbake)

   Rejected: URL not in the limits!
url rejected: (level 1)
Tag: a href="javascript:window.history.go(1);", matched 2
word: Frem@24
Tag: /a, matched 3
href:  (Frem)

   Rejected: URL not in the limits!
url rejected: (level 1)
Tag: a href="printable_html" target="print", matched 2
word: Utskriftsvennlig@25
word: versjon@26
Tag: /a, matched 3
href:
http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html
(Utskriftsvennlig versjon)

   Rejected: URL not in the limits!
url rejected: (level
1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html
Tag: p, matched -1
word: Kommentarer@27
word: til@28
Tag: a href="mailto:drift@;spacetec.no", matched 2
word: drift@29
word: spacetec.no@30
word part: spacetec@30
Tag: /a, matched 3
href:  ([EMAIL PROTECTED])

   Rejected: URL not in the limits!
url rejected: (level 1)
Tag: br, matched -1
word: copy@31
word: Kongsberg@32
word: Spacetec@33
Tag: /p, matched -1
Tag: p, matched -1
Tag: a href="http://www.zope.org/Credits"; target="_top", matched 2
Tag: img src="http://www2.spacetec.no/p_/ZopeButton"; width="115"
height="50" border="0" alt="Powered by Zope" /, matched 18
word: Powered@1
word: Zope@2
image: http://www2.spacetec.no/p_/ZopeButton
Tag: /a, matched 3
href: http://www.zope.org/Credits (Powered by Zope )

   Rejected: URL not in the limits!
url rejected: (level 1)http://www.zope.org/Credits
Tag: /p, matched -1
Tag: /body, matched -1
Tag: /html, matched -1
head:   Rutiner | Tilbake | Frem | Utskriftsvennlig versjon
DOKUMENTASJON - ADMINISTRATIVE RUTINER * Innkj�p * Prosjektrapport med
automatisk oppdatering * Sekret�rer - hvem gj�r hva * Utfylling av
reiseregning Rutiner | Tilbake | Frem | Utskriftsvennlig versjon
Kommentarer til [EMAIL PROTECTED] &copy Kongsberg Spacetec as Powered by
Zope 
 size = 1531
pick: www2.spacetec.no, # servers = 1
> www2.spacetec.no supports HTTP persistent connections (infinite)
ht://dig End Time: Mon Nov  4 08:44:24 2002

-- 
/ hans - http://go.to/tusenfrydveien32
/        http://www.spacetec.no/~hans/dfood.htm      
/---------------------------------------------
/ HANS = High Availability No Superman



-------------------------------------------------------
This SF.net email is sponsored by: ApacheCon, November 18-21 in
Las Vegas (supported by COMDEX), the only Apache event to be
fully supported by the ASF. http://www.apachecon.com
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to