Hi I have a webserver on port 8080 (Zope) which htdig will not search. Another server listes on port 80, and htdig tries to get relative links from this one.
I have tried to search the mail archives, but I cannot find a solution. Any ideas? >From htdig.conf: limit_urls_to: http://www2.spacetec.no:8080 start_url: http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/ This is the output from "htdig -i -vvvv": ht://dig Start Time: Mon Nov 4 08:44:23 2002 1:1:http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/ New server: www2.spacetec.no, 8080 - Persistent connections: enabled - HEAD before GET: disabled - Timeout: 30 - Connection space: 0 - Max Documents: -1 - TCP retries: 1 - TCP wait time: 5 Trying to retrieve robots.txt file Creating an HtHTTPBasic object Making HTTP request on http://www2.spacetec.no:8080/robots.txt Try to get through to host www2.spacetec.no (port 8080) 1 - Open of the connection ok Assigned the remote host www2.spacetec.no Assigned the port 8080 Header line: HTTP/1.1 404 Not Found Header line: Server: Zope/(Zope 2.5.0 (binary release, python 2.1, linux2-x86), python 2.1.2, linux2) ZServer/1.1b1 Header line: Date: Mon, 04 Nov 2002 08:43:51 GMT Header line: Bobo-Exception-File: /usr/local/zope/2-5-0/lib/python/ZPublisher/HTTPResponse.py Discarded header line: Bobo-Exception-File: /usr/local/zope/2-5-0/lib/python/ZPublisher/HTTPResponse.py Header line: Content-Type: text/html Header line: Bobo-Exception-Type: NotFound Discarded header line: Bobo-Exception-Type: NotFound Header line: Bobo-Exception-Value: bobo exception Discarded header line: Bobo-Exception-Value: bobo exception Header line: Etag: Discarded header line: Etag: Header line: Content-Length: 1849 Header line: Bobo-Exception-Line: 470 Discarded header line: Bobo-Exception-Line: 470 No modification time returned: assuming now Retrieving document /robots.txt on host: www2.spacetec.no:8080 Http version : HTTP/1.1 Server : HTTP/1.1 Status Code : 404 Reason : Not Found Access Time : Mon, 04 Nov 2002 08:43:51 GMT Modification Time : Mon, 04 Nov 2002 07:44:24 GMT Content-type : text/html Persistent connection: would be accepted Body not retrieved Connection stays up ... (Persistent connection) Request time: 1 secs pushed pick: www2.spacetec.no, # servers = 1 > www2.spacetec.no supports HTTP persistent connections (infinite) 0:2:0:http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/: Creating an HtHTTPBasic object Making HTTP request on http://www2.spacetec.no:8080/www2/docs/Rutiner/Adm-rutiner/ Try to get through to host www2.spacetec.no (port 8080) 2 - Open of the connection ok Assigned the remote host www2.spacetec.no Assigned the port 8080 Header line: HTTP/1.1 200 OK Header line: Server: Zope/(Zope 2.5.0 (binary release, python 2.1, linux2-x86), python 2.1.2, linux2) ZServer/1.1b1 Header line: Date: Mon, 04 Nov 2002 08:43:51 GMT Header line: Content-Type: text/html Header line: Etag: Discarded header line: Etag: Header line: Content-Length: 1531 No modification time returned: assuming now Retrieving document /www2/docs/Rutiner/Adm-rutiner/ on host: www2.spacetec.no:8080 Http version : HTTP/1.1 Server : HTTP/1.1 Status Code : 200 Reason : OK Access Time : Mon, 04 Nov 2002 08:43:51 GMT Modification Time : Mon, 04 Nov 2002 07:44:24 GMT Content-type : text/html Persistent connection: would be accepted Reading the body of the response Connection stays up ... (Persistent connection) Request time: 0 secs Tag: html, matched -1 Tag: head, matched -1 Tag: base href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/" /, matched 23 Tag: title, matched 0 word: Administrative@1 word: rutiner@2 Tag: /title, matched 1 title: Administrative rutiner Tag: /head, matched -1 Tag: body BGCOLOR="#FFFFFF", matched -1 Tag: a href="..", matched 2 word: Rutiner@3 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/ (Rutiner) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/ Tag: a href="javascript:window.history.go(-1);", matched 2 word: Tilbake@4 Tag: /a, matched 3 href: (Tilbake) Rejected: URL not in the limits! url rejected: (level 1) Tag: a href="javascript:window.history.go(1);", matched 2 word: Frem@5 Tag: /a, matched 3 href: (Frem) Rejected: URL not in the limits! url rejected: (level 1) Tag: a href="printable_html" target="print", matched 2 word: Utskriftsvennlig@6 word: versjon@7 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html (Utskriftsvennlig versjon) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html Tag: p, matched -1 Tag: h1, matched 4 word: DOKUMENTASJON@8 word: ADMINISTRATIVE@9 word: RUTINER@10 Tag: /h1, matched 10 Tag: li, matched 19 Tag: a href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop", matched 2 word: Innkj@11 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop (Innkj�p) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/innkjop Tag: br, matched -1 Tag: /li, matched -1 Tag: li, matched 19 Tag: a href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp", matched 2 word: Prosjektrapport@12 word: med@13 word: automatisk@14 word: oppdatering@15 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp (Prosjektrapport med automatisk oppdatering) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/aut_prosjektrapp Tag: br, matched -1 Tag: /li, matched -1 Tag: li, matched 19 Tag: a href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer", matched 2 word: Sekret@16 word: rer@17 word: hvem@18 word: hva@19 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer (Sekret�rer - hvem gj�r hva) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/Sekretaerer Tag: br, matched -1 Tag: /li, matched -1 Tag: li, matched 19 Tag: a href="http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning", matched 2 word: Utfylling@20 word: reiseregning@21 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning (Utfylling av reiseregning) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/reiseregning Tag: br, matched -1 Tag: /li, matched -1 Tag: p, matched -1 Tag: hr, matched -1 Tag: a href="..", matched 2 word: Rutiner@22 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/ (Rutiner) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/ Tag: a href="javascript:window.history.go(-1);", matched 2 word: Tilbake@23 Tag: /a, matched 3 href: (Tilbake) Rejected: URL not in the limits! url rejected: (level 1) Tag: a href="javascript:window.history.go(1);", matched 2 word: Frem@24 Tag: /a, matched 3 href: (Frem) Rejected: URL not in the limits! url rejected: (level 1) Tag: a href="printable_html" target="print", matched 2 word: Utskriftsvennlig@25 word: versjon@26 Tag: /a, matched 3 href: http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html (Utskriftsvennlig versjon) Rejected: URL not in the limits! url rejected: (level 1)http://www2.spacetec.no/www2/docs/Rutiner/Adm-rutiner/printable_html Tag: p, matched -1 word: Kommentarer@27 word: til@28 Tag: a href="mailto:drift@;spacetec.no", matched 2 word: drift@29 word: spacetec.no@30 word part: spacetec@30 Tag: /a, matched 3 href: ([EMAIL PROTECTED]) Rejected: URL not in the limits! url rejected: (level 1) Tag: br, matched -1 word: copy@31 word: Kongsberg@32 word: Spacetec@33 Tag: /p, matched -1 Tag: p, matched -1 Tag: a href="http://www.zope.org/Credits" target="_top", matched 2 Tag: img src="http://www2.spacetec.no/p_/ZopeButton" width="115" height="50" border="0" alt="Powered by Zope" /, matched 18 word: Powered@1 word: Zope@2 image: http://www2.spacetec.no/p_/ZopeButton Tag: /a, matched 3 href: http://www.zope.org/Credits (Powered by Zope ) Rejected: URL not in the limits! url rejected: (level 1)http://www.zope.org/Credits Tag: /p, matched -1 Tag: /body, matched -1 Tag: /html, matched -1 head: Rutiner | Tilbake | Frem | Utskriftsvennlig versjon DOKUMENTASJON - ADMINISTRATIVE RUTINER * Innkj�p * Prosjektrapport med automatisk oppdatering * Sekret�rer - hvem gj�r hva * Utfylling av reiseregning Rutiner | Tilbake | Frem | Utskriftsvennlig versjon Kommentarer til [EMAIL PROTECTED] © Kongsberg Spacetec as Powered by Zope size = 1531 pick: www2.spacetec.no, # servers = 1 > www2.spacetec.no supports HTTP persistent connections (infinite) ht://dig End Time: Mon Nov 4 08:44:24 2002 -- / hans - http://go.to/tusenfrydveien32 / http://www.spacetec.no/~hans/dfood.htm /--------------------------------------------- / HANS = High Availability No Superman ------------------------------------------------------- This SF.net email is sponsored by: ApacheCon, November 18-21 in Las Vegas (supported by COMDEX), the only Apache event to be fully supported by the ASF. http://www.apachecon.com _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

