Works fine for me using old libwww-perl-0.40.
Content-location is not a redirect, so don't try to follow it. It merely tells
you where the content was found, since you didn't specify a filename on the
URL.
GET http://www.svenskakyrkan.se/stift/harnosand/ HTTP/1.0
Host: www.svenskakyrkan.se
Pragma: no-cache
User-Agent: Mozilla/4.03 [en] (MOMspider)
HTTP/1.1 200 OK
Server: Microsoft-IIS/4.0
Content-location: http://www.svenskakyrkan.se/stift/harnosand/Index.htm
Set-cookie: SITESERVER=ID=77f490b5a0d0451693378d1a9e791d8f; expires=Monday,
01-Jan-2035 00:00:00 GMT; path=/; domain=.svenskakyrkan.se
Date: Wed, 12 Apr 2000 13:58:33 GMT
Content-type: text/html
Accept-ranges: bytes
Last-modified: Thu, 06 Apr 2000 10:58:35 GMT
Etag: "104754eb79fbf1:11695"
Content-length: 828
<html>
<head>
<meta HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=windows-1252">
<meta name="GENERATOR" content="Microsoft FrontPage 4.0">
<meta name="ProgId" content="FrontPage.Editor.Document">
<title>H�rn�sands stift p� Internet</title>
<link rel="stylesheet" type="text/css" href="mall.css">
</head>
<frameset framespacing="0" border="0" rows="114,*" frameborder="0">
<frame name="banderoll" scrolling="no" noresize target="innehall"
src="top.htm">
<frameset cols="150,*">
<frame name="innehall" target="huvud" src="vanster1.htm" scrolling="no">
<frame name="huvud" src="startsida.htm" target="_self" scrolling="auto">
</frameset>
<noframes>
<body>
<p>P� den h�r sidan anv�nds ramar som inte st�ds av din webbl�sare.</p>
</body>
</noframes>
</frameset>
</html>
From: jlpoutre%corp.nl.home.com@Internet on 2000-04-12 06:08 AM
To: Mattias.Borell%lub.lu.se@Internet
cc: libwww%perl.org@Internet (bcc: Marvin Simkin)
Subject: Re: Annoying URL...
>Hi all.
>
>Within a local perl-based project we needed some link-checking to be done,
>and
>naturally we used LWP for the job, and it works in it usual charming way.
>However, we've stumbled across a specific URL that defies fetching with our
>code, or HEAD/GET as supplied with LWP.
>
>When you try to reach it with Netscape or *shudder* HotJava, it loads as it
>should. Lynx just freezes after loading a little data...
>
>So, could anyone shed any light on what's going on at a protocol (HTTP) level
>here? I can't seem to debug this properly...
>
> URL: http://www.svenskakyrkan.se/stift/harnosand/
>
>And yes, it's a IIS-server, we've found that much out. :->
>
Weird, the first try with HEAD got me a "500 read timeout" error, but at
the second try this response:
bash-2.01$ HEAD http://www.svenskakyrkan.se/stift/harnosand/
200 OK
Date: Wed, 12 Apr 2000 13:01:51 GMT
Accept-Ranges: bytes
Server: Microsoft-IIS/4.0
Content-Length: 828
Content-Location: http://www.svenskakyrkan.se/stift/harnosand/Index.htm
Content-Type: text/html
ETag: "104754eb79fbf1:11695"
Last-Modified: Thu, 06 Apr 2000 10:58:35 GMT
Client-Date: Wed, 12 Apr 2000 13:02:51 GMT
Client-Peer: 195.17.98.104:80
Seems you need to follow the "Content-Location:" url; typycally NT to have
that capitalized Index.htm file...
Hope this helps!
Regards,
Johannes la Poutre
Content Software Engineer
--
@Home Benelux BV
Gyroscoopweg 90-92
1042 AX Amsterdam
The Netherlands
Tel. +31(0)20 88 555 68
Fax. +31(0)20 88 555 22
Mobile: +31(0)6 218 555 03
http://www.home.nl