Jérôme,

> which Nutch version do you use?

Kind of gave up on mapred for awhile, so I am using
trunk.

> There were a bug concerning the content-types with
> parameters such as
> "text/html; charset=iso-8859-1".

Yeah, when I telnet in to GET / shopthar.com, I get

Content-Type: text/html; charset=iso-8859-1

> This issue is fixed in trunk and mapred.

Hmm, well, I was seeing something earlier in trunk. 
That said, something happened and I now seem to get a
partial crawl started.  How very strange.  I did catch
a few updates today, but the commits sure didn't seem
related.

Now I crawl for awhile, and then it just stops.  I
still get new segments starting, but no new http hits
to the server.  So looks like I have something new to
track down.  But yeah, when it is going, it can hammer
pretty good.

Earl


                
__________________________________ 
Yahoo! FareChase: Search multiple travel sites in one click.
http://farechase.yahoo.com


-------------------------------------------------------
This SF.Net email is sponsored by:
Power Architecture Resource Center: Free content, downloads, discussions,
and more. http://solutions.newsforge.com/ibmarch.tmpl
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to