Perhaps you are sending a field to Solr which you didn't configure in Solr?
Then the SolrIndexer will break.
Set the loggers to debug and have a look at your log-files.
Matthias
On Sun, Nov 14, 2010 at 7:35 AM, Eric Martin e...@makethembite.com wrote:
Hello,
I switched from Jetty to
Thanks, I went back and took a look and restarted the server. Apparently I
have more issues than I though and looks like it is outside the scope of the
nutch group. IT appears that my Solr installation is not configured
correctly for Tomcat. Since this isn't a Solr mailing list, if you want to
Is this related?
http://mostrey.be/configuring-tomcat-provide-utf-8-support-solr
(UTF 8 isn't guaranteed by Tomcat - you need to make it explicit)
On 11/14/2010 09:51 AM, Eric Martin wrote:
Thanks, I went back and took a look and restarted the server. Apparently I
have more issues than I
Yes, using a hadoop cluster and the different fetcher settings for
number of fetcher threads you run as many crawlers as you want in parallel.
Nutch's crawler does obey robots.txt and is polite in that all pages
from a given domain are fetched on a single machine.
Dennis
On 11/12/2010 05:34
Yes, using hadoop you can run multiple nutch jobs in parallel as long as
the output directories don't conflict. For example you can run multiple
updatedb jobs on different crawldbs at the same time but not on the same
crawldb going to the same output directory.
Dennis
On 11/13/2010 10:20
hi,all
I want to use nutch for distributed searching. But I don't know how to
cut indexes for distributed searching?
Is there a guide for that?
--
Thanks and best regards.
zsx
6 matches
Mail list logo