Re: Switched to Tomcat

2010-11-14 Thread Matthias Paul
Perhaps you are sending a field to Solr which you didn't configure in Solr? Then the SolrIndexer will break. Set the loggers to debug and have a look at your log files. Matthias On Sun, Nov 14, 2010 at 7:35 AM, Eric Martin e...@makethembite.com wrote: Hello, I switched from Jetty to
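
One way to test the "unconfigured field" theory before digging through debug logs is to compare the field names being sent against the fields declared in Solr's schema.xml. The following is only a minimal sketch under stated assumptions: the embedded schema, the field names, and the helper `unknown_fields` are hypothetical, and the glob matching only approximates how Solr resolves `dynamicField` patterns.

```python
import fnmatch
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down schema.xml used only for illustration.
SCHEMA_XML = """
<schema name="example" version="1.3">
  <fields>
    <field name="id" type="string" indexed="true" stored="true"/>
    <field name="title" type="text" indexed="true" stored="true"/>
    <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
  </fields>
</schema>
"""


def unknown_fields(schema_xml, doc_fields):
    """Return the document field names that the schema does not declare."""
    root = ET.fromstring(schema_xml)
    declared = {f.get("name") for f in root.iter("field")}
    patterns = [f.get("name") for f in root.iter("dynamicField")]
    missing = []
    for name in doc_fields:
        if name in declared:
            continue
        # Approximation: treat dynamicField names such as "*_s" as globs.
        if any(fnmatch.fnmatchcase(name, p) for p in patterns):
            continue
        missing.append(name)
    return missing


# "boost" is an assumed example of a field the indexer sends
# but the schema does not know about.
print(unknown_fields(SCHEMA_XML, ["id", "title", "lang_s", "boost"]))  # ['boost']
```

Any name this reports is a candidate for the field that makes the indexer fail; the debug logs remain the authoritative source.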

RE: Switched to Tomcat

2010-11-14 Thread Eric Martin
Thanks, I went back and took a look and restarted the server. Apparently I have more issues than I thought, and it looks like they are outside the scope of the Nutch group. It appears that my Solr installation is not configured correctly for Tomcat. Since this isn't a Solr mailing list, if you want to

Re: Switched to Tomcat

2010-11-14 Thread Robert Douglass
Is this related? http://mostrey.be/configuring-tomcat-provide-utf-8-support-solr (UTF-8 isn't guaranteed by Tomcat - you need to make it explicit) On 11/14/2010 09:51 AM, Eric Martin wrote: Thanks, I went back and took a look and restarted the server. Apparently I have more issues than I
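
To see why the encoding has to be made explicit, the sketch below decodes the same percent-encoded query string in two ways. It illustrates the general failure mode only and is not Tomcat code; the sample word is an assumption. In Tomcat of this era the relevant setting is usually the `URIEncoding` attribute on the `<Connector>` element in server.xml, which falls back to ISO-8859-1 when unset.

```python
from urllib.parse import quote, unquote

# A client (for example a browser) sends the query as UTF-8, percent-encoded.
query = "café"
encoded = quote(query, encoding="utf-8")
print(encoded)  # caf%C3%A9

# A server that assumes ISO-8859-1 turns the two UTF-8 bytes into two characters.
as_latin1 = unquote(encoded, encoding="iso-8859-1")
print(as_latin1)  # cafÃ©

# A server told to use UTF-8 recovers the original text.
as_utf8 = unquote(encoded, encoding="utf-8")
print(as_utf8)  # café
```

A search for the mangled form will not match documents indexed with the correct text, which is how this kind of misconfiguration tends to surface.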

Re: can nutch's crawler run parallel?

2010-11-14 Thread Dennis Kubes
Yes, using a Hadoop cluster and the different fetcher settings for the number of fetcher threads, you can run as many crawlers as you want in parallel. Nutch's crawler does obey robots.txt and is polite in that all pages from a given domain are fetched on a single machine. Dennis On 11/12/2010 05:34
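
The politeness property described above (every page of a given host ends up on one machine) can be sketched by partitioning URLs on a stable hash of the host name. This is an illustration of the idea only, not Nutch's actual partitioner; the function name `partition_for`, the choice of MD5, and the sample URLs are assumptions.

```python
import hashlib
from collections import defaultdict
from urllib.parse import urlsplit


def partition_for(url, num_partitions):
    """Map a URL to a partition index based only on its host name."""
    host = urlsplit(url).hostname or ""
    # A stable digest (unlike Python's built-in hash()) gives the same
    # answer on every machine and in every process.
    digest = hashlib.md5(host.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions


# Hypothetical fetch list.
URLS = [
    "http://example.com/a",
    "http://example.com/b?x=1",
    "http://example.org/index.html",
    "http://example.net/docs/",
]

# Group the fetch list by partition, as if each partition were one machine.
assignments = defaultdict(list)
for url in URLS:
    assignments[partition_for(url, 4)].append(url)

for partition, urls in sorted(assignments.items()):
    print(partition, urls)
```

Because the partition depends only on the host, one machine sees all requests to that host and can enforce its crawl delay locally.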

Re: can nutch's crawler run parallel?

2010-11-14 Thread Dennis Kubes
Yes, using Hadoop you can run multiple Nutch jobs in parallel as long as the output directories don't conflict. For example, you can run multiple updatedb jobs on different crawldbs at the same time, but not on the same crawldb going to the same output directory. Dennis On 11/13/2010 10:20
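
The rule about output directories can be checked before any jobs are launched. The sketch below is a hypothetical pre-flight check, not part of Nutch or Hadoop: the job names, paths, and the helper `conflicting_jobs` are assumptions for illustration.

```python
import posixpath
from collections import defaultdict


def conflicting_jobs(jobs):
    """Return {output_dir: [job names]} for directories claimed by more than one job."""
    by_output = defaultdict(list)
    for name, output_dir in jobs:
        # Normalize so that "crawl-a/crawldb/" and "crawl-a/crawldb" compare equal.
        by_output[posixpath.normpath(output_dir)].append(name)
    return {path: names for path, names in by_output.items() if len(names) > 1}


# Hypothetical plan: three updatedb jobs, two of which target the same crawldb.
PLANNED = [
    ("updatedb-a", "crawl-a/crawldb"),
    ("updatedb-b", "crawl-b/crawldb"),
    ("updatedb-c", "crawl-a/crawldb/"),
]

print(conflicting_jobs(PLANNED))  # {'crawl-a/crawldb': ['updatedb-a', 'updatedb-c']}
```

Jobs that appear in the result should be run one after another; the rest are safe to run at the same time.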

how to cut indexes for distributed searching?

2010-11-14 Thread 朱诗雄
Hi all, I want to use Nutch for distributed searching, but I don't know how to cut the indexes for it. Is there a guide for that? -- Thanks and best regards. zsx