Are you adding this to nutch-site within your webapp or just in your root Nutch installation. This needs to be included in your webapp version of nutch-site.xml. In my experience this was a small case of confusion at first.
On Fri, Jul 15, 2011 at 7:03 PM, Chip Calhoun <ccalh...@aip.org> wrote: > You've gotten me very close to a breakthrough. I've started over, and I've > found that If I don't make any edits to nutch-site.xml, I get a working > Nutch web app; I have no index and all of my searches fail, but I have > Nutch. When I add my crawl location to nutch-site.xml and restart Tomcat, > that's when I start getting the 404 with the "The requested resource () is > not available" message. > Clearly I'm doing something wrong when I edit nutch-site.xml. I'm going to > paste the entire contents of my nutch-site.xml. Where am I screwing this > up? > > Thanks for your help on this. > > <?xml version="1.0"?> > <configuration> > <property> > <name>http.agent.name</name> > <value>nutch-solr-integration</value> > </property> > <property> > <name>generate.max.per.host</name> > <value>100</value> > </property> > <property> > <name>plugin.includes</name> > > <value>protocol-http|urlfilter-regex|parse-html|index-(basic|anchor)|query-(basic|site|url)|response-(json|xml)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)</value> > </property> > <property> > <name>searcher.dir</name> > <value>C:/Apache/apache-nutch-1.2/crawl<value> > </property> > </configuration> > > > -----Original Message----- > From: lewis john mcgibbney [mailto:lewis.mcgibb...@gmail.com] > Sent: Thursday, July 14, 2011 5:38 PM > To: user@nutch.apache.org > Subject: Re: Deploying the web application in Nutch 1.2 > > On Thu, Jul 14, 2011 at 8:01 PM, Chip Calhoun <ccalh...@aip.org> wrote: > > > Thanks Lewis. > > > > I'm still having trouble. I've moved the war file to > > $CATALINA_HOME/webapps/nutch/ and unpacked it. I don't' seem to have > > a "catalina.sh" file, so I've skipped that step. > > > From memory the catalina.sh file is used to start you Tomcat server > instance... this has nothing to do with Nutch. Regardless of what lind of > WAR files you have in your Tomcat webapps directory, starting your tomat > server from the command line sould be the same... > > And I've added the following to > > C:\Apache\Tomcat-5.5\webapps\nutch\WEB-INF\classes\nutch-site.xml : > > > > As far as a I can remember nutch-site.xml is already there, however you > need to specify various property values after this has been uploaded the > first time. After rebooting Tomcat all of your property setting will be > running. > > > > > > <property> > > <name>searcher.dir</name> > > <value>C:\Apache\apache-nutch-1.2\crawl<value> <!-- There must be a > > crawl/index directory to run off !--> </property> > > > > Looks fine, however please remove the <!...> as this is not required. > > > > > However, when I go to http://localhost:8080/nutch/ I always get a 404 > with > > the message, "The requested resource () is not available." What am I > > missing? > > > > As I said the name of the WAR file needs to be identical to the webapp you > specify in the tomcat URL... can you confirm this. There should really be > no > problem starting up the Nutch web app if you follow the tutorial carfeully. > > > > Thanks, > > Chip > > > > -----Original Message----- > > From: lewis john mcgibbney [mailto:lewis.mcgibb...@gmail.com] > > Sent: Thursday, July 14, 2011 5:40 AM > > To: user@nutch.apache.org > > Subject: Re: Deploying the web application in Nutch 1.2 > > > > Hi Chip, > > > > Please see this tutorial for 1.2 administration [1], many people have > been > > using it recently and as far as I'm aware it is working perfectly. > > > > Please post back if you have any troubles > > > > [1] http://wiki.apache.org/nutch/NutchTutorial > > > > > > > > On Wed, Jul 13, 2011 at 5:50 PM, Chip Calhoun <ccalh...@aip.org> wrote: > > > > > I'm a newbie trying to set up a Nutch 1.2 web app, because it seems a > > > bit better suited to my smallish site than the Nutch 1.3 / Solr > > > connection. I'm going through the tutorial at > > > http://wiki.apache.org/nutch/Nutch_-_The_Java_Search_Engine , and I've > > > hit the following instruction: > > > > > > Deploy the Nutch web application as the ROOT context > > > > > > I'm not sure what I'm meant to do here. I get the idea that I'm > > > supposed to replace the current contents of > > > $CATALINA_HOME/webapps/ROOT/ with something from my Nutch directory, > but > > I don't know what from my Nutch > > > directory I'm supposed to move. Can someone please explain what I > need > > to > > > move? > > > > > > Thanks, > > > Chip > > > > > > > > > > > -- > > *Lewis* > > > > > > -- > *Lewis* > -- *Lewis*