Hi Nitin

> IIRC, the tutorial requires you to start the tomcat instance so it knows
> where your index is.
> Are you starting tomcat from the directory that has your index (the
> suggested way in the tutorial) ?
> Or are you indicating to the search servlet the location of your index
> in some other way?

The problem is that Plesk disables Tomcat support by default until you upgrade to a more expensive license from SWsoft. Once you upgrade the license key, Tomcat magically appears in the Plesk control panel and you can then set up applications directly through Plesk.

I've tried to avoid using Plesk when configuring nutch, but there are inherent problems. For example, catalina.sh does not exist on a Plesk server. They have either renamed it or removed it, and the only way to start, restart or stop Tomcat is via the Plesk control panel.

> http://today.java.net/pub/a/today/2006/02/16/introduction-to-nutch-2.html
> shows how to set the index dir in nutch-site.xml. So either you need to
> do this or start tomcat from the index dir.

I tried to do this this afternoon, after I was unable to start tomcat from the index directory. I figured this would work, as it forces tomcat to pull data from the directory I specify, which in my case is "/usr/local/nutch/crawl/". This contains my indexes, linkdb, segments, etc. folders.

The problem, I believe, all has to do with stupid Plesk. For example, most of the tutorials reference the following:

~/tomcat/webapps/ROOT

but with the way Plesk structures things, the equivalent path is actually:

/usr/share/tomcat5/psa-wars/domain.com/

My problem with the tutorial is simply that, because of this restructuring by Plesk, there is no WEB-INF/classes/ folder for me to store this xml file in. I've gone through the whole tomcat5 structure on the server, and if I were to put the nutch-site.xml file anywhere, my best guess would be /usr/share/tomcat5/psa-wars/domain.com/, as the nutch-0.8.1.war file is located in this directory.

Not an ideal situation, this....

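As a rough sketch, the nutch-site.xml override discussed above might look something like the following, with searcher.dir pointing at the crawl directory mentioned earlier. The `<configuration>` root element is an assumption for the 0.8.x series (the 0.7.1 example quoted in this thread uses `<nutch-conf>`), and where the file has to live under Plesk's layout remains the open question.

```xml
<?xml version="1.0"?>
<!-- Sketch only: site-specific property overrides for the search webapp. -->
<!-- Assumption: nutch 0.8.x expects <configuration> as the root element. -->
<configuration>
  <property>
    <!-- Directory containing the indexes, linkdb and segments folders. -->
    <name>searcher.dir</name>
    <value>/usr/local/nutch/crawl/</value>
  </property>
</configuration>
```
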
Regards
Justin

On 12/29/06, Nitin Borwankar <[EMAIL PROTECTED]> wrote:
> Nitin Borwankar wrote:
>
> > Justin Hartman wrote:
> >
> >> Hi guys
> >>
> >> I have my nutch system working pretty reasonably I think and I am
> >> quite happy with the way it is fetching, crawling and indexing. I do
> >> have a problem however in that I can not figure out how to make the
> >> http searches pull data from the index.
> >
> > [....]
> >
> > Hi Justin,
> >
> > IIRC, the tutorial requires you to start the tomcat instance so it
> > knows where your index is.
> > Are you starting tomcat from the directory that has your index (the
> > suggested way in the tutorial) ?
> > Or are you indicating to the search servlet the location of your index
> > in some other way?
> >
> > Nitin
> >
>
> <?xml version="1.0"?>
> <?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>
>
> <!-- Put site-specific property overrides in this file. -->
>
> <nutch-conf>
>   <property>
>     <name>searcher.dir</name>
>     <value>/Users/tom/Applications/nutch-0.7.1/crawl-tinysite</value>
>   </property>
> </nutch-conf>
>
> --
> Nitin Borwankar
> Find, Learn, Act ....
> Greener, the search engine for the planet
> http://greener.com
> [EMAIL PROTECTED]
> 510-872-7066

--
Regards
Justin Hartman
PGP Key ID: 102CC123

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
