On 9/14/06, Tomi NA <[EMAIL PROTECTED]> wrote:
> On 9/5/06, Zaheed Haque <[EMAIL PROTECTED]> wrote:
> > Hi:
>
> I have a problem or two with the described procedure...
>
> > Assuming you have
> >
> > index 1 at /data/crawl1
> > index 2 at /data/crawl2
>
> Used ./bin/nutch crawl urls -dir /home/myhome/crawls/mycrawldir to
> generate an index: luke says the index is valid and I can query it
> using luke's interface.
>
> Does the "searcher.dir" value in nutch-(default|site).xml have any
> impact on the way indexes are created?

No, it has no impact on index creation. The searcher.dir value is used
for searching only. nutch-site.xml is the file you should change,
for example:

<property>
  <name>searcher.dir</name>
  <value>/home/myhome/crawls</value>
  <description>
  Path to root of index directories.  This directory is searched (in
  order) for either the file search-servers.txt, containing a list of
  distributed search servers, or the directory "index" containing
  merged indexes, or the directory "segments" containing segment
  indexes.
  </description>
</property>

and in this case the text file should be at:

 /home/myhome/crawls/search-servers.txt
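
A minimal sketch of creating that file from the shell, assuming a single
search server on localhost port 12567 as in your test. SEARCHER_DIR is a
hypothetical variable name, not something Nutch reads; it should hold the
same path as searcher.dir, and it falls back to a scratch directory here so
the snippet can be tried safely:

```shell
#!/bin/sh
# SEARCHER_DIR is a hypothetical variable for this sketch; set it to the
# searcher.dir value (e.g. /home/myhome/crawls). Defaults to a scratch dir.
SEARCHER_DIR="${SEARCHER_DIR:-$(mktemp -d)}"
mkdir -p "$SEARCHER_DIR"

# One "hostname port" pair per line, one line per running search server.
printf '%s\n' 'localhost 12567' > "$SEARCHER_DIR/search-servers.txt"

# Show what was written.
cat "$SEARCHER_DIR/search-servers.txt"
```

The port listed here has to match the port passed to "bin/nutch server".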


> > In nutch-site.xml
> > searcher.dir = /data
>
> This is the nutch-site.xml of the web UI?

Both. I mean tomcat/webapps/ROOT/WEB-INF/classes/nutch-site.xml as
well as NUTCH_HOME/conf/nutch-site.xml.

The web application needs to know where the search-servers.txt file is if
you plan to search through Tomcat.
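
Since both files have to agree, here is a rough sketch of a check that
compares the searcher.dir value in the two copies. The helper name
searcher_dir_of and the WEBAPP_CONF / NUTCH_CONF variables are made up for
this example, the default paths are relative guesses you will need to adjust,
and the parsing assumes <name> and <value> sit on their own lines as in the
property example above:

```shell
#!/bin/sh
# Print the <value> that follows <name>searcher.dir</name> in a config file.
# Assumes the name and value elements are each on their own line.
searcher_dir_of() {
  awk '
    /<name>searcher\.dir<\/name>/ { found = 1; next }
    found && /<value>/ {
      sub(/.*<value>[ \t]*/, "")    # drop everything up to the value text
      sub(/[ \t]*<\/value>.*/, "")  # drop the closing tag and anything after
      print
      exit
    }
  ' "$1"
}

# Hypothetical locations; adjust them to your installation.
WEBAPP_CONF="${WEBAPP_CONF:-tomcat/webapps/ROOT/WEB-INF/classes/nutch-site.xml}"
NUTCH_CONF="${NUTCH_CONF:-conf/nutch-site.xml}"

# Only compare when both files are actually present.
if [ -f "$WEBAPP_CONF" ] && [ -f "$NUTCH_CONF" ]; then
  webapp_dir=$(searcher_dir_of "$WEBAPP_CONF")
  nutch_dir=$(searcher_dir_of "$NUTCH_CONF")
  if [ "$webapp_dir" = "$nutch_dir" ]; then
    echo "searcher.dir matches: $webapp_dir"
  else
    echo "searcher.dir differs: webapp='$webapp_dir' nutch='$nutch_dir'"
  fi
fi
```

If the two values differ, the web UI and the command line tools will look
for search-servers.txt in different places.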

> > Under /data you have a text file called search-server.txt (I think do
> > check nutch-site search.dir description please)
>
> /home/myhome/crawls/search-servers.txt
>
> > In the text file you will have the following
> >
> > hostname1 portnumber
> > hostname2 portnumber
> >
> > example
> > localhost 1234
> > localhost 5678
>
> I placed
> localhost 12567
> (just one instance, to test)
>
> > Then you need to start
> >
> > bin/nutch server 1234 /data/crawl1 &
> >
> > and
> >
> > bin/nutch server 5678 /data/crawl2 &
>
> did that, using port 12567
> ./bin/nutch server 12567 /home/mydir/crawls/mycrawldir &
>
> > bin/nutch org.apache.nutch.search.NutchBean www
> >
> > you should see results :-)
>
> I get:
> ------------
> Exception in thread "main" java.lang.NoClassDefFoundError:
> org/apache/nutch/search/NutchBean
> ------------
>
> What's more, I get no results to any query I care to pass by the Web
> UI, which suggests the UI isn't connected to the underlying
> DistributedSearch server. :\
>
> Any hints, anyone?
>
> TIA,
> t.n.a.
>
