I've injected some urls into an existing database and I'm pretty sure
I've run all the fetching and re-indexing (commands only shown, no other
arguments)
________________________________________________
#To inject the urls into the db:
bin/nutch inject

#To perform the crawl:
bin/nutch generate
bin/nutch fetch
bin/nutch updatedb
bin/nutch analyze
#(3 times to get a bit of depth)

#Add the documents to the db
bin/nutch index
#(3 times - once for each segment generated)

#Clean up the duplicated
bin/nutch dedup
------------------------------------------------


#I can verify that a page has been indexed, has "outlinks" and a "score"
so it must have been crawled ok using:
bin/nutch readdb "myindex/db" -pageurl "http://www.domainname.com/";

These injected pages do not show up using the nutch/tomcat front-end
even after a restart (neither tomcat or server restart). Am I missing
out a step somewhere, I've tried reading the tutorial again but havn't
managed to find my mistake. Any ideas guys?


Thanks for your time,
Tom Davidson.


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_ide95&alloc_id396&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to