I've injected some URLs into an existing database, and I'm pretty sure I've run all the fetching and re-indexing steps (only the commands are shown, without their arguments):

________________________________________________
# To inject the URLs into the db:
bin/nutch inject

# To perform the crawl (3 times to get a bit of depth):
bin/nutch generate
bin/nutch fetch
bin/nutch updatedb
bin/nutch analyze

# Add the documents to the db (3 times, once for each segment generated):
bin/nutch index

# Clean up the duplicates:
bin/nutch dedup
------------------------------------------------

I can verify that a page has been indexed and has "outlinks" and a "score", so it must have been crawled OK, using:

bin/nutch readdb "myindex/db" -pageurl "http://www.domainname.com/"

These injected pages do not show up in the Nutch/Tomcat front-end, even after a restart (neither a Tomcat restart nor a server restart helps).

Am I missing a step somewhere? I've tried reading the tutorial again but haven't managed to find my mistake.

Any ideas, guys?

Thanks for your time,
Tom Davidson.
