crawl failing

2008-12-31 Thread blackwater dev
I just grabbed Nutch and ran some crawls. When I did a search I got an error, so I looked in the log and saw this: Indexer: starting Indexer: linkdb: crawl.test/linkdb Indexer: adding segment: crawl.test/segments/20081224211810 Indexer: adding segment: crawl.test/segments/200812

Removal of deleted pages from the index

2008-12-31 Thread Rinesh1
Hi, Please give your suggestions for the following problem. I have a web app with 3 pages: index.jsp, p1.jsp and p2.jsp. Links to p1.jsp and p2.jsp are given in the index.jsp page. I have crawled the site and have started the Nutch web application. I am able to search conten

Recrawling updated pages

2008-12-31 Thread Rinesh1
Hi, I was trying to test a scenario in Nutch. Scenario: I have a page P1 which has content C1. I have indexed it using bin/nutch .. I have redeployed Nutch and on searching I am able to find C1. Now in the same page P1 I ha