Thanks for everyones help so far from my postings. Here is another question.
I am currently merging my crawls, but am wondering if I can skip a few steps and how to do it. I inject a whole slew of urls into a crawl each time, and then merge it with the crawl previously to that. The urls injected are the same each time. Now, my merged segments directory is starting to get larger and the indexing is starting to get slower. However, I only use the generated Lucene index for my website, not any of the segments, etc. Plus, I restart the crawl each and every time. So, would I be able to give the de duper the two lucene index directories I have, and then use IndexMerger to combine the indexes into a new lucene index, and skip over the merge of the linkdb, crawld, segments ? Thanks, S -- View this message in context: http://www.nabble.com/Fun-question-for-index-merge-tf2861621.html#a7995826 Sent from the Nutch - User mailing list archive at Nabble.com. ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
