If I'm not wrong, segments are used by nutch to store parsed data, and after
update the crawldb, and finally build an index.

But when the crawl is finished, for a next recrawl nutch only need the last
crawldb? so not my old segments.
And for building the new index, it only needs my new indexes and the old
index, not the old segs.
(and it seems for the search engine part segment are used just for "show
page cache copy" ?)

It could be nice space saved to delete the segments, but do my argument is
right? 
-- 
View this message in context: 
http://www.nabble.com/When-can-I-delete-segments--%28still-usefull-after-indexing-%29-tf3413479.html#a9511359
Sent from the Nutch - User mailing list archive at Nabble.com.


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to