When you run "nutch index" and give it the list of segments, it will index
them all into one single index.
Segments are just different chunks of your crawldb.

What is less clear to me is what happens once the expiry date has passed:
URLs will be recrawled and so duplicated across different segments, and I am
not sure how that is taken care of.
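
To make the duplicate question concrete, here is a toy sketch in Python. It is
not Nutch code, and the names (build_index, depth1, depth2, recrawl) are
hypothetical. It assumes one plausible policy: when the same URL shows up in
several segments, keep only the most recent fetch. Nutch does ship a separate
"dedup" command, but whether it follows exactly this rule is an assumption
here, not something confirmed in this thread.

```python
# Toy model, not Nutch code: each segment maps URL -> (fetch_time, parsed_text).
from typing import Dict, List, Tuple

Segment = Dict[str, Tuple[int, str]]


def build_index(segments: List[Segment]) -> Dict[str, Tuple[int, str]]:
    """Merge all segments into one index, keeping the newest fetch per URL."""
    index: Dict[str, Tuple[int, str]] = {}
    for segment in segments:
        for url, (fetch_time, text) in segment.items():
            # Keep this entry only if the URL is new or this fetch is more recent.
            if url not in index or fetch_time > index[url][0]:
                index[url] = (fetch_time, text)
    return index


# Hypothetical data: depth 1 and depth 2 segments, plus a later recrawl of a.html.
depth1 = {"http://example.com/a.html": (100, "old text of a")}
depth2 = {"http://example.com/b.html": (200, "text of b")}
recrawl = {"http://example.com/a.html": (300, "new text of a")}

index = build_index([depth1, depth2, recrawl])
for url, (fetch_time, text) in sorted(index.items()):
    print(url, fetch_time, text)
```

In this model a search runs against the single merged index, so pages from
both depths are found, and the recrawled page appears once with its latest
content.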



2009/7/17 Saurabh Suman <[email protected]>

>
> As I observed, Nutch makes a new folder with the current timestamp in the
> segments directory for each depth. Does the new folder made under the
> segments directory while crawling at depth 2 contain all the URLs and
> parsed text of the previous depth, or does it just overwrite the previous
> one? If I search for a query string, will it search from depth 1 or
> depth 2?
> --
> View this message in context:
> http://www.nabble.com/How-segment-depends-on-depth-tp24532471p24532471.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>


-- 
-MilleBii-
