The tutorial is correct; it just uses a different definition of depth
than the one you are using. :)

The depth is essentially the number of links that must be followed
to reach a certain page, counting the starting page itself as depth 1.
For instance:

If you start with http://www.blabla.com/home.html, that page has a
depth of 1.  If that page then contains a link to
http://www.blabla.com/a/b/c/d/e/a.html, that means
http://www.blabla.com/a/b/c/d/e/a.html has a depth of 2.

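Remember, you're talking about a web here.  Each page is a node in the
web.  The first node is at a depth of 1.  Following its links leads you
to nodes at a depth of 2.  Following the links of those nodes takes
you to nodes at a depth of 3.  That is also why the for loop you found
in the source matches this definition: each pass of the fetch loop
follows one more layer of links, so the loop count and the link depth
end up being the same number.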
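
To make the link between "depth" and "number of loop passes" concrete,
here is a rough sketch in Python.  It is not Nutch's actual code: the
function name crawl_depths, the in-memory LINKS table, and the
other.html page are all made up for illustration, and a real crawler
would fetch and parse pages instead of looking them up in a dict.

```python
def crawl_depths(links, seed, max_depth):
    """Return {url: depth} for every page reached within max_depth rounds.

    links is a hypothetical in-memory link graph: {url: [urls it links to]}.
    The seed page is depth 1; each further round follows one more layer
    of links, so a page's depth is the round in which it is first reached.
    """
    depths = {seed: 1}
    frontier = [seed]  # pages discovered in the previous round
    for round_no in range(2, max_depth + 1):
        next_frontier = []
        for url in frontier:
            for target in links.get(url, []):
                if target not in depths:  # first time we see this page
                    depths[target] = round_no
                    next_frontier.append(target)
        frontier = next_frontier
    return depths


# Made-up link graph: home.html links to the deeply nested a.html,
# which in turn links to an invented other.html.
LINKS = {
    "http://www.blabla.com/home.html": [
        "http://www.blabla.com/a/b/c/d/e/a.html",
    ],
    "http://www.blabla.com/a/b/c/d/e/a.html": [
        "http://www.blabla.com/other.html",
    ],
}
```

With max_depth set to 2, the page at /a/b/c/d/e/a.html is already
reached, even though its URL path is six levels deep, because it is
only one link away from the starting page.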

On 6/12/07, Manoharam Reddy <[EMAIL PROTECTED]> wrote:
> the tutorial says that depth value is the level of depth of a page
> from the root of a website. so as per the tutorial, if i want to fetch
> a page say, http://www.blabla.com/a/b/c/d/e/a.html, I must set the
> value of depth >= 6.
>
> but I find in the source code that depth is simply a for loop. It will
> run fetch loop as many number of times as mentioned in the depth
> value. so it has no connection with the depth of a page from the root.
>
> please confirm whether my understanding is right. and if so shouldn't
> the tutorial be corrected in order to prevent noobs like me from being
> misled?
>

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
