I am trying to modify Nutch to add a level to each page's parse data. By level I mean link depth within a site: if you start crawling a website at its homepage, the homepage is level one. Pages in the same site linked from the homepage are level two, pages linked from those are level three, and so on. I am only counting links within the same site.
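To make the definition concrete, here is a minimal standalone sketch of the numbering I have in mind. It is not Nutch code: the class LinkLevels, the method assignLevels, and the in-memory outlinks map are all made up for illustration, and it assumes "same site" simply means "same host".

```java
import java.net.URI;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Nutch: assigns a level to every page
// reachable from the homepage through same-host links.
public class LinkLevels {

    // outlinks maps a page URL to the URLs it links to (an assumed,
    // in-memory stand-in for whatever the crawl actually records).
    // Malformed URLs would make URI.create throw; a real version
    // would need to handle that.
    public static Map<String, Integer> assignLevels(
            String homepage, Map<String, List<String>> outlinks) {
        String site = URI.create(homepage).getHost();
        Map<String, Integer> levels = new LinkedHashMap<>();
        Deque<String> queue = new ArrayDeque<>();

        levels.put(homepage, 1);          // the homepage is level one
        queue.add(homepage);

        // Breadth-first walk: a page gets the level of the first
        // (shallowest) page that links to it, plus one.
        while (!queue.isEmpty()) {
            String page = queue.remove();
            int nextLevel = levels.get(page) + 1;
            for (String link : outlinks.getOrDefault(page, List.of())) {
                String host = URI.create(link).getHost();
                boolean sameSite = site != null && site.equals(host);
                if (sameSite && !levels.containsKey(link)) {
                    levels.put(link, nextLevel);
                    queue.add(link);
                }
            }
        }
        return levels;
    }
}
```

Links to other hosts are skipped entirely, and a page that is reachable by several paths keeps the smallest level it was first seen at.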

How would I go about modifying Nutch to handle this? I was thinking I would have to modify the Fetcher, adding the level to the parse metadata. What I am not getting is how I would obtain the link level in the first place. I suspect I would have to modify something in the generator, but I don't know what.

Dennis

