Just to make this more clear: I want the url of the page the anchor is
on, not the url of the anchor (which, of course, is the page I already
have).
Thanks.
-lucas
On May 16, 2005, at 7:46 PM, Lucas Rockwell wrote:
Hi all,
I am fairly new to nutch (but I have been wading through the code,
docs and mailing lists) and I am wondering if there is a way to get
the url of an anchor as well as the text of an anchor? I have a
feeling there is, but I have not pulled things apart enough to really
know for sure.
Any help would be much appreciated.
Thanks.
-lucas
p.s. nutch is a first-rate piece of software. Thanks to all who have
labored over this amazing tool!