Michael Wechner wrote:

Stefan Groschupf wrote:

have you collected these offers somewhere?


Check the source-forge mail archive.



thanks, will do.

btw, is there an interface within Nutch, where a CMS (e.g. Apache Lenya) can notify Nutch about content changes (or deletion of pages or renaming of URLs)?


I'm new to Nutch and haven't tried this yet, but I've had the same kind of question.

I think the answer is: IWebDBWriter.addPageIfNotPresent()
or maybe just addPage() in your case, for updates.

http://nutch.sourceforge.net/docs/api/net/nutch/db/IWebDBWriter.html#addPageIfNotPresent(net.nutch.db.Page)

See WebDBInjector for sample usage, esp as now you have to create a Page to get this call to work right.




I guess this would make crawling "obsolete" to a certain point (at least for pages
being created by content management systems).


Thanks

Michi








------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to