Mark Nottingham wrote:
> Also, if a client doesn't visit for a long time, it will see
> http://journals.aol.com/panzerjohn/abstractioneer/atom.xml?page=2&count=10
> and assume it already has all of the entries in it, because it's fetched
> that URI before.
Yeah. That's what I was worried about too. The couple of test feeds that
I've subscribed to haven't had any new entries yet, so I can't be sure, but
with URLs like that I don't see how it can possibly work.
> Did you find that algorithm wrong, too hard to understand/implement, or
> did you just do a different take on it? Does the approach that you took
> end up having the same result?
The problem I had with the algorithm was that it required two passes. The
first pass gathers all the links, starting with the current feed document
and moving back in time through the archives; the second pass actually
processes the documents, starting with the oldest and moving forwards in
time. This meant either retrieving everything twice or caching every
document retrieved. Neither of those options sounded particularly appealing
to me.
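For illustration, the two-pass approach might be sketched like this. The
`fetch`, `process`, and `archive_link` callables are hypothetical stand-ins
(not part of any spec or real library) for HTTP retrieval, entry handling,
and extracting the history link from a document:

```python
def two_pass(start_url, fetch, process, archive_link):
    """Sketch of the two-pass algorithm: collect, then process oldest-first."""
    # Pass 1: walk the archive chain, newest to oldest, collecting documents.
    docs, url, seen = [], start_url, set()
    while url is not None and url not in seen:
        seen.add(url)
        doc = fetch(url)
        docs.append(doc)
        url = archive_link(doc)  # None when there is no further history link
    # Pass 2: process oldest first. Note this is why every document must be
    # cached (as here, in the `docs` list) or fetched a second time.
    for doc in reversed(docs):
        process(doc)
```

The `docs` list is exactly the caching cost the text objects to: every
archive document is held in memory until the chain has been fully walked.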
My implementation does everything in one pass. I start by processing the
current feed document. If it contains a history link which I haven't seen
before, I'll retrieve and process that document next. Repeat until there are
no more links or I encounter a link that I've seen before. There are subtle
differences in the results that you would get from my algorithm, and
technically what you're suggesting is more accurate, but I don't think the
differences are significant enough to care about.
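The one-pass variant described above might look like the following sketch.
Again, `fetch`, `process`, and `archive_link` are hypothetical stand-ins,
and the `seen` set is what guarantees termination when a link has been
encountered before:

```python
def walk_history(start_url, fetch, process, archive_link):
    """Sketch of the one-pass algorithm: process each document as it is fetched."""
    seen = set()
    url = start_url
    # Stop when there are no more links, or on a link already seen.
    while url is not None and url not in seen:
        seen.add(url)
        doc = fetch(url)
        process(doc)              # processed immediately -- nothing is cached
        url = archive_link(doc)   # None when there is no further history link
```

Unlike the two-pass version, entries are processed newest-first as the chain
is walked, which is the source of the subtle differences in results noted
above.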
Other than that, I skip steps 1 and 2, and I default to using the "next"
link relation (with a fallback to "previous" and "prev"). I may consider
adding support for fh:complete at some point, but for now I'm sticking with
Microsoft's cf:treatAs.
Regards
James