I'm wondering if it would be useful for the parser to give more detail of
why it couldn't get a particular page.  Right now if I tap on a link that
should go to a pdf file for example, I will get a message saying the link
was not downloaded and it lists a few reasons why.  Would it be better to
display a small document created by the parser that says the link requested
contained a document type not supported by the parser and give the
application/pdf content type?

Maybe the same could be done for 404 errors, timeouts, etc?  I would leave
links that have been maxdepth'd out or excluded due to stayonhost alone
(i.e. keep the current behaviour).

Pros:
   would give the user more info about the link they are trying to pluck so
perhaps they could fix it?

Cons:
  slightly (?) larger database size with accompanied longer sync time
  might get requests to handle certain types of documents (i.e. pdf
conversion)

Perhaps most users look at the pages first to see what will get plucked?

Comments?

Bill



_______________________________________________
plucker-dev mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-dev

Reply via email to