I'm wondering if it would be useful for the parser to give more detail of why it couldn't get a particular page. Right now if I tap on a link that should go to a pdf file for example, I will get a message saying the link was not downloaded and it lists a few reasons why. Would it be better to display a small document created by the parser that says the link requested contained a document type not supported by the parser and give the application/pdf content type?
Maybe the same could be done for 404 errors, timeouts, etc? I would leave links that have been maxdepth'd out or excluded due to stayonhost alone (i.e. keep the current behaviour). Pros: would give the user more info about the link they are trying to pluck so perhaps they could fix it? Cons: slightly (?) larger database size with accompanied longer sync time might get requests to handle certain types of documents (i.e. pdf conversion) Perhaps most users look at the pages first to see what will get plucked? Comments? Bill _______________________________________________ plucker-dev mailing list [EMAIL PROTECTED] http://lists.rubberchicken.org/mailman/listinfo/plucker-dev