Wget would be my preferred approach, but, the files are beyond an
authentication paywall which seems to use cookies. I tried passing the
cookies to wget, but for some reason it fails to fetch the file.

Conkeror stores cookies in a sqlite database, which has to be
converted to another format for wget; I used some scripts available on
the web but I can't really check if the conversion is done properly.

Wget might also be confused due to the fact that the links aren't
direct, they're like "getFile.aspx?file=document.pdf", which I'm
guessing is a scripted redirect that could be throwing wget off.

— Andre


On Sunday, July 10, 2011 at 10:02 PM, Jörg Sommer wrote:

> Hi Andre,
>
> Andre Braga Reis <[email protected] (mailto:[email protected])> 
> wrote:
> > I'm trying to automate Conkeror with mozrepl to go to a website, dump
> > the source, parse all links to PDFs, and then download them (through
> > Conkeror too).
>
> Do you have wget or something similar?
>
> % wget -rl 1 -A \*.pdf $URL
>
> Regards, Jörg.
> --
> And 1.1.81 is officially BugFree(tm), so if you receive any bug-reports
> on it, you know they are just evil lies.
>  (Linus Torvalds)
> _______________________________________________
> Conkeror mailing list
> [email protected] (mailto:[email protected])
> https://www.mozdev.org/mailman/listinfo/conkeror
_______________________________________________
Conkeror mailing list
[email protected]
https://www.mozdev.org/mailman/listinfo/conkeror

Reply via email to