Delaying all media downloads until the very end

Antoine Brunel Fri, 22 Apr 2016 09:22:26 -0700

Hello (awesome) scrapy community,

According to scrapy media pipeline docs 
(http://doc.scrapy.org/en/latest/topics/media-pipeline.html), after one url 
is scraped, all of its media files are downloaded, with a higher priority 
so that no other url is scraped before all media files were downloaded.


> - When the item reaches the FilesPipeline, the URLs in the file_urls 
> field are scheduled for download using the standard Scrapy scheduler and 
> downloader (which means the scheduler and downloader middlewares are 
> reused), but with a higher priority, processing them before other pages are 
> scraped. The item remains “locked” at that particular pipeline stage until 
> the files have finish downloading (or fail for some reason).


I just want to do the exact opposite: Scrape all urls first, then, download 
all media files at once. 
How could I do that?

Thanks!

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Delaying all media downloads until the very end

Reply via email to