Maybe the book of Dimitrios Kouzis-Loukas https://www.packtpub.com/big-data-and-business-intelligence/learning-scrapy will help you. It has a chapter about performance.
On Thursday, March 3, 2016 at 10:13:01 AM UTC+2, Berkant AYDIN wrote: > > Hi everyone, > > I have to do realtime scraping. I try optimization options on > documentation but still slowly. 1600 page crawling only 9 seconds. Yea its > very speedy but still not enough. 860 mb/s AWS machine. How can increase > performance ? I have to use distributed options ? If yes, which one ? It's > a GIL problem ? I have to continue with PyPy ? > > -- You received this message because you are subscribed to the Google Groups "scrapy-users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/scrapy-users. For more options, visit https://groups.google.com/d/optout.
