You can set the allowed_domains attribute of the spider to only include the domains you're scraping...
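
In the spider itself this is just a class attribute, e.g. allowed_domains = ["example.com"]. To make the idea concrete, here is a rough sketch in plain Python (no Scrapy import) of the kind of host check that decides whether a URL stays on the current site. The helper name is_onsite and the domain example.com are made up for illustration, and this is not Scrapy's actual implementation, which lives in its offsite middleware and may differ in details.

```python
from urllib.parse import urlparse


def is_onsite(url, allowed_domains):
    """Return True if the URL's host is an allowed domain or a subdomain of one.

    Hypothetical helper for illustration only; not part of Scrapy.
    """
    # hostname is None for URLs without a network location, so fall back to "".
    host = (urlparse(url).hostname or "").lower()
    return any(
        host == domain or host.endswith("." + domain)
        for domain in allowed_domains
    )


# Placeholder domain list, mirroring what a spider's allowed_domains might hold.
allowed_domains = ["example.com"]

for target in ("http://example.com/page", "https://twitter.com/share"):
    verdict = "follow" if is_onsite(target, allowed_domains) else "skip"
    print(verdict, target)
```

Matching on "." + domain rather than a bare suffix keeps a host like notexample.com from being treated as part of example.com.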
On Thu, Oct 23, 2014 at 8:54 AM, Tibo <[email protected]> wrote:
> Hi everyone !!!
>
> I'm having some problems : I had to stop using 301 redirects because
> scrapy was just scraping everywhere (following Twitter/FB links amongst
> other things).
>
> Is there a way to accept redirects only if they're done within the current
> website ?
>
> Thanks !
> Tibo
>
> --
> You received this message because you are subscribed to the Google Groups
> "scrapy-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.
