Check your request headers. Sending too many or too few can cause a 403. Look at which headers Chrome sends for XXX.com/b (the Network tab in DevTools shows them) and replicate those in your spider.
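As a rough sketch of what "replicate" could look like: the helper below turns a block of "Name: value" lines, as you might copy them from Chrome's request headers view, into a dict. The function name `parse_copied_headers` and every header value shown are placeholders I made up for illustration, not what XXX.com actually expects, so substitute whatever your own browser sends.

```python
def parse_copied_headers(raw):
    """Convert 'Name: value' lines (e.g. copied from Chrome) into a dict.

    Hypothetical helper for illustration; not part of Scrapy.
    """
    headers = {}
    for line in raw.strip().splitlines():
        line = line.strip()
        # Skip blank lines and HTTP/2 pseudo-headers such as ":authority".
        if not line or line.startswith(":"):
            continue
        # Split on the first colon only, so values like URLs stay intact.
        name, _, value = line.partition(":")
        headers[name.strip()] = value.strip()
    return headers


# Placeholder values: replace with the headers your browser really sends.
RAW_HEADERS = """
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.8
Referer: http://XXX.com/a
User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/35.0.1916.114 Safari/537.36
"""

BROWSER_HEADERS = parse_copied_headers(RAW_HEADERS)
```

In a spider you could then pass the dict per request, e.g. `scrapy.Request(url, headers=BROWSER_HEADERS, callback=self.parse)`, or set it project-wide through `DEFAULT_REQUEST_HEADERS` (and `USER_AGENT`) in settings.py. If the 403 persists, try removing or adding one header at a time to find the one the server cares about.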

-- 
Jonathan Piccolo | [email protected] | 305-926-6739

On Jun 8, 2014, at 1:04 PM, Wei Lo <[email protected]> wrote:

> I am now using scrapy to crawl massive data from a website. I encountered the
> following problem:
>
> When I want to crawl data from XXX.com/a, the response code is 200 and it
> works without any problem.
>
> Yet when I want to crawl data from XXX.com/b, the response code is ALWAYS
> 403. Remember that XXX indicates the same domain.
>
> I use chrome browser to take a look at XXX.com/b, it works without any
> problem.
>
> Can somebody give me comments on what is going on here? Thank you!!!
>
> --
> You received this message because you are subscribed to the Google Groups
> "scrapy-users" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.
