Re: Don't understand why my scrapy spider is not extracting items

Jakob de Maeyer Tue, 15 Jul 2014 05:17:31 -0700

Hey Peter,

if you want to parse a response and not just follow links in it, you
need to specify the 'callback' parameter of the crawling rule, i.e. add
"callback = 'parse_page'" in line 16.


Be aware that if you keep the "follow=True", your spider will follow any
links within bouwmachineforum.nl found in the start response and all of
the subsequent responses. You will probably end up crawling *all* pages
publicly available at bouwmachineforum.nl.


Cheers,
-Jakob


On 07/15/2014 01:54 PM, Peter van den Toorn wrote:
> Dear programmers,
> 
> I am new to Scrapy and have just developed my first spider. The
> extraction of items worked fine untill i renamed my method name from
> parse to parse_page (this is needed for the CrawlSpider class).
> 
> Any help is much appreciated.
> 
> My code
> <https://github.com/petervdtoorn/scrapers/blob/master/Spiders/bouwmachineforum/bouwmachineforum/spiders/spider.py>
> 
> -- 
> You received this message because you are subscribed to the Google
> Groups "scrapy-users" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to [email protected]
> <mailto:[email protected]>.
> To post to this group, send email to [email protected]
> <mailto:[email protected]>.
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Re: Don't understand why my scrapy spider is not extracting items

Reply via email to