yes, thanks so much

2014/1/20 Jan Wrobel <[email protected]>

> Hi,
>
> You can use response.url:
>
> http://doc.scrapy.org/en/latest/topics/request-response.html#scrapy.http.Response.url
>
> Cheers,
> Jan
>
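To make the suggestion above concrete, here is a minimal sketch of how response.url could feed the update query. It is not the original poster's final code: the helper name store_h1 is made up for illustration, and sqlite3 stands in for MySQLdb so the snippet runs without a MySQL server. The `response` argument only needs a `.url` attribute, which Scrapy's Response provides.

```python
import sqlite3  # stand-in for MySQLdb (assumption) so the sketch runs anywhere


def store_h1(db, response, h1_values):
    """Write the first extracted <h1> back to the row of the crawled URL.

    store_h1 is a hypothetical helper. `db` is a DB-API connection and
    `response` is any object with a `.url` attribute.
    """
    if not h1_values:
        return 0  # nothing was extracted, so leave the row untouched
    cursor = db.cursor()
    # Placeholders let the driver do the quoting instead of string
    # formatting. sqlite3 uses "?"; MySQLdb uses "%s" in the same position.
    cursor.execute(
        "update urls set h1 = ? where url = ?",
        (h1_values[0], response.url),
    )
    db.commit()
    return cursor.rowcount  # number of rows updated
```

Inside parse() this would be called as store_h1(self.db, response, item['h1']). One caveat: if a request was redirected, response.url is the final URL and may not match the URL stored in the table. In that case the original URL could be passed along with Request(url[0], meta={'db_url': url[0]}) and read back as response.meta['db_url'] (the key name db_url is also an assumption).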
>
> On Fri, Jan 17, 2014 at 9:59 PM, d4v1d <[email protected]> wrote:
> > Hello,
> > I have a database with a list of URLs.
> > I would like to crawl each URL to find the h1 and update the row (in the
> > database) with the value of the h1,
> > but I don't understand how to get the value of the URL for the update
> > query.
> > I tried url[0], but it tells me that url is not defined.
> > Thanks in advance for your help.
> > Regards
> >
> >
> > class H1searchSpider(BaseSpider):
> >     name = "h1search"
> >
> >     def start_requests(self):
> >         self.db = MySQLdb.connect(host="localhost", user="root", passwd="",
> >             db="crawler_engine", charset='utf8', use_unicode=False)
> >         cur = self.db.cursor()
> >         cur.execute("select url from urls")
> >         for url in cur.fetchall():
> >             yield Request(url[0])
> >
> >     def parse(self, response):
> >         hxs = HtmlXPathSelector(response)
> >         item = DmozItem()
> >         item['h1'] = hxs.select('//h1').extract()
> >         cursor = self.db.cursor()
> >         for j in range(len(item['h1'])):
> >             cursor = self.db.cursor()
> >             sql = "update urls set h1 = '%s' where url = '%s'" % (item['h1'][0], url[0])
> >             cursor.execute(sql)
> >             self.db.commit()
> >         return item
> >
> >
> >
> > --
> > You received this message because you are subscribed to the Google Groups
> > "scrapy-users" group.
> > To unsubscribe from this group and stop receiving emails from it, send an
> > email to [email protected].
> > To post to this group, send email to [email protected].
> > Visit this group at http://groups.google.com/group/scrapy-users.
> > For more options, visit https://groups.google.com/groups/opt_out.



-- 
David LANGLADE
5 rue du patuel
42800 Saint martin la plaine
Tel : 06.49.42.38.85