Thank you Bill !

I'm trying to get that list of links. If you see it on the browser they're 
within a table inside <p class="txtmedioazulb> , right after "Lista completa 
das localidades (em <b>negrito</b>, com informações)"

That's the only way i figured it out to specify that group of links. I've used 
your method to identify all <a target="_top">, but the result has some links 
from the rest of the page, that are not in that table. 

I can't understand why in the response they're not in a table inside that <p> 
as in the web. If i write it to a .html file, it builds that table quite right.


On 08 Apr 2014, at 14:02, Bill Ebeling <[email protected]> wrote:

> Also, this is how I got there using scrapy shell..
> 
> command list:
> 
> 'Abegoaria de Baixo' in response.body  # returned True, so I knew there was 
> at least a mention somewhere in the page
> hxs.select("//*[contains(.,'Abegoaria de Baixo')]").extract() # an attempt to 
> get lucky on the first try...  didn't work out, produced too much
> hxs.select("//a[contains(.,'Abegoaria de Baixo')]").extract() # my next 
> attempt to get lucky, this one worked out
> hxs.select("//a[@target='_top'").extract() # this produced a list of links 
> about the same length as the list of links your xpath produced on the same 
> page, figured it was about right
> 
> 
> Hope that helps, too
> 
> 
> On Tue, Apr 8, 2014 at 8:55 AM, Bill Ebeling <[email protected]> wrote:
> I didn't drill in entirely to the problem, so I don't know what reindeer 
> games are being played, but I did have some luck with this:
> 
> //a[@target='_top']
> 
> I'll try to figure it out when I get some time later on.
> 
> 
> On Tue, Apr 8, 2014 at 8:37 AM, André Campos <[email protected]> wrote:
> Ok
> 
> http://www.mapadeportugal.net/concelho.asp?c=1401
> 
> You can see in the browser that 
> '//p[@class="txtmedioazulb"]//td[@class="txtmedio"]/a' it's available, but 
> not in shell or spider crawling
> 
> I've tried loading in shell, and the sel.xpath(..) doesn't retrieve nothing, 
> how can i use shell to figure it out ?
> 
> Thanks!
>  
> On 08 Apr 2014, at 13:23, Bill Ebeling <[email protected]> wrote:
> 
>> can you post the link to the actual page?
>> 
>> Without more information, any suggestions would just be guessing.  If you 
>> can't, I'd recommend loading the page in scrapy shell and trying to figure 
>> it out that way.
>> 
>> -- 
>> You received this message because you are subscribed to a topic in the 
>> Google Groups "scrapy-users" group.
>> To unsubscribe from this topic, visit 
>> https://groups.google.com/d/topic/scrapy-users/rvq9fGDPRWI/unsubscribe.
>> To unsubscribe from this group and all its topics, send an email to 
>> [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at http://groups.google.com/group/scrapy-users.
>> For more options, visit https://groups.google.com/d/optout.
> 
> 
> -- 
> You received this message because you are subscribed to a topic in the Google 
> Groups "scrapy-users" group.
> To unsubscribe from this topic, visit 
> https://groups.google.com/d/topic/scrapy-users/rvq9fGDPRWI/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to 
> [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.
> 
> 
> 
> -- 
> You received this message because you are subscribed to a topic in the Google 
> Groups "scrapy-users" group.
> To unsubscribe from this topic, visit 
> https://groups.google.com/d/topic/scrapy-users/rvq9fGDPRWI/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to 
> [email protected].
> To post to this group, send email to [email protected].
> Visit this group at http://groups.google.com/group/scrapy-users.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Reply via email to