I believe this question was answered on StackOverflow

https://stackoverflow.com/questions/40701227/using-scrapy-linkextractor-to-locate-specific-domain-extensions

On Sunday, November 20, 2016 at 3:11:29 PM UTC+1, lee hodgson wrote:
>
> I want to use Scrapy's LinkExtractor() to only follow links in the .th 
> domain
>
> I see there is a deny_extensions(list) parameter, but no 
> allow_extensions() parameter.
>
> Given that, how do I restrict links just to allow domains in .th ?
>

-- 
You received this message because you are subscribed to the Google Groups 
"scrapy-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/scrapy-users.
For more options, visit https://groups.google.com/d/optout.

Reply via email to