I believe this question was answered on StackOverflow https://stackoverflow.com/questions/40701227/using-scrapy-linkextractor-to-locate-specific-domain-extensions
On Sunday, November 20, 2016 at 3:11:29 PM UTC+1, lee hodgson wrote: > > I want to use Scrapy's LinkExtractor() to only follow links in the .th > domain > > I see there is a deny_extensions(list) parameter, but no > allow_extensions() parameter. > > Given that, how do I restrict links just to allow domains in .th ? > -- You received this message because you are subscribed to the Google Groups "scrapy-users" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/scrapy-users. For more options, visit https://groups.google.com/d/optout.
