Sorry, I just found the *deny_extensions* parameter. Problem solved!
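
For anyone who finds this thread later, here is a rough sketch of the idea, not a definitive recipe. It builds a custom deny list by taking the default ignore list and removing the PDF and office-suite extensions, so those links are no longer filtered out. To keep it runnable without Scrapy installed, `SAMPLE_IGNORED_EXTENSIONS` is an abbreviated stand-in for the real `IGNORED_EXTENSIONS`, and `build_deny_extensions` and `is_denied` are hypothetical helpers that only mimic the extension check; they are not part of Scrapy.

```python
import posixpath
from urllib.parse import urlparse

# Assumption: an abbreviated stand-in for Scrapy's IGNORED_EXTENSIONS.
# The real list in scrapy.linkextractor is longer.
SAMPLE_IGNORED_EXTENSIONS = [
    "jpg", "png", "gif", "mp3", "zip", "exe",
    "pdf", "doc", "docx", "xls", "ppt", "odt",
]

# Extensions we actually want the link extractor to follow.
WANTED = {"pdf", "doc", "docx", "xls", "ppt", "odt"}


def build_deny_extensions(ignored, wanted):
    """Return the ignore list with the wanted extensions removed."""
    return [ext for ext in ignored if ext not in wanted]


def is_denied(url, deny_extensions):
    """Hypothetical helper: True if the URL path ends in a denied extension."""
    ext = posixpath.splitext(urlparse(url).path)[1].lower()
    return ext.lstrip(".") in deny_extensions


deny = build_deny_extensions(SAMPLE_IGNORED_EXTENSIONS, WANTED)

# With Scrapy installed, the same list would be passed to the extractor,
# starting from the real IGNORED_EXTENSIONS instead of the sample above:
#   from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
#   extractor = SgmlLinkExtractor(deny_extensions=deny)
```

Passing an explicit `deny_extensions` replaces the default list rather than adding to it, which is why the sketch starts from the defaults and subtracts, so images and archives are still skipped.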
On Thursday, May 8, 2014 at 14:22:02 UTC+2, James Ford wrote:
>
> Hello,
>
> I am wondering if it's possible to change the default IGNORED_EXTENSIONS
> as found in scrapy.linkextractor?
>
> What I want to achieve is to use the SgmlLinkExtractor to extract PDF and
> office-suite URLs.
>
> Thanks,
