[ https://issues.apache.org/jira/browse/SOLR-7929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Erik Hatcher updated SOLR-7929: ------------------------------- Attachment: SOLR-7929.patch Here's a quick patch that just makes "*" for filetypes special in web crawling such that it doesn't do a reverse contentType -> file extension check. > SimplePostTool (also bin/post) -filetypes "*" does not work properly in 'web' > mode > ---------------------------------------------------------------------------------- > > Key: SOLR-7929 > URL: https://issues.apache.org/jira/browse/SOLR-7929 > Project: Solr > Issue Type: Bug > Affects Versions: 5.3 > Reporter: Erik Hatcher > Assignee: Erik Hatcher > Priority: Critical > Fix For: 5.4 > > Attachments: SOLR-7929.patch > > > {code} > $ bin/post -c tmp http://lucene.apache.org/solr/assets/images/book_sia.png > -filetypes “*” > ... > Entering auto mode. Indexing pages with content-types corresponding to file > endings * > Entering crawl at level 0 (1 links total, 1 new) > SimplePostTool: WARNING: Skipping URL with unsupported type image/png > {code} > the mapping from image/png to a file type does not exist, and thus fails. > This works in 'file' mode though. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org