[ 
https://issues.apache.org/jira/browse/SOLR-7929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erik Hatcher updated SOLR-7929:
-------------------------------
    Attachment: SOLR-7929.patch

Here's a quick patch that just makes "*" for filetypes special in web crawling 
such that it doesn't do a reverse contentType -> file extension check.

> SimplePostTool (also bin/post) -filetypes "*" does not work properly in 'web' 
> mode
> ----------------------------------------------------------------------------------
>
>                 Key: SOLR-7929
>                 URL: https://issues.apache.org/jira/browse/SOLR-7929
>             Project: Solr
>          Issue Type: Bug
>    Affects Versions: 5.3
>            Reporter: Erik Hatcher
>            Assignee: Erik Hatcher
>            Priority: Critical
>             Fix For: 5.4
>
>         Attachments: SOLR-7929.patch
>
>
> {code}
>  $ bin/post -c tmp http://lucene.apache.org/solr/assets/images/book_sia.png 
> -filetypes “*”
>     ...
>  Entering auto mode. Indexing pages with content-types corresponding to file 
> endings *
>  Entering crawl at level 0 (1 links total, 1 new)
>  SimplePostTool: WARNING: Skipping URL with unsupported type image/png
> {code}
> the mapping from image/png to a file type does not exist, and thus fails.  
> This works in 'file' mode though.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to