Karl Wright created SOLR-4358:
---------------------------------

             Summary: SolrJ, by preventing multi-part post, loses key 
information about file name that Tika needs
                 Key: SOLR-4358
                 URL: https://issues.apache.org/jira/browse/SOLR-4358
             Project: Solr
          Issue Type: Bug
          Components: clients - java
    Affects Versions: 4.0
            Reporter: Karl Wright


SolrJ accepts a ContentStream, which has a name field.  Within 
HttpSolrServer.java, if SolrJ decides to use a multipart post, this name is 
transmitted as the filename of the corresponding form part.  However, if 
SolrJ chooses not to use a multipart post, the filename is lost.
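
The difference between the two request shapes can be sketched with plain Java. 
This is only an illustration under stated assumptions, not SolrJ's actual 
code: the class ContentStreamNameDemo, its methods, the field name "file", and 
the boundary string are all hypothetical. It shows that a multipart part has a 
Content-Disposition header that can carry the name, while a raw body has no 
such slot.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Illustrative only: hand-built request bodies, not SolrJ's actual code.
class ContentStreamNameDemo {

    // Headers of a single multipart/form-data part. The stream name travels
    // in the Content-Disposition header as the "filename" attribute.
    static String multipartPartHeaders(String fieldName, String streamName, String contentType) {
        return "Content-Disposition: form-data; name=\"" + fieldName
                + "\"; filename=\"" + streamName + "\"\r\n"
                + "Content-Type: " + contentType + "\r\n\r\n";
    }

    // A complete multipart body holding exactly one part.
    static byte[] multipartBody(String boundary, String fieldName, String streamName,
                                String contentType, byte[] content) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        append(out, ("--" + boundary + "\r\n").getBytes(StandardCharsets.US_ASCII));
        append(out, multipartPartHeaders(fieldName, streamName, contentType)
                .getBytes(StandardCharsets.UTF_8));
        append(out, content);
        append(out, ("\r\n--" + boundary + "--\r\n").getBytes(StandardCharsets.US_ASCII));
        return out.toByteArray();
    }

    // A raw (non-multipart) body is just the content bytes; the name has nowhere to go.
    static byte[] rawBody(byte[] content) {
        return content.clone();
    }

    // True if the given text appears anywhere in the body bytes.
    static boolean mentions(byte[] body, String text) {
        return new String(body, StandardCharsets.ISO_8859_1).contains(text);
    }

    private static void append(ByteArrayOutputStream out, byte[] bytes) {
        out.write(bytes, 0, bytes.length);
    }

    public static void main(String[] args) {
        byte[] content = "example document bytes".getBytes(StandardCharsets.US_ASCII);
        byte[] multipart = multipartBody("boundary123", "file", "report.pdf",
                "application/pdf", content);
        byte[] raw = rawBody(content);
        System.out.println("multipart body carries the name: " + mentions(multipart, "report.pdf"));
        System.out.println("raw body carries the name: " + mentions(raw, "report.pdf"));
    }
}
```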

SolrCell (Tika) uses this information to make decisions about content 
extraction, so it is important that it reaches Solr one way or another.  
Either SolrJ should automatically set equivalent headers that carry the 
filename, or it should force a multipart post whenever this information is 
present.
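
The two options could look roughly like the sketch below. It is not a patch 
against HttpSolrServer.java: the class FilenamePreservingPost, both method 
names, and the example URL are hypothetical. The first method forces multipart 
whenever a stream name is present. The second is a variation on the "send it 
some other way" option that passes the name as a request parameter instead of 
a header; it assumes the extracting handler honors the resource.name 
parameter, which should be checked against the Solr version in use.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper, not part of SolrJ.
class FilenamePreservingPost {

    // Option: force a multipart post whenever the stream carries a name,
    // regardless of what the client would otherwise have chosen.
    static boolean shouldUseMultipart(boolean wouldUseMultipart, String streamName) {
        boolean hasName = streamName != null && !streamName.isEmpty();
        return wouldUseMultipart || hasName;
    }

    // Option: keep the raw post but pass the name out of band, here as a
    // "resource.name" query parameter (an assumption about the server side).
    static String withResourceName(String url, String streamName) {
        if (streamName == null || streamName.isEmpty()) {
            return url; // nothing to preserve
        }
        String separator = url.contains("?") ? "&" : "?";
        try {
            return url + separator + "resource.name=" + URLEncoder.encode(streamName, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is required to be supported by every Java platform.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        String url = "http://localhost:8983/solr/update/extract"; // example URL, an assumption
        System.out.println(shouldUseMultipart(false, "report.pdf"));
        System.out.println(withResourceName(url, "my report.pdf"));
    }
}
```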


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
