Max Hansmire created SOLR-4335:
----------------------------------

             Summary: Solrj UpdateRequest can send illegal XML to Solr
                 Key: SOLR-4335
                 URL: https://issues.apache.org/jira/browse/SOLR-4335
             Project: Solr
          Issue Type: Bug
          Components: clients - java
    Affects Versions: 3.4
            Reporter: Max Hansmire


If you include illegal XML characters like U+FFFF in document in an 
UpdateRequest, they cause an error on the server.

{noformat}
java.lang.RuntimeException: [was class java.io.CharConversionException] Invalid 
UTF-8 character 0xffff at char #1940, byte #127)
{noformat}

Other Illegal XML characters are replaced by the code in 
org.apache.solr.common.util.XML. For instance U+0000 is replaced with "#0;". 
SolrJ should be consistent in how it handles illegal XML characters.

>From the source code it looks like this issue affects the most recent versions 
>of Solr, but I did not attempted to reproduce on 4.0.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to