[ 
https://issues.apache.org/jira/browse/XERCESC-2240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613049#comment-17613049
 ] 

Scott Cantor commented on XERCESC-2240:
---------------------------------------

Unless somebody who knows the code better can propose a patch I definitely 
don't see much that I could do about this.

> Junk characters (including null) allowed in XML declaration
> -----------------------------------------------------------
>
>                 Key: XERCESC-2240
>                 URL: https://issues.apache.org/jira/browse/XERCESC-2240
>             Project: Xerces-C++
>          Issue Type: Bug
>          Components: Non-Validating Parser
>    Affects Versions: 3.2.3
>         Environment: Linux
>            Reporter: Benjamin Fritz
>            Priority: Minor
>         Attachments: basic_bad_bytes.xml, basic_bad_bytes2.xml
>
>
> In a library we've written using Xerces-C++ to validate XML files against a 
> given XSD, we have discovered that the XercesDOMParser::parse() function does 
> not record any errors if the XML declaration at the beginning of an XML 
> document contains "junk" characters, including control characters (^K) or 
> null bytes. The null control character specifically should be invalid in any 
> XML document. I.e. the following XML file (attaching as basic_bad_bytes.xml) 
> parses without error, but it should not:
> <?xml version="1.0" encoding^@\^@\^@\^@\^@="UTF-8" ?>
> <root_elem>
>   <child_elem some_attr="abc" />
>   <child_elem some_attr="def" />
> </root_elem>
> The following XML (attaching as basic_bad_bytes2.xml) correctly reports an 
> error:
> <?xml version="1.0" encoding="UTF-8" ?>
> <root_elem^@\^@\^@\^@\^@>
>   <child_elem some_attr="abc" />
>   <child_elem some_attr="def" />
> </root_elem>
> This is similar to XERCESC-1701, where the end of the document after the root 
> element was found to allow "junk" characters during parsing.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org

Reply via email to