[ https://issues.apache.org/jira/browse/XERCESC-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Scott Cantor updated XERCESC-2065:
----------------------------------
    Description: 
Documents containing CR entities don't seem to round-trip properly when you 
parse them and then serialize them. The bug may be in the serializer rather 
than the parser, because signed documents don't end up with corrupt 
signatures, but that may just reflect insufficient testing so far.

A simple example:

{code}
<?xml version="1.0" encoding="UTF-8"?>
<foo>
   text&#13;more&lt;&amp;
</foo>
{code}

Running that through DOMPrint or SAX2Print:

{code}
<foo>
more&lt;&amp;
</foo>
{code}

Notice that the CR entity is removed, along with all of the characters 
immediately in front of it ("text" and the leading spaces).

  was:
Documents containing CR entities (&#13;) don't seem to round-trip properly 
when you parse them and then serialize them. The bug may be in the serializer 
rather than the parser, because signed documents don't end up with corrupt 
signatures, but that may just reflect insufficient testing so far.

A simple example:

{code}
<?xml version="1.0" encoding="UTF-8"?>
<foo>
   text&#13;more&lt;&amp;
</foo>
{code}

Running that through DOMPrint or SAX2Print:

{code}
<foo>
more&lt;&amp;
</foo>
{code}

Notice that the CR entity is removed, along with all of the characters 
immediately in front of it ("text" and the leading spaces).


> Carriage return entities are not handled properly
> -------------------------------------------------
>
>                 Key: XERCESC-2065
>                 URL: https://issues.apache.org/jira/browse/XERCESC-2065
>             Project: Xerces-C++
>          Issue Type: Bug
>          Components: DOM, Non-Validating Parser, SAX/SAX2
>    Affects Versions: 3.1.3
>            Reporter: Scott Cantor
>            Priority: Critical
>
> Documents containing CR entities don't seem to round-trip properly when you 
> parse them and then serialize them. The bug may be in the serializer rather 
> than the parser, because signed documents don't end up with corrupt 
> signatures, but that may just reflect insufficient testing so far.
> A simple example:
> {code}
> <?xml version="1.0" encoding="UTF-8"?>
> <foo>
>    text&#13;more&lt;&amp;
> </foo>
> {code}
> Running that through DOMPrint or SAX2Print:
> {code}
> <foo>
> more&lt;&amp;
> </foo>
> {code}
> Notice that the CR entity is removed, along with all of the characters 
> immediately in front of it ("text" and the leading spaces).
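
When comparing the serialized output against the input, it may help to make control characters visible, so that a CR that is missing from the byte stream can be told apart from a raw CR that is present but not displayed. The report does not say how the output was inspected, so this is only a diagnostic sketch: make_visible is a hypothetical helper using the C++ standard library alone, not part of Xerces-C++, DOMPrint, or SAX2Print.

```cpp
#include <cstdio>
#include <string>

// Hypothetical diagnostic helper (not a Xerces-C++ API): returns a copy of
// `bytes` in which ASCII control characters are replaced by visible escapes,
// so a raw carriage return shows up as the two characters '\' and 'r'.
std::string make_visible(const std::string& bytes) {
    std::string out;
    for (unsigned char c : bytes) {
        if (c == '\r') {
            out += "\\r";
        } else if (c == '\n') {
            out += "\\n";
        } else if (c == '\t') {
            out += "\\t";
        } else if (c < 0x20 || c == 0x7f) {
            // Any other control character is shown as a \xNN hex escape.
            char buf[5];
            std::snprintf(buf, sizeof buf, "\\x%02X", static_cast<unsigned>(c));
            out += buf;
        } else {
            // Printable ASCII and bytes >= 0x80 (e.g. UTF-8) pass through.
            out += static_cast<char>(c);
        }
    }
    return out;
}
```

Applying make_visible to the captured output of DOMPrint or SAX2Print (for example, after redirecting it to a file and reading the file in binary mode) would show whether "text" and a CR are still present in the bytes.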



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

