[ 
https://issues.apache.org/jira/browse/XERCESC-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17133691#comment-17133691
 ] 

Roger Leigh commented on XERCESC-2206:
--------------------------------------

There would certainly be no urgency in releasing what's on the master branch.  
We could potentially stage the changes there and leave it until next year, and 
maybe also queue up any breaking changes which couldn't be applied for 
compatibility reasons before now for 3.2.  This would permit us to do the work 
without committing to support two releases at the same time, if that would be 
acceptable?

In benchmarking my application code, I've found that over 50% of the total CPU 
time could end up spent in transcoding, and a big part of that was conversion 
of UTF-8 to UTF-16 as input to Xerces-C++ and then more for reconversion of the 
output.  If it were possible, I'd find much more value in UTF-8 end-to-end 
without involving UTF-16 or UTF-32.  But being able to use UTF-16 literals and 
std::ustring directly would reduce the overheads by a fairly significant amount.

> Use char16_t and unicode literals to replace various XMLCh types
> ----------------------------------------------------------------
>
>                 Key: XERCESC-2206
>                 URL: https://issues.apache.org/jira/browse/XERCESC-2206
>             Project: Xerces-C++
>          Issue Type: Bug
>          Components: Miscellaneous
>    Affects Versions: 3.3.0
>            Reporter: Roger Leigh
>            Assignee: Roger Leigh
>            Priority: Major
>             Fix For: 3.3.0
>
>
> Currently, XMLCh can be a variety of 16-bit types depending upon the 
> platform, from wchar_t, uint16_t, unsigned short, to char16_t.
> To reduce the platform-specific variability, fix XMLCh to char16_t, and also 
> permit the use of u"" unicode string literals in the codebase.  This will 
> allow replacement of Unicode constants with direct use of literals.
> This will additionally reduce the size of the test matrix with only one 
> character variant to test.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org

Reply via email to