Re: [2.1] Overzealous escaping of high Unicode code points

2017-06-08 Thread gelo1234
Chris, Even with C3 (cocoon 3.0 beta) unless you specify optional encoding in your Serializer config, you fallback to default UTF-8: org.apache.cocoon.optional.servlet.components.sax.serializers.util public class ConfigurationUtils { private ConfigurationUtils() { } public static S

Re: [2.1] Overzealous escaping of high Unicode code points

2017-06-08 Thread gelo1234
It depends on what type of Serializer you use and what kind of Serlializer config you put into your sitemap? By default XMLSerializer/HTMLSerializer uses UTF-8 encoding. So instead of 1 UTF-16 char you got 2 chars UTF-8 encoded. Of cource there might be also issue with emoji charset, but I would f