Is it possible (and would it even help) to normalize all strings with regards to surrogate pairs at indexing time instead? or will container still possibly differ in byte for byte output?

- Donovan

On Sep 11, 2009, at 5:34 PM, Chris Hostetter wrote:


: > why don't we just output the raw bytes ourselves?
:
: That would require writing TextResponseWriter and friends as binary
: writers, right? Or did you have a different way in mind for injecting
: bytes into the output stream?


Grr.... you're right. i got so turned arround thinking about
counting the bytes and encapsulating it all in the PHPS markup i forgot that we still want/need the *raw* bytes output as part of the character
stream ... not some sort of string representation of the bytes ... wow
that sounds even more comfusing when i look at it on paper.

character data sucks.

I still repeat my earlier vote: let's yank this patch and just document that byte counts for strings included in the PHPS format are only accurate for characters outside the BMP if the servlet container being used writes them correctly -- that seems like a totally fair requirement to having in place, and points the figner squarly where in belongs (without putting us
at risk for having broken code if/when jetty fixes this problem.

Alternately: have an option and/or system property to force/disable this
behavior even if jetty.home isn't/is set.

Even if we change nothing else: there needs to be a big fat freaking
disclaimer in the javadocs for the writer that it's looking at the
jetty.home variable, and explaining why.



-Hoss


Reply via email to