On Tue, Jan 15, 2013 at 3:55 PM, Alexandre Rafalovitch <[email protected]> wrote:
> Basically, the
> recommendation is to avoid CDATA and automatically encode characters such
> as yours, as well as less/more and ampersand.

Unfortunately that doesn't even work. Just as a raw control character like a 0 byte is invalid XML, so is an encoded 0 byte like &#0;.

XML on its own is simply incapable of representing all Unicode code points (without some further encoding on top, like base64 or whatever). You could always use JSON...

-Yonik
http://lucidworks.com
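
For anyone who wants to see this for themselves, here is a rough sketch in Python using only the standard library (expat via xml.etree.ElementTree). The element name `field`, the sample value, and the helper `parses_as_xml` are made up for illustration and have nothing to do with Solr's own schema or update handlers. It checks a raw 0 byte, the character reference &#0;, a base64-wrapped value, and a JSON round trip.

```python
import base64
import json
import xml.etree.ElementTree as ET


def parses_as_xml(doc: bytes) -> bool:
    """Return True if the standard-library XML parser accepts the document."""
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False


# Hypothetical field value containing a NUL (U+0000) character.
value = "a\x00b"

# 1. A raw 0 byte inside the document is rejected.
raw_ok = parses_as_xml(b"<field>a\x00b</field>")

# 2. The numeric character reference for U+0000 is rejected as well.
encoded_ok = parses_as_xml(b"<field>a&#0;b</field>")

# 3. A further encoding on top (base64 here) gets the value through XML,
#    but the receiver has to know to decode it again.
wrapped = base64.b64encode(value.encode("utf-8"))
element = ET.fromstring(b"<field>" + wrapped + b"</field>")
base64_roundtrip = base64.b64decode(element.text).decode("utf-8")

# 4. JSON can carry the same character directly as a \u0000 escape.
json_text = json.dumps(value)
json_roundtrip = json.loads(json_text)

print("raw 0 byte accepted:", raw_ok)
print("&#0; accepted:", encoded_ok)
print("base64 round trip ok:", base64_roundtrip == value)
print("JSON text:", json_text)
print("JSON round trip ok:", json_roundtrip == value)
```

The base64 route works only if both ends agree on it, which is the "further encoding on top" caveat; JSON needs no such side agreement for this character.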
