Hi Alex, Thanks very much for helps! I switched to (I am using PHP in client side)
createTextNode(urlencode($value)) so CTRL character problem is avoided, but I noticed that somehow solr did not perform urldecode($value), so my initial value abc xyz becomes abc+xyz I have not fully read through solr code on this part, but guess maybe it is a configuration issue (when using CDATA I donot have this issue)? Thanks and best regards, Lisheng -----Original Message----- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Tuesday, January 15, 2013 12:56 PM To: solr-user@lucene.apache.org Subject: Re: Solr exception when parsing XML Interesting point. Looks like CDATA is more limiting than I thought: http://en.wikipedia.org/wiki/CDATA#Issues_with_encoding . Basically, the recommendation is to avoid CDATA and automatically encode characters such as yours, as well as less/more and ampersand. Regards, Alex.