Thanks Indy – how is that meant to flow, something like this? declare namespace xhtml = "http://www.w3.org/1999/xhtml";
declare function do:makeXMLsafe( $Str as xs:string ) { let $Str:=fn:escape-html-uri($Str) let $Str:=xdmp:tidy($Str, <options xmlns="xdmp:tidy"><output-xhtml>yes</output-xhtml> </options>)[2]/xhtml:html/xhtml:body/node() return $Str }; From: <general-boun...@developer.marklogic.com> on behalf of Indrajeet Verma <indrajeet.ve...@gmail.com> Reply-To: MarkLogic <general@developer.marklogic.com> Date: Wednesday, February 8, 2017 at 10:28 AM To: MarkLogic <general@developer.marklogic.com> Subject: Re: [MarkLogic Dev General] Is xdml:unquote appropriate for handling accent characters? See if this works for you. declare namespace xhtml = "http://www.w3.org/1999/xhtml"; xdmp:tidy($Str, <options xmlns="xdmp:tidy"><output-xhtml>yes</output-xhtml> </options>)[2]/xhtml:html/xhtml:body/node() Regards, Indy On Wed, Feb 8, 2017 at 11:40 PM, Kari Cowan <kco...@alm.com<mailto:kco...@alm.com>> wrote: I guess I can make it palatable with the function I added below – then have them unfurl it on the front end. When I pulled actual doc source – even though ‘Pokémon’ displayed in Qconsole, it was actually encoded as è declare function do:makeXMLsafe( $Str as xs:string ) { let $Str:=fn:escape-html-uri($Str) return $Str }; >> changes ‘Pokémon’ to ‘Pok%C3%A9mon’ Is there any better way to deal with it? From: <general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>> on behalf of Kari Cowan <kco...@alm.com<mailto:kco...@alm.com>> Reply-To: MarkLogic <general@developer.marklogic.com<mailto:general@developer.marklogic.com>> Date: Tuesday, February 7, 2017 at 2:34 PM To: MarkLogic <general@developer.marklogic.com<mailto:general@developer.marklogic.com>> Subject: Re: [MarkLogic Dev General] Is xdml:unquote appropriate for handling accent characters? (note outlook stripped out the unknown character below, in the <title> node it was “Pok?mon” From: Kari Cowan <kco...@alm.com<mailto:kco...@alm.com>> Date: Tuesday, February 7, 2017 at 2:31 PM To: MarkLogic <general@developer.marklogic.com<mailto:general@developer.marklogic.com>> Subject: Is xdml:unquote appropriate for handling accent characters? The doc contains a node with text including an accent grave, example: <HEADLINE>VOIR DIRE: Pokémon Drive?</HEADLINE> I tried to handle it with: let $theTitle:=xdmp:unquote($theTitle, "", ("repair-full")) But I still get an output with an unknown character in xml <title>VOIR DIRE: Pokmon Drive?</title> >> XML Parsing Error: not well-formed Anyone have a tip they can share on how to handle it? _______________________________________________ General mailing list General@developer.marklogic.com<mailto:General@developer.marklogic.com> Manage your subscription at: http://developer.marklogic.com/mailman/listinfo/general
_______________________________________________ General mailing list General@developer.marklogic.com Manage your subscription at: http://developer.marklogic.com/mailman/listinfo/general