Hi,

Background
I have a piece of content which is neither a valid HTML or XML in my legacy 
database. Considering the fact, it would be difficult to clean the legacy, I 
want to tidy this up in MarkLogic(version 8.0-3) using xdmp:tidy.
The content looks like :
          [cid:[email protected]]
Please find the attached query I'm executing on ML QConsole to tidy this up.

Problem
The problem here is that the response I'm getting after applying tidy 
functionality is not a valid XML(verified it via XML validator). Also when I 
try to insert document with the resulted xml body via POSTMAN or RESTClient, it 
throws an error saying 'MALFORMED BODY | Invalid Processing Instruction names'.

Response XML :
          [cid:[email protected]]

Expectations
My expectation is, that the Marklogic Tidy functionality should rather refrain 
to tidy-up this type of content and throw an error, which it does not do in the 
current scenario. If I get the error from the Marklogic Tidy itself, I will 
rather get this dirty or bad data removed from the legacy database.

Please help me to get through this problem or suggest me workaround to get this 
resolved.

Things Tried So Far
I have tried various options listed out in xdmp:tidy but it didn't help me 
much. Also I investigated on the Processing Instructions but couldn't find a 
way through as it doesn't looks like a valid PI either

Kind Regards,
Rachit Rampal

Attachment: tidy-query.xqy
Description: tidy-query.xqy

_______________________________________________
General mailing list
[email protected]
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to