Hi,
Background
I have a piece of content which is neither a valid HTML or XML in my legacy
database. Considering the fact, it would be difficult to clean the legacy, I
want to tidy this up in MarkLogic(version 8.0-3) using xdmp:tidy.
The content looks like :
[cid:[email protected]]
Please find the attached query I'm executing on ML QConsole to tidy this up.
Problem
The problem here is that the response I'm getting after applying tidy
functionality is not a valid XML(verified it via XML validator). Also when I
try to insert document with the resulted xml body via POSTMAN or RESTClient, it
throws an error saying 'MALFORMED BODY | Invalid Processing Instruction names'.
Response XML :
[cid:[email protected]]
Expectations
My expectation is, that the Marklogic Tidy functionality should rather refrain
to tidy-up this type of content and throw an error, which it does not do in the
current scenario. If I get the error from the Marklogic Tidy itself, I will
rather get this dirty or bad data removed from the legacy database.
Please help me to get through this problem or suggest me workaround to get this
resolved.
Things Tried So Far
I have tried various options listed out in xdmp:tidy but it didn't help me
much. Also I investigated on the Processing Instructions but couldn't find a
way through as it doesn't looks like a valid PI either
Kind Regards,
Rachit Rampal
tidy-query.xqy
Description: tidy-query.xqy
_______________________________________________ General mailing list [email protected] Manage your subscription at: http://developer.marklogic.com/mailman/listinfo/general
