I have been trying to come up with a way to determine the "format" of a document in MarkLogic. The only api call that seems directly related is xdmp:document-uri-format, but this seems to operate on the uri without any reference to the contents of a document. Instead, I tried testing:

node-kind(doc($uri)/node()[1])


but we just found an XML document for which this returns "text" - apparently it has a BOM at the start, so the document node has two child nodes: one text (containing the BOM) and one element (the root element). Presumably there could be comments there too and processing instructions, so this strategy is clearly flawed.

Does anybody have a good way to determine whether a document in Mark Logic is an XML document, a text document or a binary document?

-Mike

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to