Hi Saptarshi,

If the XML from entity:enrich does not suit your needs, it sounds like you will need to use cts:entity-highlight to define the XML based on your own taxonomy:
http://developer.marklogic.com/pubs/4.1/apidocs/SearchBuiltins.html#cts:entity-highlight

You can use cts:entity-highlight to write a function that transforms the entity markup into whatever you need.

You can also see chapter 9 (~p109) of the Search Developer’s Guide:

http://developer.marklogic.com/pubs/4.1/books/search-dev-guide.pdf

It outlines how entity enrichment works with MarkLogic Server.

I am not sure I understand your question about schema and DITA. Perhaps if you gave a specific example of what you are trying to do and where you are having trouble, we might be able to help you find a solution.

The sample pipelines that use third-party technologies (such as Temis) are designed to show integration with these other technologies. Entity extraction technologies are often very specialized to particular types of content, and MarkLogic can work with a wide array of different technologies.

Hope that helps,
-Danny

From: [email protected] [mailto:[email protected]] On Behalf Of [email protected]
Sent: Thursday, July 02, 2009 10:27 PM
To: [email protected]
Subject: [MarkLogic Dev General] Enrichment of content

Hi,

In my project, I will be using MarkLogic, and we have a requirement for content enrichment. I have the content and a taxonomy structure defined. I want to enrich the content using that taxonomy structure, with inline metadata tagging on the content. My questions are as follows:

1) From the enrich module API, I understand that the enrich function adds metadata to the given XML. It seems to me that the taxonomy structure and values on which the metadata tagging is based are managed by MarkLogic. In my project, I have my own taxonomy definition for the marked-up elements, and I would like to use that definition for enriching the content. How can I add it to MarkLogic?

2) Secondly, I have noticed that if the XML has a schema defined and the schema does not allow child elements on a node, MarkLogic does not enrich that node. That is fine. But if I send DITA-formatted XML, can I get DITA-formatted XML back as output with the enriched content? An example on this topic would be very helpful. I would also like to explore this topic further, so pointers to more resources would be great.

3) I have also seen that MarkLogic has partnered with Temis Luxid for content enrichment. I could not understand what MarkLogic provides and what Temis adds on top of MarkLogic. Any help in this regard would be great.

Thanks in advance.

regards,
Saptarshi Das
Tata Consultancy Services
United States
Mailto: [email protected]
Website: http://www.tcs.com
