Re: [MarkLogic Dev General] Trailing Spaces Removed from Attribute Values--Bug or Feature?

Eliot Kimber Tue, 11 Mar 2008 07:50:35 -0700

Mike Sokolov wrote:

Agreed; however it's not clear that trailing whitespace needs to be preserved in order to be able to search for DITA tokens, as in the original example. I guess it might depend on just what the tokens consist of but a word- or phrase-search might be able to make use of the implicit tokenization done by the indexer without the need for the trailing whitespace.
EG: cts:attribute-word-search(..."topic/topic") ought to match "topic/topic" and not match "mytopic/topic-foo", I think.

It's not just a question of what will work from a MarkLogic query but what consumers of the elements brought out of MarkLogic will get. For example, the XSLT pattern for processing DITA content is:


<xsl:template match="*[contains(@class, ' topic/topic ')]">

If I get stuff out of MarkLogic and hand it to an XSLT transform (e.g., the DITA Open Toolkit) then the above match would fail for generic topics (because the literal value of class= would be "- topic/topic" not "- topic/topic ").
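
To make the failure concrete, here is a small Python sketch that mimics the XPath test contains(@class, ' topic/topic ') as a plain substring check. The function name matches_dita_class is made up for illustration; it is not part of any DITA or MarkLogic tooling.

```python
def matches_dita_class(class_value: str, token: str) -> bool:
    """Mimic the XPath test contains(@class, ' token ').

    The token is wrapped in single spaces, as in the DITA match pattern,
    so it must be space-delimited on both sides to match.
    """
    return f" {token} " in class_value


as_authored = "- topic/topic "  # literal value with its trailing space
as_returned = "- topic/topic"   # same value once the trailing space is gone

print(matches_dita_class(as_authored, "topic/topic"))
print(matches_dita_class(as_returned, "topic/topic"))
```

The first call matches and the second does not, because the last token in the value is no longer followed by a space. Tokens earlier in the value are unaffected; only the final one loses its delimiter.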

Likewise, editors and other tools that expect the trailing space in order to bind behavior to elements would fail.

So even in the best case it would be necessary to moderate any element extraction through a filter that either removes the class= attributes entirely (falling back on the schema- or DTD-defined defaults, assuming the DTD or schema association is restored or maintained in the result) or that adds the missing trailing space to the literal class= values in the instance.
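
As a rough illustration of the second kind of filter, here is a minimal Python sketch using the standard library's xml.etree.ElementTree. It assumes the extracted content is available as a parsed tree and that DITA class values can be recognized by their leading "-" or "+"; the function name restore_class_trailing_space is hypothetical, and a real filter might just as well be written in XSLT or XQuery.

```python
import xml.etree.ElementTree as ET


def restore_class_trailing_space(root):
    """Append the missing trailing space to DITA-style class= values.

    Assumption: a DITA class value starts with "-" or "+". Any other
    class= value is left untouched.
    """
    for elem in root.iter():
        value = elem.get("class")
        if value and value[0] in "-+" and not value.endswith(" "):
            elem.set("class", value + " ")
    return root


# Example: a fragment whose class= values have lost their trailing space.
doc = ET.fromstring(
    '<topic class="- topic/topic">'
    '<title class="- topic/title">Sample</title>'
    '</topic>'
)
restore_class_trailing_space(doc)
print(repr(doc.get("class")))
```

Values that already end in a space are left alone, so running the filter twice is harmless.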


Cheers,

Eliot

--
Eliot Kimber
Senior Solutions Architect
"Bringing Strategy, Content, and Technology Together"
Main: 610.631.6770
www.reallysi.com
www.rsuitecms.com
_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general
