Not sure if this will work, but have you tried replacing hyphens with an element such as: multi<hy>-</hy>lingual and defining the element as a phrase-around? If ML actually treats that as a single word, you should be ok; you would also have to remove hyphens from search terms

-Mike

David Sewell wrote:
A bit of experimentation shows that the Unicode soft hyphen character,
U+00AD, is treated as word-dividing for the purposes of MarkLogic word
indexing. I.e. given in one's underlying data

  multi&#xAD;lingual

then cts:word-query("multilingual") won't match.

Is there any workaround?

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to