Peter Hickman wrote:
Ryan Grimm wrote:
Hi Peter,

A couple questions for you.

This document that takes 40 seconds to highlight, how large is it?

It's only 1.5M, however the majority of the other documents are in the 10-100k region so it is large in comparison.

Ok, that's a good size but not out of this world. However, I am surprised that it is taking 40 seconds to highlight a 1.5MB document.

Have you configured a fragmentation policy?

Our fragmentation policy is to keep each article in it's own fragment so that we can use the estimate function with some degree of success. Or have I misunderstood the effects of fragmentation on the calculation of the estimate. As I understand it fragmenting an article will result in that article being counted into the estimate once for each time that a fragment matches. How would fragmenting the article speed up cts:highlight?

You're view of how fragmentation effects xdmp:estimate() is correct and fragmentation probably won't speed up cts:highlight(). I was curious what your fragmentation policy was so I could get a better understanding of how you've got things setup.

I would try my previous suggestion of using cts:search() to narrow down what content you want to highlight. Here's a very basic example of how you could do this if your content was XHTML.

declare namespace xhtml = "http://www.w3.org/1999/xhtml";

let $query := cts:word-query("find me")
for $result in cts:search(/xhtml:html, $query)[1 to 10]
let $para := cts:search(doc(base-uri($result))//xhtml:p, $query)[1]
return cts:highlight($para, $query, <strong>{ $cts:text }</strong>)


Hope that helps.

--Ryan

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to