Hi Jens,

Jens Fischer wrote:
I was wondering if there's an option to return statistics about
distances from the query terms to the most frequent terms in the
result documents.

The additional information I'm looking for is the average distance
between these terms and my search term.

So let's say I have two docs

"the house is red"
"I live in a red house"

The search for "house" should also return the info

the:1
is:1
red:1.5
I:5
live:4

Could you explain what the "distance" here is? Something like "edit
distance"? Ah, I see: You want the textual distance between the search
term and other terms in the document, and then you want that averaged,
i.e. the cumulative distance divided by the number of occurrences.
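
To make sure we mean the same thing, here is a minimal sketch of that calculation in plain Python, outside of any search engine. The function name, the case-sensitive whitespace tokenization, and the rule of measuring each term occurrence against the nearest occurrence of the query term are my own assumptions, not an existing feature:

```python
from collections import defaultdict


def average_distances(docs, query):
    """Average token distance from `query` to every other term.

    Assumptions: tokens are split on whitespace and compared
    case-sensitively; each occurrence of a term is measured against
    the nearest occurrence of the query term in the same document.
    """
    totals = defaultdict(int)   # cumulative distance per term
    counts = defaultdict(int)   # number of occurrences per term
    for doc in docs:
        tokens = doc.split()
        query_positions = [i for i, t in enumerate(tokens) if t == query]
        if not query_positions:
            continue  # document does not match the query term
        for i, term in enumerate(tokens):
            if term == query:
                continue
            totals[term] += min(abs(i - q) for q in query_positions)
            counts[term] += 1
    return {term: totals[term] / counts[term] for term in totals}


docs = ["the house is red", "I live in a red house"]
print(average_distances(docs, "house"))
```

For your two documents this gives the:1, is:1, red:1.5, I:5 and live:4, matching your list, plus in:3 and a:2, which your list leaves out.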

I don't know whether that functionality is available.

However, the calculation you want requires the engine not only to
collect all the terms to present as facets (much improved in 1.4, as
I've just learned), but also, if I'm not mistaken, to analyze each
document to determine the distance of each facet term from your
primary query term or terms.

The number of lookup operations is likely to scale as the product of
the number of your primary search results, the number of your search
terms, and the number of your facets.
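
To put a rough number on that product, here is a back-of-the-envelope sketch in Python. The function name and the input figures are hypothetical, chosen only for illustration, and it assumes one lookup per combination of result document, search term and facet term:

```python
def estimated_lookups(num_results, num_query_terms, num_facet_terms):
    """Rough lookup count, assuming one lookup per
    (result document, query term, facet term) combination."""
    return num_results * num_query_terms * num_facet_terms


# Hypothetical figures: 1000 matching documents, 2 search terms, 50 facet terms.
print(estimated_lookups(1000, 2, 50))  # 100000
```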

I assume this is an expensive operation.

Michael Ludwig
