On 6/22/06, Yonik Seeley <[EMAIL PROTECTED]> wrote:
On 6/22/06, Mike Klaas <[EMAIL PROTECTED]> wrote:

> It uses term vectors, if present, to speed up highlighting (else the
> stored field needs to be re-analyzed).

Does your current code already handle the re-analyze case?

Yes, it grabs the correct analyzer from IndexSchema.

It does seem like it would be easier for clients to parse document
associated data if it is included directly in the <doc> element.

One way might be to create an extension point where Documents could be
manipulated and fields could be added.  This could also be useful for
integrating with large stored fields that might not be kept in the
index, but in a separate database instead.

That brings up another point... the ability to highlight something
that doesn't have termvectors stored, and doesn't have the field
stored might be useful.  The same interface used to add new fields
above could potentially be used here as well.

Creating an external storage hook seems like a bit more than I am
comfortable biting off (among other things, it seems like a major solr
design decision).  The ability to highlight arbitrary text is easy
once solr has gotten hold of it, however.  There also a matter of
specifying an Analyzer for the highlighting.

<>
> FieldType, and the only RequestHandler customization is the selection
> of which fields to highlight.

I'm not sure if this is really the property of a field.
Another possibility is using init params in the request handler
defined in solrconfig.xml, with the possibility of overriding them in
a request.

True, though it seems nettlesome to have per-field highlighting
customization using that strategy.  Perhaps a distinction could be
made between highlighting and summarizing, which are two rather
different use cases.

cheers,
-MIke

Reply via email to