TermVector usage

Marvin Humphrey Mon, 20 Feb 2006 19:29:21 -0800

Greets,

KinoSearch 0.05, which for now I'm calling a "loose port" of Lucene,was published to CPAN a few weeks ago. It's nice and fast, butmissing some features, most notably multiple segment support andincremental indexing. Before I get to that though, I'm addingexcerpting and highlighting.

The version of KinoSearch which preceded the Lucene-based rewritealso had a highlighter which depended on what were effectivelyTermVectors with stored offsets. However, unlike Lucene, these werestored along with the stored fields. As I've been preparing to portall the support apparatus for TermVectors, I've been wonderingwhether I shouldn't go back to that. It sure would be less work tocode up. Theoretically there ought to be less disk activity, too.

From following the Lucene lists off and on, I've gotten theimpression that lots of people use TermVectors to feed thehighlighter, but I haven't seen many applications for them besidesthat. LSI-type ideas percolate every once in a while. Besideshighlighting, how many people are using TermVectors and how are theyusing them?


Marvin Humphrey
Rectangular Research
http://www.rectangular.com/


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

TermVector usage

Reply via email to