Best way to gather span/token positions from query?

Sean O'Connor Wed, 29 Apr 2009 19:07:02 -0700

Hello,

I'm trying to find a decent approach for getting token positions outof (or is that into?) solr query results. Is the best approach to extenda QueryComponent and/or HighlightComponent? I'm new to solr, and stillon fairly shaky ground soany pointers or suggestions are quite welcome.


   As a little BACKGROUND:

I am trying to migrate a custom lucene-only content anaylsysproject to solr. The 'old' system programmatically runs a few thousandpredefined queries against a corpus, and then analyzes the results. Thelucene score is good, but the actual position of the hits is also quiteimportant.

My previous system did a simple query parsing to create SpanQuerys,and then used a modified dumpSpans() to get the token position from thespans. Now I am trying to find how to use solr's goodness (andMemoryIndex approach?) to get the span positions in a more logicalmanner. I think the answer is in the highlighter, but I'm getting alittle twisted around, and could use a pointer.

I am using a recent Solr nightly snapshot, grails, Aduna Aperture,and Intellij (if any of that matters)

Thanks,

Sean

Best way to gather span/token positions from query?

Reply via email to