It depends :)
It's a trade-off. If storing is not prohibitive, I recommend that as
it makes life easier for highlighting.
Erik
On Mar 7, 2009, at 6:37 AM, Amin Mohammed-Coleman wrote:
hi
that's what i was thinking about. i would need to get the file and
extract
the text again and then pass through the highlighter. The other
option is
storing the content in the index the downside being index is going
to be
large. Which would be the recommended approach?
Cheers
Amin
On Sat, Mar 7, 2009 at 10:50 AM, Erik Hatcher <e...@ehatchersolutions.com
>wrote:
With the caveat that if you're not storing the text you want
highlighted,
you'll have to retrieve it somehow and send it into the Highlighter
yourself.
Erik
On Mar 7, 2009, at 5:40 AM, Michael McCandless wrote:
You should look at contrib/highlighter, which does exactly this.
Mike
Amin Mohammed-Coleman wrote:
Hi
I am currently indexing documents (pdf, ms word, etc) that are
uploaded,
these documents can be searched and what the search returns to
the user
are
summaries of the documents. Currently the summaries are
extracted when
indexing the file (summary constructed by taking the first 10
lines of
the
document and stored in the index as field). This is not ideal
(static
summary), and I was wondering if it would be possible to create a
dynamic
summary when a hit is found and highlight the terms found. The
content
of
the document is not stored in the index.
So basically what I'm looking to do is:
1) PDF indexed
2) PDF body contains the word "search"
3) Do a search and return the hit
4) Construct a summary with the term "search" included.
I'm not sure how to go about doing this (I presume it is
possible). I
would
be grateful for any advice.
Cheers
Amin
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org