Lee Mallabone wrote:
> Okay, I'm now not entirely certain how useful a generic solution will be
> to me, given the non-generic nature of the content I'm indexing. I think
> there a lot of optomizations I can make that wouldn't be generic.
"Early optimization is the root of all evil."
On Tue, 2001-10-23 at 17:48, Doug Cutting wrote:
> > This doesn't seem quite as useful as getContext(Hashset queryTerms,
> > Reader originalDocument); which is what I was originally
> > aiming towards.
> to use more generic code, you could
> call such a generic method once for each field of your
> From: Lee Mallabone [mailto:[EMAIL PROTECTED]]
> >
> > How did the title ever get indexed as the title?
>
> I'm indexing HTML documents marked up with comments to indicate field
> boundaries. So I'd typically have:
>
>
> blurb
>
> more blurb
>
> and so on. The documents were indexed by l
On Mon, 2001-10-22 at 17:43, Doug Cutting wrote:
> > I'm trying to implement this and should be able to contribute any
> > succesful results, but I need to produce context on a per-field basis.
>
> How did the title ever get indexed as the title? Presumably you split the
> document into fields w
> From: Lee Mallabone [mailto:[EMAIL PROTECTED]]
>
> I'm trying to implement this and should be able to contribute any
> succesful results, but I need to produce context on a per-field basis.
> Eg. if I got a token hit in the text body of a document, but the first
> hit token was a word in the se
On Fri, 2001-10-19 at 17:01, Doug Cutting wrote:
> > Rather than highlight terms, I would just extract the first hit token,
> > and a certain number of characters either side of it.
>
> I think this is the best approach. Since you'll probably only be displaying
> around ten hits at a time, the co
> From: Lee Mallabone [mailto:[EMAIL PROTECTED]]
>
> This is something I also need to implement in the very near future. My
> current thoughts are to use a variant of Maik Schreiber's way of doing
> term highlighting in documents. See:
> http://www.iq-computing.de/lucene/highlight.htm
>
> Rather
On Thu, 2001-10-18 at 17:29, Benjamin Kopic wrote:
> We are trying to implement Lucene, and one of the requirements for the
> search is to provide a context within which the search term appears in a
> document.
> Has anyone done something similar? Any help would be appreciated.
Hi,
This is so