Thanks, Brian.

Yes, I know that storing content will be an issue. That is why I would
like to access the cached content instead. I have just finished a
Lucene-only project where we stored a lot of content from binaries, and
performance was not as good as with Nutch. Therefore I would like to
access the cached content if possible.

Regards, 
Ronny


-----Original message-----
From: Brian Whitman [mailto:[EMAIL PROTECTED] 
Sent: 19 June 2007 19:52
To: [EMAIL PROTECTED]
Subject: Re: Lucene client and nutch index


On Jun 19, 2007, at 1:39 PM, Naess, Ronny wrote:

> I have made a small Lucene client reading my nutch index created with
> Nutch-0.9
>
> This works fine. However, since 'content' is not stored, only indexed,
> in the index, I have to find a way to access the content to create a
> summary (and highlight the query terms).
>

You can simply set the content to be stored in the Lucene index, and
then highlighting will work normally from any Lucene client. Search the
mailing list (there was a post just yesterday) for how to accomplish
this; there's a single line of code to change. Do realise that storing
content will slow down some queries and your index size will grow very
large.

-Brian
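
The stored versus indexed-only distinction discussed above can be
illustrated with a toy model. This is only a sketch in plain Python, not
Lucene or Nutch code: the ToyIndex class and its add, search and
highlight methods are hypothetical names made up for illustration. It
shows why a field that is indexed but not stored can be searched but not
highlighted: the inverted index keeps the terms, not the original text.

```python
import re
from collections import defaultdict


class ToyIndex:
    """Toy inverted index (hypothetical; this is not Lucene's API)."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> ids of documents containing it
        self.stored = {}                  # doc id -> original text, only if stored

    def add(self, doc_id, text, store):
        # Indexing: every term becomes searchable.
        for term in re.findall(r"\w+", text.lower()):
            self.postings[term].add(doc_id)
        # Storing: keep the original text so it can be returned later.
        if store:
            self.stored[doc_id] = text

    def search(self, term):
        return sorted(self.postings.get(term.lower(), set()))

    def highlight(self, doc_id, term):
        text = self.stored.get(doc_id)
        if text is None:
            # Indexed-only field: the text has to come from somewhere else.
            return None
        pattern = r"\b(%s)\b" % re.escape(term)
        return re.sub(pattern, r"<b>\1</b>", text, flags=re.IGNORECASE)


index = ToyIndex()
index.add(1, "Nutch builds a Lucene index", store=True)
index.add(2, "Lucene only project", store=False)

print(index.search("lucene"))        # both documents match
print(index.highlight(1, "lucene"))  # stored text can be highlighted
print(index.highlight(2, "lucene"))  # None: nothing to highlight
```

In this model, storing trades index size for the ability to highlight
directly from the index, which is the trade-off described above.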





_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
