Thanks, Brian. Yes I know that storing content will be an issue. That is way I would like to access the cached content instead. I have just finished a Lucene only project where we stored a lot of content from binaries and performance was not as good compared to Nutch. Therfore I would like to access the cached content if possible.
Regards, Ronny -----Opprinnelig melding----- Fra: Brian Whitman [mailto:[EMAIL PROTECTED] Sendt: 19. juni 2007 19:52 Til: [EMAIL PROTECTED] Emne: Re: Lucene client and nutch index On Jun 19, 2007, at 1:39 PM, Naess, Ronny wrote: > I have made a small Lucene client reading my nutch index created with > Nutch-0.9 > > This works fine. However since 'content' is not stored only indexed in > the index I have to find a way to access the content to create a > summary (and highlighting the query terms). > You can simply set the content to be stored in the Lucene index then highlighting will work normally from any Lucene client. Search the mailing list (there was a post just yesterday) about how to accomplish this, there's a single line of code to change. Do realise that storing content will slow down some queries and your index size will grow very large. -Brian !DSPAM:467817bf321421501980509! ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
