enabled plugins that implement IndexingFilter are run for each file to generate the fields to index. enabled plugins can be found in conf/nutch-default.xml or conf/nutch-site.xml.
You can look at http://wiki.apache.org/nutch/IndexStructure. Kai_testing Middleton wrote: > Not sure ... this is kind of an off-the-cuff reply, but Luke might give you > that information (google for apache luke). > > ----- Original Message ---- > From: Daniel Clark <[EMAIL PROTECTED]> > To: [EMAIL PROTECTED] > Sent: Tuesday, July 17, 2007 3:22:26 PM > Subject: IndexFilter > > Which indexFilter plugin does Nutch use out-of-the-box? Or how do I find > out? I'm trying to figure out how the following fields are being indexed. > > > > anchor > > boost > > content > > digest > > host > > segment > > site > > title > > tstamp > > url > > > > > > > > > > > > > > ____________________________________________________________________________________ > Moody friends. Drama queens. Your life? Nope! - their life, your story. Play > Sims Stories at Yahoo! Games. > http://sims.yahoo.com/ > ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
