Extracting a new field requires a purpose-built indexing filter plug-in, so if none of the existing plug-ins does it for you, you have to build one yourself. It is relatively easy and well explained on the wiki. http://www.mail-archive.com/[email protected]/msg14397.html
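To give an idea of what such a plug-in has to do, here is a minimal sketch of the extraction step only: pulling the value of a named meta tag (page_title in this thread) out of raw HTML. It uses nothing but the JDK. The class name MetaTagExtractor and the method extract are hypothetical names made up for this illustration and are not part of Nutch. A real plug-in would do this work inside Nutch's parse and indexing filter extension points and add the value to the document being indexed; that wiring is not shown here.

```java
import java.util.Locale;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not a Nutch class: finds <meta name="..." content="...">.
public class MetaTagExtractor {
    // Matches a whole <meta ...> tag, case-insensitively.
    private static final Pattern META_TAG =
        Pattern.compile("<meta\\b[^>]*>", Pattern.CASE_INSENSITIVE);
    // Matches one quoted attribute: group 1 = name, group 3 or 4 = value.
    private static final Pattern ATTR =
        Pattern.compile("([a-zA-Z_:][-a-zA-Z0-9_:.]*)\\s*=\\s*(\"([^\"]*)\"|'([^']*)')");

    // Returns the content of the first meta tag whose name equals metaName.
    public static Optional<String> extract(String html, String metaName) {
        Matcher tag = META_TAG.matcher(html);
        while (tag.find()) {
            String name = null;
            String content = null;
            // Read attributes one by one so their order in the tag does not matter.
            Matcher attr = ATTR.matcher(tag.group());
            while (attr.find()) {
                String key = attr.group(1).toLowerCase(Locale.ROOT);
                String value = attr.group(3) != null ? attr.group(3) : attr.group(4);
                if (key.equals("name")) {
                    name = value;
                } else if (key.equals("content")) {
                    content = value;
                }
            }
            if (metaName.equalsIgnoreCase(name) && content != null) {
                return Optional.of(content.trim());
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        String html = "<html><head><title>Generic</title>"
            + "<meta name=\"page_title\" content=\"Actual Page Title\">"
            + "</head><body></body></html>";
        // Prints the extracted value, or "(none)" if the tag is missing.
        System.out.println(extract(html, "page_title").orElse("(none)"));
    }
}
```

A regular expression is enough for a sketch like this, but inside Nutch it would probably be more robust to read the meta tags from the already parsed document rather than re-scanning the raw HTML.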
I'm not completely sure what conf/custom-fields.xml is used for, but it does not create a new field. I know because I made the same assumption as you did when I started. I assume it tells the different components how to handle the new field when they see it.

2009/8/28 Mohamed Parvez <[email protected]>

> what plug-in is it?
>
> In the plug-in directory i don't see anything by name custom-fields.
>
> ----
> Thanks/Regards,
> Parvez
> GV : 786-693-2228
>
>
> On Fri, Aug 28, 2009 at 4:40 AM, MilleBii <[email protected]> wrote:
>
> > If there is nothing in the index it is most probably that you forgot to add
> > the plug-in that indexes this field.
> >
> > 2009/8/28 Mohamed Parvez <[email protected]>
> >
> > > Hello All,
> > >
> > > I am using Nutch 1.0
> > >
> > > In html pages of my website, there is a meta tag called page_title,
> > > which will have the actual page title.
> > >
> > > I see that there is an option to add custom fields in the file
> > > conf/custom-fields.xml
> > >
> > > <properties>
> > > <entry key="field.name">page_title</entry>
> > > <entry key="field.indexed">yes</entry>
> > > <entry key="field.stored">yes</entry>
> > > <entry key="field.tokenized">no</entry>
> > > <entry key="field.boost">1.0</entry>
> > > <entry key="field.multi">false</entry>-->
> > > </properties>
> > >
> > > I added the field name in that file but don't see it in the index,
> > > when i opened in the index using luke.
> > >
> > > Is there any documentation on using the file conf/custom-fields.xml
> > > or If some one can tell me how to use, it will be great help.
> > >
> > > ---
> > > Thanks/Regards,
> > > Parvez
> > >
> >
> > --
> > -MilleBii-
>

--
-MilleBii-
