Hello - there is no Boilerpipe support for 2.x. Markus
-----Original message----- > From:Nana Pandiawan <nana.pandia...@solusi247.com.INVALID> > Sent: Monday 4th July 2016 6:16 > To: user@nutch.apache.org > Subject: Re: Remove Header from content > > Hi Markus Jelsma, > > If Boilerpipe support for Apache Nutch 2.3.1? i have try > https://issues.apache.org/jira/secure/attachment/12708817/nutch-2.x-boilerpipe.patch, > > but doesnt work. > > regards > > On 29/06/16 17:06, Markus Jelsma wrote: > > Manish - you're in luck. Nutch 1.12 was released and has Boilerpipe > > support. Check: > > https://issues.apache.org/jira/browse/NUTCH-961 > > > > Markus > > > > > > > > -----Original message----- > >> From:Manish Verma <m_ve...@apple.com> > >> Sent: Tuesday 28th June 2016 23:46 > >> To: user@nutch.apache.org > >> Subject: Remove Header from content > >> > >> Hi, > >> > >> I don’t want to index header and footer of content , I know we can make > >> changes in HtmlParser.java but I don’t want to change nutch core code, is > >> there any other way(plugin) to eleminate Header div from content. > >> > >> Thanks MV > >> > >> > >