Hi folks, I am sorry for adding another question to the same mail. I am also writing a plug-in extending HtmlParser. How do I test it with JUnit?
I see the "filter" method takes Content content, ParseResult parseResult,HTMLMetaTags metaTags, DocumentFragment doc as argument. How can I generate these parameters of the test purpose? Thanks, Abi On Tue, Feb 1, 2011 at 12:10 PM, .: Abhishek :. <[email protected]> wrote: > Hi all, > > I am planning to implement a negative keyword indexer such that if a > negative keyword appears in a segment I should never show up it during the > search. I have the following steps in mind, please let me know if its right. > > - Writing a plug-in > - Extend the IndexingFilter. > - Do a NutchDocument.removeField for the negative keyword. > - return the doc > > Now the questions are, > > - The NutchDocument is always mapped as a HTML page, so if I am doing > the thing above, Am I really removing the segment from getting indexed or > am > I preventing the page from being indexed? > > Also, please let me know what I am intending to do is right? Thanks again > all for your time. > > Cheers, > Abhi >

