[ https://issues.apache.org/jira/browse/NUTCH-485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12495319 ]
Andrzej Bialecki commented on NUTCH-485: ----------------------------------------- I think a more natural change would be this: ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc); That is, HtmlParseFilter would get and return an instance of ParseResult (possible the same instance), adding/removing stuff from it as needed. Existing plugins could function as before - they would just need to work on the Parse instance that corresponds to Content.getUrl(). > Change HtmlParseFilter 's to return ParseResult object instead of Parse object > ------------------------------------------------------------------------------ > > Key: NUTCH-485 > URL: https://issues.apache.org/jira/browse/NUTCH-485 > Project: Nutch > Issue Type: Improvement > Components: fetcher > Affects Versions: 1.0.0 > Environment: All > Reporter: Gal Nitzan > Fix For: 1.0.0 > > Attachments: NUTCH-485.200705122151.patch > > > The current implementation of HtmlParseFilters.java doesn't allow a filter to > add parse objects to the ParseResult object. > A change to the HtmlParseFilter is needed which allows the filter to return > ParseResult . and ofcourse a change to HtmlParseFilters . -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.