[ https://issues.apache.org/jira/browse/NUTCH-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16211065#comment-16211065 ]
Jorge Luis Betancourt Gonzalez commented on NUTCH-2443:
-------------------------------------------------------

It's not hard to add more tags, but honestly I'm seeing a lot of those tags with URL-valued attributes for the first time. The question is: should we have them _all_ in the actual implementation?

> Extract links from the video tag with the parse-html plugin
> -----------------------------------------------------------
>
>                 Key: NUTCH-2443
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2443
>             Project: Nutch
>          Issue Type: Improvement
>          Components: parser, plugin
>    Affects Versions: 1.13
>            Reporter: Jorge Luis Betancourt Gonzalez
>            Assignee: Jorge Luis Betancourt Gonzalez
>            Priority: Minor
>             Fix For: 1.14
>
>
> At the moment the {{parse-html}} plugin extracts links from the tags {{a, area,
> form}} (configurable){{, frame, iframe, script, link, img}}. Since we allow
> extracting links to binary files (images), extracting links from the
> {{video}} tag should also be supported.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)