[ https://issues.apache.org/jira/browse/NUTCH-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16239740#comment-16239740 ]
Hudson commented on NUTCH-2443: ------------------------------- SUCCESS: Integrated in Jenkins build Nutch-trunk #3465 (See [https://builds.apache.org/job/Nutch-trunk/3465/]) NUTCH-2443 add source tag to the parse-html and parse-tika outlink (jorge-luis.betancourt: [https://github.com/apache/nutch/commit/d34a002b25a770369ad6a5a20475c7072d8fa02b]) * (edit) src/plugin/parse-tika/src/test/org/apache/nutch/tika/TestDOMContentUtils.java * (edit) src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java * (edit) src/plugin/parse-tika/src/java/org/apache/nutch/parse/tika/DOMContentUtils.java * (edit) src/plugin/parse-html/src/test/org/apache/nutch/parse/html/TestDOMContentUtils.java > Extract links from the video tag with the parse-html plugin > ----------------------------------------------------------- > > Key: NUTCH-2443 > URL: https://issues.apache.org/jira/browse/NUTCH-2443 > Project: Nutch > Issue Type: Improvement > Components: parser, plugin > Affects Versions: 1.13 > Reporter: Jorge Luis Betancourt Gonzalez > Assignee: Jorge Luis Betancourt Gonzalez > Priority: Minor > Fix For: 1.14 > > > At the moment the {{parse-html}} extracts links from the tags {{a, area, > form}} (configurable){{, frame, iframe, script, link, img}}. Since we allow > extracting links to binary files (images) extracting links also from the > {{video}} tag should be supported. -- This message was sent by Atlassian JIRA (v6.4.14#64029)