[
https://issues.apache.org/jira/browse/NUTCH-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12534869
]
Enis Soztutar commented on NUTCH-442:
-
Using nutch with solr has been a very demanding request, so it will be
[
https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcin Okraszewski updated NUTCH-488:
-
Attachment: ignore_tags_v3.patch
OK, yet another approach based on Doğacan comments.
[
https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535014
]
Dennis Kubes commented on NUTCH-488:
Tested. Working good +1
Avoid parsing uneccessary links and get a more