[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-585:
--
Priority: Major (was: Minor)
> [PARSE-HTML plugin] Block certain parts of HTML code from being
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-585:
--
Component/s: parse-filter
HTML
parser
plugin
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-585:
--
Fix Version/s: 1.20
> [PARSE-HTML plugin] Block certain parts of HTML code from being indexed
>
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Roberto Gardenier updated NUTCH-585:
Comment: was deleted
(was: I have compiled nutch 1.5.1 with the provided plugin and used
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-585:
Fix Version/s: (was: 1.5)
1.6
20120304-push-1.6
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Nioche updated NUTCH-585:
Fix Version/s: (was: 1.4)
1.5
Marking for 1.5. Needs reviewing and won't
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Elisabeth Adler updated NUTCH-585:
--
Attachment: blacklist_whitelist_plugin.patch
Based on the suggestions/code above, I have
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-585:
Patch Info: [Patch Available]
Fix Version/s: 1.4
Assignee: Markus Jelsma
Marked for
[
https://issues.apache.org/jira/browse/NUTCH-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
N. Hira updated NUTCH-585:
--
Attachment: nutch-585-jostens-excludeDIVs.patch
We use Solr/Nutch on our corporate web site and are very happy
9 matches
Mail list logo