[
https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885583#action_12885583
]
Andrzej Bialecki commented on NUTCH-821:
-
+1 for this patch for now - all good comm
[
https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885547#action_12885547
]
Chris A. Mattmann commented on NUTCH-821:
-
Hi Julien:
I reviewed your patch, and am
Hi Cesar,
This can definitely be done using a custom parse plugin and an indexing
plugin. We did something like this some time ago to classify adult pages
using our text classification API
(http://code.google.com/p/textclassification/), which is based on SVM.
Out of interest, what categories are y
Nutch Developers,
I'm in my final year of Computer Science and my graduation project is
related to web search. The plan is to add a page-category filter to
Nutch, in an attempt to use SVM to classify the crawled pages.
So I ask you: do you think I'll have to change internals of Nutch or can
t
[
https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885463#action_12885463
]
Julien Nioche commented on NUTCH-821:
-
@Chris : isn't this restricted to the jars *we* p
[
https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885452#action_12885452
]
Piet Schrijver commented on NUTCH-821:
--
+1 for maven, also having HBase in there would
[
https://issues.apache.org/jira/browse/NUTCH-821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885447#action_12885447
]
Doğacan Güney commented on NUTCH-821:
-
+1 to Chris. In fact, I would ask to piggyback Go