[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12579285#action_12579285
]
Emmanuel Joke commented on NUTCH-530:
-
OK
> Add a combiner to improve performance on up
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12578961#action_12578961
]
Andrzej Bialecki commented on NUTCH-530:
-
If there are no new arguments for/against
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12525475
]
Andrzej Bialecki commented on NUTCH-530:
-
I'm still against this patch, exactly because we are not sure how m
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12525418
]
Doğacan Güney commented on NUTCH-530:
-
Andrzej, what do you think about this one in light of Emmanuel's last comme
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516675
]
Emmanuel Joke commented on NUTCH-530:
-
Actually I don't re-use CrawlDbReducer, I've defined a new class as Combiner
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516673
]
Andrzej Bialecki commented on NUTCH-530:
-
-1 from me.
See the recent discussion on Hadoop-dev - combiners si
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516621
]
Doğacan Güney commented on NUTCH-530:
-
Yeah, you are right.
+1 from me.
> Add a combiner to improve performance
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516602
]
Emmanuel Joke commented on NUTCH-530:
-
I'm not sure I follow your point regarding the number of outlinks.
I don't thin
[
https://issues.apache.org/jira/browse/NUTCH-530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516357
]
Doğacan Güney commented on NUTCH-530:
-
Ehm, I am not sure about this... After this, we call updateDbScore twice,