[
https://issues.apache.org/jira/browse/NUTCH-289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508445
]
Doğacan Güney commented on NUTCH-289:
-
It seems this issue has kind of died down, but this would be a great
[
https://issues.apache.org/jira/browse/NUTCH-499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508449
]
Sami Siren commented on NUTCH-499:
--
+1, seems good to me
Refactor LinkDb and LinkDbMerger to reuse code
Hi list,
There is this sentence at the end of every JIRA message:
You can reply to this email to add a comment to the issue online.
But, replying to a JIRA message through nutch-dev doesn't add it as a
comment. So you have to either reply to an email through JIRA (in
which case, it looks like
[
https://issues.apache.org/jira/browse/NUTCH-434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney closed NUTCH-434.
---
Issue resolved and committed.
Replace usage of ObjectWritable with something based on GenericWritable
[
https://issues.apache.org/jira/browse/NUTCH-499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney resolved NUTCH-499.
-
Resolution: Fixed
Fix Version/s: 1.0.0
Committed in rev. 551098.
Refactor LinkDb and
[
https://issues.apache.org/jira/browse/NUTCH-499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney closed NUTCH-499.
---
Issue resolved and committed.
Refactor LinkDb and LinkDbMerger to reuse code
[
https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508479
]
Rob Young commented on NUTCH-479:
-
Hi I've found a bug in this patch. If I search for title:red ORtitle:blue I
[
https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rob Young updated NUTCH-479:
Attachment: or.patch
I've changed the patch slightly to work around the bug I mentioned earlier.
Now the
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508505
]
Doğacan Güney commented on NUTCH-498:
-
I tested creating a linkdb from ~6M urls:
Combine input records
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508506
]
Andrzej Bialecki commented on NUTCH-498:
-
+1.
Use Combiner in LinkDb to increase speed of linkdb
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12508508
]
Sami Siren commented on NUTCH-498:
--
+1
Use Combiner in LinkDb to increase speed of linkdb generation
[
https://issues.apache.org/jira/browse/NUTCH-498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney resolved NUTCH-498.
-
Resolution: Fixed
Fix Version/s: 1.0.0
Assignee: Doğacan Güney
Committed in rev.
The problem is that nutch-dev (like most Apache mailing lists) sets the
Reply-to header to be itself, so that responses don't go back to the
sender. If you override this when responding (changing the To: line)
and respond to the sender, then it should end up as a comment, which
will be then
wow, setting db.max.outlinks.per.page immediately fixed my problem. It looks
like I totally mis-diagnosed things.
May I pose two questions:
1) how did you view all the outlinks?
2) how severe is NUTCH-119 - does it occur on a lot of sites?
- Original Message
From: Doğacan Güney
14 matches
Mail list logo