[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread Vishal Shah (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507144 ] Vishal Shah commented on NUTCH-503: --- Hi Emmanuel, Can you please dump the contents of your crawldb after

[jira] Commented: (NUTCH-471) Fix synchronization in NutchBean creation

2007-06-22 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507145 ] Hudson commented on NUTCH-471: -- Integrated in Nutch-Nightly #125 (See

where to put hadoop native lib in tomcat?

2007-06-22 Thread qi wu
Where should I put the hadoop native lib file like libhadoop.so for the searching function ? I have tried to put it in the dir like: /data/apache-tomcat-5.5.23/webapps/ROOT/WEB-INF/lib/native.. and this doesn't work. Thanks!

[jira] Created: (NUTCH-504) NUTCH-443 broke parsing during fetching

2007-06-22 Thread JIRA
NUTCH-443 broke parsing during fetching --- Key: NUTCH-504 URL: https://issues.apache.org/jira/browse/NUTCH-504 Project: Nutch Issue Type: Bug Components: fetcher Affects Versions: 1.0.0

[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-504: Attachment: parse_in_fetchers.patch Patch for the problem. I think it would be nice to add a test

[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507162 ] Doğacan Güney commented on NUTCH-504: - Also, should we actually index documents even if their parses have failed?

[jira] Commented: (NUTCH-465) I download nutch 0.9 used tar zxvf nutch-0.9.tar.gz at last A lone zero block

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507167 ] Doğacan Güney commented on NUTCH-465: - Which mirror did you download it from? I download nutch 0.9 used tar

[jira] Commented: (NUTCH-504) NUTCH-443 broke parsing during fetching

2007-06-22 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507168 ] Andrzej Bialecki commented on NUTCH-504: - +1 - we should skip documents that failed to parse properly, in

[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507169 ] Doğacan Güney commented on NUTCH-503: - Also, how many machines are there on your cluster and which version of

[jira] Issue Comment Edited: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507169 ] Doğacan Güney edited comment on NUTCH-503 at 6/22/07 1:58 AM: -- Also, how many machines

[jira] Updated: (NUTCH-504) NUTCH-443 broke parsing during fetching

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-504: Attachment: NUTCH-504_v2.patch New version. * Includes older patch. * Indexer filters unsuccessful

[jira] Commented: (NUTCH-479) Support for OR queries

2007-06-22 Thread Rob Young (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507221 ] Rob Young commented on NUTCH-479: - How would this work in the following case? search phrase category:cat1 OR

[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507366 ] Doğacan Güney commented on NUTCH-468: - Latest patch still applies to current trunk. If no one has objections I am

[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread Emmanuel Joke (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507469 ] Emmanuel Joke commented on NUTCH-503: - Sorry, my mistake. My compiled jar was not correctly included in my

[jira] Commented: (NUTCH-479) Support for OR queries

2007-06-22 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507473 ] Doug Cutting commented on NUTCH-479: Neither. It would end up as the Lucene query: +search phrase

[jira] Commented: (NUTCH-503) Generator exits incorrectly for small fetchlists

2007-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507535 ] Doğacan Güney commented on NUTCH-503: - Nice to hear, Emmanuel. I believe this is ready for committing, but,