[
https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507144
]
Vishal Shah commented on NUTCH-503:
---
Hi Emmanuel,
Can you please dump the contents of your crawldb after
[
https://issues.apache.org/jira/browse/NUTCH-471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507145
]
Hudson commented on NUTCH-471:
--
Integrated in Nutch-Nightly #125 (See
Where should I put the Hadoop native library file (e.g. libhadoop.so) for the
search function? I tried putting it in a directory like:
/data/apache-tomcat-5.5.23/webapps/ROOT/WEB-INF/lib/native..
but this doesn't work.
Thanks!
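One way to narrow this down, assuming the native library is loaded through the JVM's standard `System.loadLibrary` mechanism, is to check which directories on `java.library.path` actually contain the file. The sketch below is only a diagnostic; the class name `NativeLibCheck` and the helper `dirsContaining` are hypothetical and not part of Nutch or Hadoop.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

public class NativeLibCheck {

    /**
     * Returns the directories on the given library path that contain
     * a regular file with the given name. (Hypothetical helper.)
     */
    public static List<String> dirsContaining(String libraryPath, String fileName) {
        List<String> hits = new ArrayList<>();
        if (libraryPath == null || libraryPath.isEmpty()) {
            return hits;
        }
        // java.library.path uses the platform path separator (":" on Linux).
        for (String dir : libraryPath.split(File.pathSeparator)) {
            if (dir.isEmpty()) {
                continue;
            }
            if (new File(dir, fileName).isFile()) {
                hits.add(dir);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // The JVM searches these directories when System.loadLibrary is called.
        String path = System.getProperty("java.library.path");
        // Maps "hadoop" to the platform file name, e.g. libhadoop.so on Linux.
        String fileName = System.mapLibraryName("hadoop");

        System.out.println("java.library.path = " + path);
        System.out.println("looking for       = " + fileName);
        System.out.println("found in          = " + dirsContaining(path, fileName));
    }
}
```

If the list printed last is empty when this runs with the same JVM settings as the servlet container, the directory holding the library is probably not on `java.library.path`; it can be added with the JVM option `-Djava.library.path=...` at startup.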
NUTCH-443 broke parsing during fetching
---
Key: NUTCH-504
URL: https://issues.apache.org/jira/browse/NUTCH-504
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 1.0.0
[
https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-504:
Attachment: parse_in_fetchers.patch
Patch for the problem. I think it would be nice to add a test
[
https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507162
]
Doğacan Güney commented on NUTCH-504:
-
Also, should we actually index documents even if their parses have failed?
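To make the question concrete, here is a minimal sketch of an indexing step that skips documents whose parse failed. It does not use Nutch's own classes: `Doc`, `parseSucceeded`, and `urlsToIndex` are hypothetical stand-ins chosen for illustration, and the attached patch may handle this differently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SkipFailedParses {

    /** Hypothetical stand-in for a fetched document and its parse outcome. */
    public static final class Doc {
        public final String url;
        public final boolean parseSucceeded;

        public Doc(String url, boolean parseSucceeded) {
            this.url = url;
            this.parseSucceeded = parseSucceeded;
        }
    }

    /** Returns the URLs of documents that parsed successfully, in input order. */
    public static List<String> urlsToIndex(List<Doc> docs) {
        List<String> result = new ArrayList<>();
        for (Doc doc : docs) {
            if (!doc.parseSucceeded) {
                continue; // skip: nothing reliable to index for a failed parse
            }
            result.add(doc.url);
        }
        return result;
    }

    public static void main(String[] args) {
        List<Doc> docs = Arrays.asList(
                new Doc("http://example.com/a", true),
                new Doc("http://example.com/b", false),
                new Doc("http://example.com/c", true));
        System.out.println(urlsToIndex(docs));
    }
}
```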
[
https://issues.apache.org/jira/browse/NUTCH-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507167
]
Doğacan Güney commented on NUTCH-465:
-
Which mirror did you download it from?
I downloaded Nutch 0.9 and used tar
[
https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507168
]
Andrzej Bialecki commented on NUTCH-504:
-
+1 - we should skip documents that failed to parse properly, in
[
https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507169
]
Doğacan Güney commented on NUTCH-503:
-
Also, how many machines are there on your cluster and which version of
[
https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507169
]
Doğacan Güney edited comment on NUTCH-503 at 6/22/07 1:58 AM:
--
Also, how many machines
[
https://issues.apache.org/jira/browse/NUTCH-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Doğacan Güney updated NUTCH-504:
Attachment: NUTCH-504_v2.patch
New version.
* Includes older patch.
* Indexer filters unsuccessful
[
https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507221
]
Rob Young commented on NUTCH-479:
-
How would this work in the following case?
search phrase category:cat1 OR
[
https://issues.apache.org/jira/browse/NUTCH-468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507366
]
Doğacan Güney commented on NUTCH-468:
-
Latest patch still applies to current trunk. If no one has objections I am
[
https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507469
]
Emmanuel Joke commented on NUTCH-503:
-
Sorry, my mistake.
My compiled jar was not correctly included in my
[
https://issues.apache.org/jira/browse/NUTCH-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507473
]
Doug Cutting commented on NUTCH-479:
Neither. It would end up as the Lucene query:
+search phrase
[
https://issues.apache.org/jira/browse/NUTCH-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12507535
]
Doğacan Güney commented on NUTCH-503:
-
Nice to hear, Emmanuel.
I believe this is ready for committing, but,