[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility

2007-02-13 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472581 ] Doğacan Güney commented on NUTCH-444: - Hi nutch.newbie, Can you mail me a list of the failing atom urls(or if

[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility

2007-02-13 Thread nutch.newbie (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472596 ] nutch.newbie commented on NUTCH-444: Hi Dogacan: I have done some digging around Rome yesterday and it seems to

[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser

2007-02-13 Thread nutch.newbie (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472669 ] nutch.newbie commented on NUTCH-443: Chris: I been testing NUTCH-444 and NUTCH-443 lately. Renaud and Dogacan

[jira] Assigned: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility

2007-02-13 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-444: --- Assignee: Chris A. Mattmann Possibly use a different library to parse RSS feed for

[jira] Resolved: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore

2007-02-13 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved NUTCH-258. - Resolution: Cannot Reproduce With recent API changes to Hadoop, and with the note from

[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser

2007-02-13 Thread Renaud Richardet (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472733 ] Renaud Richardet commented on NUTCH-443: hi All, Glad to see that this patch is moving forward :-) I have

[jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser

2007-02-13 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472821 ] Doug Cutting commented on NUTCH-443: this patch in some places removes the log guards Most of the log guards

log guards

2007-02-13 Thread Doug Cutting
Doug Cutting (JIRA) wrote: this patch in some places removes the log guards Most of the log guards are misguided. Log guards should only be used on DEBUG level messages in performance-critical inner loops. Since INFO is the expected log level, a guard on INFO WARN level messages does not

Re: log guards

2007-02-13 Thread Jérôme Charron
These guards were all introduced by a patch some time ago. I complained at the time and it was promised that this would be repaired, but it has not yet been. Yes, Sorry Doug that's my own fault I really don't have time to fix this :-( Best regards Jérôme

[jira] Commented: (NUTCH-437) MapFile in Hadoop 0.10.2 has changed, must update references

2007-02-13 Thread [EMAIL PROTECTED] (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472841 ] [EMAIL PROTECTED] commented on NUTCH-437: - +1. I reviewed and applied patch along with a hadoop-0.11.1-core

Re: log guards

2007-02-13 Thread Chris Mattmann
Hi Doug, and Jerome, Ah, yes, the log guard conversation. I remember this from a while back. Hmmm, do you guys know what issue that this recorded as in JIRA? I have some free time recently, so I will be able to add this to my list of Nutch stuff to work on, and would be happy to take the lead

Re: log guards

2007-02-13 Thread Jérôme Charron
Hi Chris, The JIRA issue is the 309 : https://issues.apache.org/jira/browse/NUTCH-309 Thanks for your help. Jérôme On 2/13/07, Chris Mattmann [EMAIL PROTECTED] wrote: Hi Doug, and Jerome, Ah, yes, the log guard conversation. I remember this from a while back. Hmmm, do you guys know what

RE: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue

2007-02-13 Thread Gal Nitzan
Thanks Dennis, it seems it did the trick. Not sure totally, but so it seems :) Gal. -Original Message- From: Dennis Kubes [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 13, 2007 11:09 PM To: nutch-dev@lucene.apache.org Subject: Re: NPE in

[jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility

2007-02-13 Thread Nick Lothian (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12472907 ] Nick Lothian commented on NUTCH-444: I'm a developer on the ROME project and I done some patches to FeedParser.

RE: NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue

2007-02-13 Thread Armel T. Nene
Dennis I was wondering if this patch could fix my problem which is, if not the same, very similar to this one. I am using Nutch 0.8.2-dev, I have made checkout awhile ago from SVN but never updated again. I was able to crawl 1 xml files before with no error whatsoever. This is the following

How to get score in search.jsp

2007-02-13 Thread ????? ???????
Hi Nutch Gurus! I have a small problem. I need to add some changes into search.jsp. I need to get first 50 results and to sort them in different way. I will change the score of each result with formula new_score = nutch_score + domain_score_from_my_db to sort. But i don't understand how to get

How to get score in search.jsp

2007-02-13 Thread Anton Potekhin
Hi Nutch Gurus! I have a small problem. I need to add some changes into search.jsp. I need to get first 50 results and to sort them in different way. I will change the score of each result with formula new_score = nutch_score + domain_score_from_my_db to sort. But i don't understand how to get