[jira] Commented: (NUTCH-419) unavailable robots.txt kills fetch

2009-03-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678574#action_12678574 ] Hudson commented on NUTCH-419: -- Integrated in Nutch-trunk #742 (See [http://hudson.zones.apach

[jira] Commented: (NUTCH-700) Neko1.9.11 goes into a loop

2009-03-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678575#action_12678575 ] Hudson commented on NUTCH-700: -- Integrated in Nutch-trunk #742 (See [http://hudson.zones.apach

[jira] Commented: (NUTCH-669) Consolidate code for Fetcher and Fetcher2

2009-03-03 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678573#action_12678573 ] Hudson commented on NUTCH-669: -- Integrated in Nutch-trunk #742 (See [http://hudson.zones.apach

Hudson build is back to normal: Nutch-trunk #742

2009-03-03 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/742/

Re: site: operator with no query term

2009-03-03 Thread John Martyniak
Frank, I don't know what the timing on completing something like this is, but this would be a nice feature to have in 1.0, if that is even possible at this time. -John On Mar 3, 2009, at 5:19 PM, Otis Gospodnetic wrote: Absolutely! I see you are at home with JIRA, so I don't have to

Re: Is there the functions of "More Like This" and "Spell Checking"?

2009-03-03 Thread dealmaker
How do I use morelikethis with Nutch? Nutch no longer have index document number, it's using key. Otis Gospodnetic-2 wrote: > > > If you are not using Solr then you can't really use Solr's MoreLikeThis, > no. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > >

Re: site: operator with no query term

2009-03-03 Thread Otis Gospodnetic
Absolutely! I see you are at home with JIRA, so I don't have to ask. :) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Frank McCown > To: nutch-dev@lucene.apache.org > Sent: Tuesday, March 3, 2009 9:39:24 AM > Subject: site: operator wi

Re: Is there the functions of "More Like This" and "Spell Checking"?

2009-03-03 Thread Otis Gospodnetic
If you are not using Solr then you can't really use Solr's MoreLikeThis, no. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: dealmaker > To: nutch-dev@lucene.apache.org > Sent: Tuesday, March 3, 2009 1:33:01 PM > Subject: Re: Is there the

Re: Is there the functions of "More Like This" and "Spell Checking"?

2009-03-03 Thread dealmaker
I am not using solr. I am using nutch to search for related urls to a url that user type. Can I still use solr's morelikethis in this case? Otis Gospodnetic-2 wrote: > > > If you use the Nutch->Solr functionality, you can rely on Solr's > MoreLikeThis and Solr's SpellCheckComponent (both are

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Tim Hawkins (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678335#action_12678335 ] Tim Hawkins commented on NUTCH-709: --- Ill install this on our rig and test it and get back

[jira] Updated: (NUTCH-650) Hbase Integration

2009-03-03 Thread Andrew McCall (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew McCall updated NUTCH-650: Attachment: meta.patch I've updated the way the TMP_X_MARK metadata is handled to allow multiple fet

[jira] Created: (NUTCH-710) Support for rel="canonical" attribute

2009-03-03 Thread Frank McCown (JIRA)
Support for rel="canonical" attribute - Key: NUTCH-710 URL: https://issues.apache.org/jira/browse/NUTCH-710 Project: Nutch Issue Type: New Feature Affects Versions: 1.1 Reporter: Frank McCo

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Tim Hawkins (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678330#action_12678330 ] Tim Hawkins commented on NUTCH-709: --- I was going to do exactly the same thing for now :-)

site: operator with no query term

2009-03-03 Thread Frank McCown
Google, Yahoo, and Live list all pages they have indexed for the "site:www.example.com" query. But Nutch returns back 0 results unless a query term is also supplied (e.g., "site:www.example.com term"). Would it be better for Nutch to respond in the same manner that other search engines do? This i

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678325#action_12678325 ] Julien Nioche commented on NUTCH-709: - the patch above does not fix the issue but preven

[jira] Updated: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-709: Attachment: JSParseFilter.error.patch This patch catches errors in the walk method of JSParser and r

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Martina Koch (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678319#action_12678319 ] Martina Koch commented on NUTCH-709: I've had the same issue (see mail http://www.mail-

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Tim Hawkins (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678316#action_12678316 ] Tim Hawkins commented on NUTCH-709: --- afraid not, its happened several hours into a crawl,

[jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678314#action_12678314 ] Julien Nioche commented on NUTCH-709: - do you know the URL of the document causing this

[jira] Created: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack

2009-03-03 Thread Tim Hawkins (JIRA)
JSParseFilter gets into an infinate loop and ets all the stack --- Key: NUTCH-709 URL: https://issues.apache.org/jira/browse/NUTCH-709 Project: Nutch Issue Type: Bug Comp

Re: Is there the functions of "More Like This" and "Spell Checking"?

2009-03-03 Thread Otis Gospodnetic
If you use the Nutch->Solr functionality, you can rely on Solr's MoreLikeThis and Solr's SpellCheckComponent (both are described on Solr's wiki) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: dealmaker > To: nutch-dev@lucene.apache.org

[jira] Updated: (NUTCH-650) Hbase Integration

2009-03-03 Thread Andrew McCall (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew McCall updated NUTCH-650: Attachment: slash.patch Fixes an issue the above patch creates, if there is a zero length filename

Re: Is there the functions of "More Like This" and "Spell Checking"?

2009-03-03 Thread dealmaker
morelikethis is from the lucene library, and it was compatible with nutch 0.9. But it seems that it is no longer compatible with nutch 1.0/nightly build. Can anyone get morelikethis working with nutch nightly build? buddha1021 wrote: > > hi: > Is there the functions of "More Like This" an