Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-19 Thread Paul Sponagl
+1 for a bug (tested two days agon - was not sure if i simply missed something) 2007-01-17 12:03:07,691 WARN util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable 2007-01-17 12:03:07,722 WARN mapred.LocalJobRunner

[jira] Commented: (NUTCH-48) Did you mean query enhancement/refignment feature request

2007-01-19 Thread fantoni benjamin (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-48?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465990 ] fantoni benjamin commented on NUTCH-48: --- Somebody can show me how I can integrate the DID YOU MEAN plugin into

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-19 Thread Andrzej Bialecki
Paul Sponagl wrote: +1 for a bug (tested two days agon - was not sure if i simply missed something) Could you guys come up with exact data that causes this bug (primarily I'm interested in a seed list, because then I can see that you simply use the crawl tool, and finally try to run

java.lang.IllegalStateException

2007-01-19 Thread Armel T. Nene
Hi guys, I am using Nutch 0.8.1, for the past 2 days I have been getting the following exception: Java.Lang.IllegalStateException. The exception started after I implementing the Nutch-61 patch; Adaptive Re-crawl Interval. In short, this happens: I am trying to crawl XML files (locally and

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-19 Thread Paul Sponagl
seed: http://www.koeln.de crawl: bin/nutch crawl urls -dir crawl -depth 3 -topN 10 Am 19.01.2007 um 10:29 schrieb Andrzej Bialecki: Paul Sponagl wrote: +1 for a bug (tested two days agon - was not sure if i simply missed something) Could you guys come up with exact data that causes this

Re: java.io.EOFException in latest nightly in mergesegs from hadoop.io.DataOutputBuffer

2007-01-19 Thread Brian Whitman
On Jan 19, 2007, at 4:29 AM, Andrzej Bialecki wrote: Could you guys come up with exact data that causes this bug (primarily I'm interested in a seed list, because then I can see that you simply use the crawl tool, and finally try to run mergesegs). Thanks! My seed list is simply my

Re: Next Nutch release

2007-01-19 Thread Doug Cutting
Stefan Groschupf wrote: I don't want to start a emotional discussion here, however talking about the problem in public might help. What, specifically, is the problem you perceive? Doug

Re: Next Nutch release

2007-01-19 Thread Dennis Kubes
Just to put in my view. Stefan Groschupf wrote: Hi Andrzej, thank you for taking the time to comment, I highly value your comments. * I guess that for each case where Nutch seems inappropriate I could give you a counter-example of Nutch being used commercially with much success. I guess it

Re: Next Nutch release

2007-01-19 Thread Doug Cutting
Dennis Kubes wrote: I will say that it is difficult for people to understand how to get more involved. I have been working with Nutch and Hadoop for almost a year now on a daily basis and only now am I understanding how to contribute through jira, etc. There needs to be more guidance in

Re: Next Nutch release

2007-01-19 Thread Andrzej Bialecki
Dennis Kubes wrote: I completely agree with this. I am interested in devoting as much time as possible to seeing the success of Nutch, Hadoop, and Lucene. As our business grows I would also be willing to devote developers full time to work on Nutch, Hadoop, and Lucene. I think that at