Added 0 pages

2006-07-14 Thread Julius Schorzman
I'm having trouble figuring out why I keep getting "Added 0 pages" when running the crawl with Nutch. I've searched the site and can't find an answer as to what might be going wrong. I'm running this on Windows using Eclipse because I may have to change the code slightly. I've already made a few

Re: 0.8.0 stable enough to use?

2006-07-14 Thread Lukas Vlcek
Hi, That thread was on the dev list, and you can see it here: http://www.mail-archive.com/nutch-developers@lists.sourceforge.net/msg08765.html I can see that several people (including me) had some objections; on the other hand, releasing 0.8 now could speed up the patching of some of these issues :-)

Unused Segments

2006-07-14 Thread Lourival Júnior
How can I discover which segments are unused by the index? After many recrawls I have a lot of segments, so I would like to delete some of them... Who can help me? -- Lourival Junior Universidade Federal do Pará Curso de Bacharelado em Sistemas de Informação http://www.ufpa.br/cbsi

Nullpointer exception dependent on search terms

2006-07-14 Thread Chris Stephens
Certain search terms cause this exception: 60714 133854 25443840 SEVERE Servlet.service() for servlet OpenSearch threw exception java.lang.RuntimeException: java.lang.NullPointerException at org.apache.nutch.searcher.FetchedSegments.getSummary(FetchedSegments.java:190) at

debian 3.1

2006-07-14 Thread Schackenberg, Benedikt
Hello list, I would like to install Nutch on Debian 3.1. Can anyone point me to a wiki page or howto? -- SP data GmbH T 06131 218111 F 06131 218112 E [EMAIL PROTECTED] W www.termindoc.de PGP-Key-ID: 0x0D2E4AE4 Our legal notice can be found at

Re: Nullpointer exception dependent on search terms

2006-07-14 Thread Chris Stephens
FYI I fixed the issue. I didn't realize Fedora uses GCJ by default. I switched to Sun JDK 1.5 and everything works fine now. Chris Stephens wrote: Certain search terms cause this exception: 60714 133854 25443840 SEVERE Servlet.service() for servlet OpenSearch threw exception

Nutch on Windows

2006-07-14 Thread Kerry Wilson
I'm trying to use Nutch on Windows, but the executables are shell scripts. How do you use Nutch on Windows? -- Kerry Wilson Lead Developer Williams Web [EMAIL PROTECTED] | 423.485.4747

Intranet Recrawl Script for 0.8.0

2006-07-14 Thread Matthew Holt
Does anyone have a good Intranet recrawl script for nutch-0.8.0? Thanks.. Matt

RE: Nutch on Windows

2006-07-14 Thread Dan Morrill
Kerry, I run Nutch entirely on Windows; use Cygwin. If you have other questions, give me a shout. r/d -Original Message- From: Kerry Wilson [mailto:[EMAIL PROTECTED] Sent: Friday, July 14, 2006 11:50 AM To: nutch-user@lucene.apache.org Subject: Nutch on Windows Trying to use nutch

RE: Nutch on Windows

2006-07-14 Thread Bipin Parmar
Kerry, Cygwin is definitely a good option; however, if you do not want to use Cygwin, I can send you the nutch.bat file. Could you please let me know which version of Nutch you are using? There is a small difference in running Nutch 0.7.2, 0.8-dev (the June version, which uses the nutch.log.dir JVM parameter) and

Re: Nutch on Windows

2006-07-14 Thread Håvard W. Kongsgård
Kerry Wilson wrote: I'm trying to use Nutch on Windows, but the executables are shell scripts. How do you use Nutch on Windows? http://wiki.apache.org/nutch/GettingNutchRunningWithWindows

Re: page ranking computation in Nutch 08

2006-07-14 Thread Feng Ji
I am having difficulty finding which Java class contains these functions. Thanks, Feng Ji On 6/25/06, Andrzej Bialecki [EMAIL PROTECTED] wrote: TDLN wrote: In 0.8-dev the score is calculated in a ScoringFilter implementation; the default is the score-opic plugin

Parser returning several ParseData?

2006-07-14 Thread HUYLEBROECK Jeremy RD-ILAB-SSF
I am in need of feedback/ideas. ;) What would be the cleanest way to return several ParseData (or Parse) objects from getParse, rather than only one, and still use the rest of the framework? Has anybody done this? I have looked at the different classes and where it could be done, but I always find myself