Re: Perfomance problems and segmenting

2007-04-23 Thread Briggs
How much memory are you currently allocating to the search servers? On 4/23/07, JoostRuiter [EMAIL PROTECTED] wrote: Hi All, First off, I'm quite the noob when it comes to Nutch, so don't bash me if the following is an enormously stupid question. We're using Nutch on a P4 Duo Core system

Perfomance problems and segmenting

2007-04-23 Thread JoostRuiter
Hi All, First off, I'm quite the noob when it comes to Nutch, so don't bash me if the following is an enormously stupid question. We're using Nutch on a P4 Duo Core system (800mhz fsb) with 4gig RAM and a 500gig SATA (3gig/sec) HD. We indexed 350 000 pages into 1 segment of 15gig. Performance

Re: [Nutch-dev] Creating a new scoring filter

2007-04-23 Thread Briggs
Yes. I too need to alter the score based on attributes and such of the particular url passed. May I ask what you have done? On 4/22/07, Lorenzo [EMAIL PROTECTED] wrote: Perfect! Now I have it working, and it performs quite well for a focused serch engine like ours! Do you think it could be an

Re: Perfomance problems and segmenting

2007-04-23 Thread JoostRuiter
Dear Briggs, Currently we allocated 1gig for JVM and Resin/Tomcat. Greetings, Joost Briggs wrote: How much memory are you currently allocating to the search servers? On 4/23/07, JoostRuiter [EMAIL PROTECTED] wrote: Hi All, First off, I'm quite the noob when it comes to Nutch,

Re: Perfomance problems and segmenting

2007-04-23 Thread Dennis Kubes
Without more information this sounds like your tomcat search nutch-site.xml file is setup to use the DFS rather than the local file system. Remember that processing jobs occurs on the DFS but for searching, indexes are best moved to the local file system. Dennis Kubes JoostRuiter wrote: Hi

Re: ApacheCon in Amsterdam

2007-04-23 Thread Doug Cutting
Tom White wrote: I will be there too. Unfortunately I won't be able to attend after all. The new baby in the house won't let me! Doug

Re: Perfomance problems and segmenting

2007-04-23 Thread Briggs
One more thing... Are you using a distributed index? If this is so, you do not want to do this; indexes should be local to the machine that is being searched. On 4/23/07, Dennis Kubes [EMAIL PROTECTED] wrote: Without more information this sounds like your tomcat search nutch-site.xml file is

modifications to geoPosition plugin to get it working on nutch 0.9

2007-04-23 Thread Mike Schwartz
I have modified the geoPosition plugin (http://wiki.apache.org/nutch/GeoPosition) code to work with nutch 0.9. (The code was built originally using nutch 0.7.) I'd like to contribute my changes to the nutch project. I already communicated with the code's author (Matthias Jaekle), and he

[jira] Updated: (NUTCH-468) Scoring filter should distribute score to all outlinks at once

2007-04-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-468: Attachment: scoring-v2.patch That makes sense, patch with the suggested change. Scoring filter