Re: how to prune unmatched url??

2007-04-23 Thread franklinb4u
i guess java program can be compiled once and then it can be run anywhere... so once compiled in widows and then if that package can be used in Unix,then explain me the further steps.. so if its possible to compile the code in eclipse,then please tell me how to do.. i don have any idea abt eclips

Re: how to prune unmatched url??

2007-04-23 Thread Ratnesh,V2Solutions India
y don't u compile nutch in eclipse if you are working in windows enviornment, then u need not to download ant . if you can proceed with that then i can explain you rest. in linux i have worked only till deployment and not done any testing and running of nutch source code. Thanks "Ratnesh,V2Solut

Re: how to prune unmatched url??

2007-04-23 Thread franklinb4u
hi, I ve downloaded apache-ant-1.7.0 version... the idea is to compile the nutch source code.. and i ve placed in my nutch directory.. does this means the installation of ant is over...? or is there any steps to be followed... if so kindly tell me the steps which i have to follow to compile the j

[jira] Updated: (NUTCH-468) Scoring filter should distribute score to all outlinks at once

2007-04-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney updated NUTCH-468: Attachment: scoring-v2.patch That makes sense, patch with the suggested change. > Scoring filter sh

modifications to geoPosition plugin to get it working on nutch 0.9

2007-04-23 Thread Mike Schwartz
I have modified the geoPosition plugin (http://wiki.apache.org/nutch/GeoPosition) code to work with nutch 0.9. (The code was built originally using nutch 0.7.) I'd like to contribute my changes to the nutch project. I already communicated with the code's author (Matthias Jaekle), and he agre

[jira] Created: (NUTCH-469) changes to geoPosition plugin to make it work on nutch 0.9

2007-04-23 Thread Mike Schwartz (JIRA)
changes to geoPosition plugin to make it work on nutch 0.9 -- Key: NUTCH-469 URL: https://issues.apache.org/jira/browse/NUTCH-469 Project: Nutch Issue Type: Improvement Compon

Re: Perfomance problems and segmenting

2007-04-23 Thread Briggs
One more thing... Are you using a distributed index? If this is so, you do not want to do this; indexes should be local to the machine that is being searched. On 4/23/07, Dennis Kubes <[EMAIL PROTECTED]> wrote: Without more information this sounds like your tomcat search nutch-site.xml file is

[jira] Commented: (NUTCH-468) Scoring filter should distribute score to all outlinks at once

2007-04-23 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12491051 ] Nicolás Lichtmaier commented on NUTCH-468: -- This patch would be useful to me. Just one very minor thing: He

Re: ApacheCon in Amsterdam

2007-04-23 Thread Doug Cutting
Tom White wrote: I will be there too. Unfortunately I won't be able to attend after all. The new baby in the house won't let me! Doug

Re: Perfomance problems and segmenting

2007-04-23 Thread Dennis Kubes
Without more information this sounds like your tomcat search nutch-site.xml file is setup to use the DFS rather than the local file system. Remember that processing jobs occurs on the DFS but for searching, indexes are best moved to the local file system. Dennis Kubes JoostRuiter wrote: Hi

Re: Perfomance problems and segmenting

2007-04-23 Thread JoostRuiter
Dear Briggs, Currently we allocated 1gig for JVM and Resin/Tomcat. Greetings, Joost Briggs wrote: > > How much memory are you currently allocating to the search servers? > > > > On 4/23/07, JoostRuiter <[EMAIL PROTECTED]> wrote: >> >> Hi All, >> >> First off, I'm quite the noob when it c

Re: [Nutch-dev] Creating a new scoring filter

2007-04-23 Thread Briggs
Yes. I too need to alter the score based on attributes and such of the particular url passed. May I ask what you have done? On 4/22/07, Lorenzo <[EMAIL PROTECTED]> wrote: Perfect! Now I have it working, and it performs quite well for a focused serch engine like ours! Do you think it could be a

Perfomance problems and segmenting

2007-04-23 Thread JoostRuiter
Hi All, First off, I'm quite the noob when it comes to Nutch, so don't bash me if the following is an enormously stupid question. We're using Nutch on a P4 Duo Core system (800mhz fsb) with 4gig RAM and a 500gig SATA (3gig/sec) HD. We indexed 350 000 pages into 1 segment of 15gig. Performance

Re: Perfomance problems and segmenting

2007-04-23 Thread Briggs
How much memory are you currently allocating to the search servers? On 4/23/07, JoostRuiter <[EMAIL PROTECTED]> wrote: Hi All, First off, I'm quite the noob when it comes to Nutch, so don't bash me if the following is an enormously stupid question. We're using Nutch on a P4 Duo Core system