RE: [MASSMAIL]Re: about boost field extremely high

2015-05-20 Thread Markus Jelsma
rmine top ranking hosts on a large scale. -Original message- > From: Eyeris RodrIguez Rueda > Sent: Wednesday 20th May 2015 23:28 > To: user@nutch.apache.org > Subject: Re: [MASSMAIL]Re: about boost field extremely high > > Thanks to all for your quick replies. >

Re: [MASSMAIL]Re: about boost field extremely high

2015-05-20 Thread Eyeris RodrIguez Rueda
-- Original message - From: "Markus Jelsma" To: user@nutch.apache.org Sent: Wednesday, 20 May 2015 16:53:26 Subject: RE: [MASSMAIL]Re: about boost field extremely high Yes indeed. But it also makes sense to rely on Lucene's scoring algorithms and custom boosting functions.

RE: [MASSMAIL]Re: about boost field extremely high

2015-05-20 Thread Markus Jelsma
o use Nutch's LinkRank; it is batch-oriented but much more powerful. -Original message- > From: Julien Nioche > Sent: Wednesday 20th May 2015 22:10 > To: user@nutch.apache.org > Subject: Re: [MASSMAIL]Re: about boost field extremely high > > See https://issues.apach

Re: [MASSMAIL]Re: about boost field extremely high

2015-05-20 Thread Julien Nioche
: user@nutch.apache.org > Sent: Wednesday, 20 May 2015 15:06:38 > Subject: [MASSMAIL]Re: about boost field extremely high > > Hi Eyeris > > The boost value is simply the output of what the ScoringFilters give for a > document. Are you using OPIC? > > Julien > >

Re: [MASSMAIL]Re: about boost field extremely high

2015-05-20 Thread Eyeris RodrIguez Rueda
aware of possible intermittent problems with the underlying commons-httpclient library. - Original message - From: "Julien Nioche" To: user@nutch.apache.org Sent: Wednesday, 20 May 2015 15:06:38 Subject: [MASSMAIL]Re: about boost field extremely high Hi Eyeris

Re: about boost field extremely high

2015-05-20 Thread Julien Nioche
Hi Eyeris The boost value is simply the output of what the ScoringFilters give for a document. Are you using OPIC? Julien On 20 May 2015 at 19:32, Eyeris RodrIguez Rueda wrote: > Hi all. > I'm using Nutch 1.9 in local mode and Solr 4.10 with half a million > documents. > An adaptive fetch sche

about boost field extremely high

2015-05-20 Thread Eyeris RodrIguez Rueda
Hi all. I'm using Nutch 1.9 in local mode and Solr 4.10 with half a million documents. An adaptive fetch schedule is being used to crawl pages that change frequently. I have detected that Nutch is calculating an extremely high boost for some documents and the document score in Solr is extremely
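
To illustrate Julien's point that the boost is simply whatever the scoring filters output, here is a toy sketch of an OPIC-style update. It is not Nutch's actual OPIC code: the function names, the link graph, and the rule "each page hands an equal share of its score to every outlink on every round" are assumptions made for illustration only. Under those assumptions, a page that is linked from frequently refetched pages keeps accumulating score, which is one plausible way a boost could grow very large.

```python
def opic_round(scores, links):
    """One toy update round (hypothetical model, not Nutch's implementation).

    Every page splits its current score equally among its outlinks,
    and each target adds the received share to its own score.
    """
    new_scores = dict(scores)
    for page, outlinks in links.items():
        if not outlinks:
            continue  # a page without outlinks distributes nothing
        share = scores[page] / len(outlinks)
        for target in outlinks:
            new_scores[target] = new_scores.get(target, 0.0) + share
    return new_scores


def simulate(links, rounds, initial=1.0):
    """Run the toy update for a number of rounds (think: refetch cycles)."""
    scores = {page: initial for page in links}
    for _ in range(rounds):
        scores = opic_round(scores, links)
    return scores


if __name__ == "__main__":
    # Hypothetical three-page site: "home" links to "a" and "b", both link back.
    links = {"home": ["a", "b"], "a": ["home"], "b": ["home"]}
    for rounds in (1, 2, 10):
        print(rounds, simulate(links, rounds))
```

In this model the scores never decay, so every extra round raises them. That only shows the accumulation mechanism, not what any particular Nutch version does.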