rmine top ranking
hosts on a large scale.
-Original message-
> From:Eyeris RodrIguez Rueda
> Sent: Wednesday 20th May 2015 23:28
> To: user@nutch.apache.org
> Subject: Re: [MASSMAIL]Re: about boost field extremely high
>
> Thanks to all for your quick reply.
>
-- Original message -
From: "Markus Jelsma"
To: user@nutch.apache.org
Sent: Wednesday, 20 May 2015 16:53:26
Subject: RE: [MASSMAIL]Re: about boost field extremely high
Yes indeed. But it also makes sense to rely on Lucene's scoring algorithms and
custom boosting functions.
o use Nutch' LinkRank,
it is batch oriented but much more powerful.
-Original message-
> From:Julien Nioche
> Sent: Wednesday 20th May 2015 22:10
> To: user@nutch.apache.org
> Subject: Re: [MASSMAIL]Re: about boost field extremely high
>
> See https://issues.apach
> To: user@nutch.apache.org
> Sent: Wednesday, 20 May 2015 15:06:38
> Subject: [MASSMAIL]Re: about boost field extremely high
>
> Hi Eyeris
>
> The boost value is simply the output of what the ScoringFilters give for a
> document. Are you using OPIC?
>
> Julien
>
>
aware of possible intermittent problems with the
underlying commons-httpclient library.
- Original message -
From: "Julien Nioche"
To: user@nutch.apache.org
Sent: Wednesday, 20 May 2015 15:06:38
Subject: [MASSMAIL]Re: about boost field extremely high
Hi Eyeris
The boost value is simply the output of what the ScoringFilters give for a
document. Are you using OPIC?
Julien
On 20 May 2015 at 19:32, Eyeris RodrIguez Rueda wrote:
> Hi all.
> I'm using Nutch 1.9 in local mode and Solr 4.10 with half a million
> documents.
> An adaptive fetch sche
Hi all.
I'm using Nutch 1.9 in local mode and Solr 4.10 with half a million documents.
An adaptive fetch schedule is being used to crawl pages that change
frequently.
I have detected that Nutch is calculating an extremely high boost for some
documents and the document score in Solr is extremely