RE: Halloween Joke at Google

2005-11-02 Thread Byron Miller
I wish it did have something to do with halloween :)

Google tells no lies! :P

--- Nick Lothian [EMAIL PROTECTED] wrote:

 If you just do the search you'll see a link at the
 side of the page:
 
 Why these results?
 These results may seem politically
 slanted. Here's what happened.
 www.google.com/googleblog
 
 which links to

http://googleblog.blogspot.com/2005/09/googlebombing-failure.html
 
 This particular Google Bomb has been around for
 quite a while. See
 http://en.wikipedia.org/wiki/Google_bomb (and has
 nothing to do with
 Halloween!)
 
 Nick 



RE: Halloween Joke at Google

2005-11-02 Thread Byron Miller
Actually, to add fuel to the fire, using Nutch out of
the box, searching for "miserable failure" yields the
same thing.

http://www.mozdex.com/search.jsp?query=miserablefailure

--- Fuad Efendi [EMAIL PROTECTED] wrote:

 Thanks Nick,
 
 So this is why some search engines are not honest. I
 mean the commercial
 policy of putting links on top of a search for extra
 money.
 
 
 



Re: Halloween Joke at Google

2005-11-02 Thread Andrzej Bialecki

Byron Miller wrote:


Actually, to add fuel to the fire, using Nutch out of
the box, searching for "miserable failure" yields the
same thing.

http://www.mozdex.com/search.jsp?query=miserablefailure

 



I'm curious... could you check if the anchors come from the same site, 
or from different sites? Do you run with 
fetchlist.score.by.link.count=true and indexer.boost.by.link.count=true?


Anyway, that's how the PageRank is _supposed_ to work - it should give a 
higher score to sites that are highly linked, and also it should 
strongly consider the anchor text as an indication of the page's true 
subject ... ;-)


--
Best regards,
Andrzej Bialecki 
___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com
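
To make the anchor-text effect discussed above concrete, here is a minimal sketch. It is not Nutch's or Google's actual scoring: the function names, the log(e + n) formula, and the example URLs are all assumptions made up for illustration. It scores a page for a query using only the anchor text of inbound links, and it groups inbound anchors by source host, which is one way to answer the "same site or different sites" question.

```python
import math
from collections import defaultdict
from urllib.parse import urlparse

# Hypothetical link data: (source_url, target_url, anchor_text).
LINKS = [
    ("http://a.example/1", "http://target.example/bio", "miserable failure"),
    ("http://a.example/2", "http://target.example/bio", "miserable failure"),
    ("http://b.example/", "http://target.example/bio", "Miserable Failure"),
    ("http://c.example/", "http://other.example/", "failure analysis"),
]


def anchor_scores(links, query):
    """Score target pages for `query` using only inbound anchor text.

    A page scores log(e + n), where n is the number of distinct source
    hosts whose anchor text contains every query term. The formula is
    an assumption chosen for illustration.
    """
    terms = query.lower().split()
    hosts = defaultdict(set)
    for source, target, anchor in links:
        words = anchor.lower().split()
        if all(term in words for term in terms):
            hosts[target].add(urlparse(source).netloc)
    return {target: math.log(math.e + len(h)) for target, h in hosts.items()}


def anchor_sources(links, target):
    """Count inbound anchors for `target`, grouped by source host."""
    counts = defaultdict(int)
    for source, dest, _anchor in links:
        if dest == target:
            counts[urlparse(source).netloc] += 1
    return dict(counts)


if __name__ == "__main__":
    print(anchor_scores(LINKS, "miserable failure"))
    print(anchor_sources(LINKS, "http://target.example/bio"))
```

Note that the target page never has to contain the query words itself. Enough matching anchors from other pages are sufficient to make it rank, which is what a Google bomb exploits.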




Re: Halloween Joke at Google

2005-11-02 Thread Byron Miller
We run with

fetchlist.score.by.link.count=true and
indexer.boost.by.link.count=true

We haven't run a stand-alone analyze, so the scores are
whatever the database gets when we run updatedb (per the
recommendations a few months back, when the results were
found to be pretty darn close!).

Even though my scale is still much smaller than
Google's, it is amazing how closely the results can
match!

Makes you wonder just how much of the net is useful
;)

-byron






Re: Halloween Joke at Google

2005-11-02 Thread Michael Ji
Hi Byron,

Did you run LinkAnalysisTool to update the scores in the
fetched segment? I guess that gives the most accurate
PageRank score; otherwise, Nutch calculates the score in
IndexSegment.java based on the number of anchor links
for the source page.

Michael Ji,
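
The difference between the two scoring approaches mentioned above can be sketched roughly as follows. This is an illustration only, not the code in IndexSegment.java or LinkAnalysisTool: the function names, the log(e + n) boost, the damping factor, and the four-page graph are all assumptions. One function boosts a page by its raw inbound link count; the other runs a plain power-iteration PageRank over the whole graph.

```python
import math

# Hypothetical link graph: page -> pages it links to.
GRAPH = {"a": ["c"], "b": ["c"], "c": ["d"], "d": []}


def invert(outlinks):
    """Turn page -> targets into page -> set of source pages."""
    inlinks = {}
    for src, targets in outlinks.items():
        inlinks.setdefault(src, set())
        for target in targets:
            inlinks.setdefault(target, set()).add(src)
    return inlinks


def link_count_boost(inlinks):
    """Boost from the raw number of inbound links (assumed log formula)."""
    return {page: math.log(math.e + len(srcs)) for page, srcs in inlinks.items()}


def pagerank(outlinks, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over page -> list of targets."""
    pages = set(outlinks) | {t for ts in outlinks.values() for t in ts}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = outlinks.get(page, [])
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new[target] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                for p in pages:
                    new[p] += damping * rank[page] / n
        rank = new
    return rank


if __name__ == "__main__":
    print(link_count_boost(invert(GRAPH)))
    print(pagerank(GRAPH))
```

On this toy graph the two measures disagree: "c" has the most inbound links, but "d" ends up with the higher PageRank because it inherits all of the rank that "c" collects. On a real crawl the two can still come out close, as reported earlier in the thread.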




