Hi Milan,

I'm not sure I understand exactly what you mean by external factor. I'll try to 
explain what I do, and maybe that will address your issue.

First, FunctionQuery. Everything I do revolves around the FunctionQuery. I 
strongly suggest reading the Javadocs. I found them to be helpful. 
http://lucene.apache.org/solr/api/org/apache/solr/search/function/FunctionQuery.html

How it works is you have some field in your documents that contains a number.
At search time, that number is retrieved, and some function is applied to it.
You can either supply the function or use one of the pre-made functions
available with FunctionQuery. Let's say, just for fun, that you square the
value (that's most likely the wrong thing to do, but it's a very simple
function, and it works for an example). The value computed by the function, in
our example the square of the value stored in a field in your document, is
then used in the scoring of the page. This is different from sorting, but it
still lets you influence the scoring based on criteria you set.
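
To make the idea concrete, here is a minimal sketch in plain Python. It is not
the Solr FunctionQuery API. It only mimics the concept, and it assumes the
function's output is simply added to the text relevance score, which is a
simplification of what Lucene/Solr actually does. The document ids, relevance
scores, and field values are made up for illustration.

```python
# Illustrative only: NOT the Solr FunctionQuery API.
# Assumption: the function value is added to the relevance score.

def square(value):
    """The toy function from the example: square the stored field value."""
    return value * value


def combined_score(relevance, field_value, func=square):
    """Combine a text-relevance score with a function of a stored number."""
    return relevance + func(field_value)


# Hypothetical documents: (id, relevance score, stored numeric field value)
docs = [
    ("doc1", 3.0, 1.0),
    ("doc2", 2.5, 2.0),
    ("doc3", 4.0, 0.5),
]

# Rank by the combined score, highest first.
ranked = sorted(docs, key=lambda d: combined_score(d[1], d[2]), reverse=True)

for doc_id, relevance, value in ranked:
    print(doc_id, combined_score(relevance, value))
```

Here doc2 has the lowest relevance but the largest stored value, so it ends up
first. The stored number shifts the ranking without replacing relevance
altogether.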

Now, you are talking about some external factor, and I'm not sure how you would 
compute that. Here's how my specific situation works. Maybe this will help you 
work out an implementation for your situation. I have several different 
sections of a large website, let's call 
them A, B, and C.  Generally, results from C are more important than those from 
A and B, and the results in A and B are equally important. So I give all the 
documents in C some value, let's say 50, and A and B get a smaller value, let's 
say 20.  Those are values that I decided and are simply stored in my documents. 
I'm not calculating them from anywhere. I'm just setting them.  Now, results in 
C get a boost, and usually results in C appear above results in A and B.  But 
if something from A or B has a really good score, it's still possible for it to 
appear above results in C.  It's just less likely than if I weren't using 
FunctionQuery to influence my scores.
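
As a rough sketch of that setup in plain Python (again, not the Solr API): the
section values 50 and 20 are the ones mentioned above, while the page names,
the relevance numbers, and the simple additive combination are assumptions made
only for illustration.

```python
# Illustrative only: NOT the Solr FunctionQuery API.
# Section values as described above: C is favored over A and B.
SECTION_VALUE = {"A": 20.0, "B": 20.0, "C": 50.0}


def boosted_score(relevance, section):
    """Assumption: the stored section value is added to the relevance score."""
    return relevance + SECTION_VALUE[section]


# Hypothetical pages: (name, section, relevance score on a made-up scale)
pages = [
    ("page-in-C", "C", 10.0),
    ("page-in-A", "A", 15.0),
    ("strong-page-in-B", "B", 45.0),
]

# Ranking with the section boost applied, highest first.
results = sorted(pages, key=lambda p: boosted_score(p[2], p[1]), reverse=True)

# Ranking by relevance alone, for comparison.
unboosted = sorted(pages, key=lambda p: p[2], reverse=True)

for name, section, relevance in results:
    print(name, boosted_score(relevance, section))
```

With the boost, the page in C moves ahead of the page in A even though its
relevance is lower, but the page in B with a really good relevance score still
stays on top.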

Hopefully that makes at least a small amount of sense. Again, I think the 
Javadocs are valuable. I would suggest giving them a glance.  

Have a good day,
Ann

----- Original Message ----
From: Milan Krendzelak <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Friday, June 22, 2007 11:21:17 AM
Subject: RE: How to score a paticular page higher than the other pages

Hi Ann, I would be really interested in custom boosting of documents by 
some external factor.
For example, the external factor could be a web page's W3C validation 
score.
In that case, I would like to display search results not only according to 
relevancy but also according to the validation score. And I don't want to sort, 
but to influence the document boost with the external factor.
 
I would appreciate any help or suggestions about this.
Thanks.
 
Milan Krendzelak

Senior Software Developer
dotMobi (mTLD Top Level Domain, Ltd.)
11 Exchange Place, IFSC, Dublin 1, Ireland
Phone: + 353.1.854.1100 Fax: +353.1.791.8569

________________________________

From: Annona Keene [mailto:[EMAIL PROTECTED]
Sent: Fri 22/06/2007 17:06
To: [EMAIL PROTECTED]
Subject: Re: How to score a paticular page higher than the other pages



Hi Harmesh,

I did something similar to this, and I can offer a few suggestions. I'm not 
sure any of these is the *right* answer, but I've found it to be effective for 
my purposes.

Have a field that you boost significantly for the page(s) you'd like higher in 
the results. We have something like a keywords field where we put terms that 
should bring up a page very high in the results. It's worked quite well.

Another thing we've done is use the FunctionQuery from Solr. (Though I believe 
this might be part of Lucene 2.2.0 proper now. I'm not certain.)  The API is 
here: 
http://lucene.apache.org/solr/api/org/apache/solr/search/function/FunctionQuery.html
This allows you to assign some numerical value in a field for your site, then 
the score is influenced by the value. So in our case, we set high values for 
the "important" pieces of the site and smaller values for less "important" pieces.

If any of this was unclear, or I didn't actually answer your question, please 
let me know. I've done a lot of "hacking" to influence the result rankings.

Have a great day,
Ann

----- Original Message ----
From: "Harmesh, V2solutions" <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Thursday, June 21, 2007 5:06:00 AM
Subject: How to score a paticular page higher than the other pages


Hi,
  Can anyone help me score a particular page higher than the others?
For example:

         In my case I crawl through forums, where threads are more important
than the other pages.
I want links like (http://forum.ottawagolf.com/showthread.php) at the
top of my results. Instead, I am getting links like
(http://forums.roadbikereview.com/forumdisplay.php), which I would prefer at
lower priority.
Thanks in advance.

--
View this message in context: 
http://www.nabble.com/How-to-score-a-paticular-page-higher-than-the-other-pages-tf3957718.html#a11230107
Sent from the Nutch - User mailing list archive at Nabble.com.

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
