Hi Milan,
I'm not sure I understand exactly what you mean by external factor. I'll try to
explain what I do, and maybe that will address your issue.
First, FunctionQuery. Everything I do revolves around the FunctionQuery. I
strongly suggest reading the Javadocs. I found them to be helpful.
http://lucene.apache.org/solr/api/org/apache/solr/search/function/FunctionQuery.html
Here is how it works: some field in your documents contains a number. At
search time, that number is retrieved and a function is applied to it. You can
either supply the function yourself or use one of the pre-made functions
available with FunctionQuery. Let's say, just for fun, that you square the
value (that's most likely the wrong thing to do, but it's a very simple
function, and it works for an example). The value computed by the function, in
our example the square of the number stored in the field, is then used in the
scoring of the document. This is different from sorting, but it still lets you
influence the scoring based on criteria you set.
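As a rough illustration of the idea above, here is a minimal Python sketch. It
is not Solr's implementation. It assumes the function's value is simply added
to the text relevance score, and the names combined_score and weight are
hypothetical, chosen only for this example.

```python
# Minimal sketch of function-based score influence (not Solr's actual code).
# Assumption: the final score is the relevance score plus a weighted function
# of a numeric value stored in the document.

def square(value):
    """The toy function from the explanation: square the stored number."""
    return value * value

def combined_score(relevance, field_value, func=square, weight=1.0):
    """Add the weighted function value to the text relevance score.

    'weight' is a hypothetical knob that controls how strongly the stored
    number influences the final score.
    """
    return relevance + weight * func(field_value)

# A document with relevance 1.5 and a stored value of 3.0.
print(combined_score(1.5, 3.0))
```

With a small weight the stored number only nudges the ranking, and with a large
weight it dominates it, which is why the choice of function and weight matters.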
Now, you are talking about some external factor, and I'm not sure how you might
want to go about computing that. Here's how my specific situation works. Maybe
this will help you see how to approach an implementation for your own
situation. I have several different sections of a large website, let's call
them A, B, and C. Generally, results from C are more important than those from
A and B, and the results in A and B are equally important. So I give all the
documents in C some value, let's say 50, and A and B get a smaller value, let's
say 20. Those are values that I decided and are simply stored in my documents.
I'm not calculating them from anywhere. I'm just setting them. Now, results in
C get a boost, and usually results in C appear above results in A and B. But
if something from A or B has a really good score, it's still possible for it to
appear above results in C. It's just less likely than if I weren't using
FunctionQuery to influence my scores.
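The section example above can be sketched in a few lines of Python. This is
only an illustration under stated assumptions: the stored value is added to the
relevance score after scaling, and the weight of 0.01, the document ids, and
the relevance scores are all made up for the example.

```python
# Sketch of the A/B/C section boost (illustrative, not Solr's actual scoring).
# The stored values 50 and 20 come from the message; everything else here is
# a hypothetical choice for the example.
SECTION_VALUE = {"A": 20.0, "B": 20.0, "C": 50.0}

def rank(docs, weight=0.01):
    """Rank (doc_id, section, relevance) tuples by relevance plus section boost.

    'weight' is an assumed scaling factor that keeps the stored value from
    swamping the text relevance score.
    """
    scored = [
        (doc_id, relevance + weight * SECTION_VALUE[section])
        for doc_id, section, relevance in docs
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

docs = [
    ("a1", "A", 0.9),  # very strong text match in a less important section
    ("b1", "B", 0.4),
    ("c1", "C", 0.5),
    ("c2", "C", 0.3),  # weaker text match than b1, but in section C
]

for doc_id, score in rank(docs):
    print(doc_id, round(score, 2))
```

In this sketch c2 ends up above b1 even though its text match is weaker, while
a1 still comes first because its relevance is high enough to overcome the
smaller stored value.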
Hopefully that makes at least a small amount of sense. Again, I think the
Javadocs are valuable. I would suggest giving them a glance.
Have a good day,
Ann
----- Original Message ----
From: Milan Krendzelak <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Friday, June 22, 2007 11:21:17 AM
Subject: RE: How to score a paticular page higher than the other pages
Hi Ann, I would be really interested in custom boosting of documents by some
external factor.
For example, the external factor could be the score a web page gets from W3C
validation.
In this case, I would like to order search results not only by relevancy but
also by the validation score. I don't want to sort; I want the external factor
to influence the document boost.
I would appreciate any help or suggestions about this.
Thanks.
Milan Krendzelak
Senior Software Developer
dotMobi (mTLD Top Level Domain, Ltd.)
11 Exchange Place, IFSC, Dublin 1, Ireland
Phone: + 353.1.854.1100 Fax: +353.1.791.8569
________________________________
From: Annona Keene [mailto:[EMAIL PROTECTED]
Sent: Fri 22/06/2007 17:06
To: [EMAIL PROTECTED]
Subject: Re: How to score a paticular page higher than the other pages
Hi Harmesh,
I did something similar to this, and I can offer a few suggestions. I'm not
sure any of these is the *right* answer, but I've found them to be effective
for my purposes.
Have a field that you boost significantly for the page(s) you'd like higher in
the results. We have something like a keywords field where we put terms that
should bring up a page very high in the results. It's worked quite well.
Another thing we've done is use the FunctionQuery from Solr. (Though I believe
this might be part of Lucene 2.2.0 proper now. I'm not certain.) The API is
here:
http://lucene.apache.org/solr/api/org/apache/solr/search/function/FunctionQuery.html
This allows you to assign some numerical value in a field for your site, then
the score is influenced by the value. So in our case, we set high values for
the "important" pieces of the site and smaller values for less "important"
pieces.
If any of this was unclear, or I didn't actually answer your question, please
let me know. I've done a lot of "hacking" to influence the result rankings.
Have a great day,
Ann
----- Original Message ----
From: "Harmesh, V2solutions" <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Thursday, June 21, 2007 5:06:00 AM
Subject: How to score a paticular page higher than the other pages
Hi,
Can anyone help me score a particular page higher than the others?
For example, in my case I crawl forums, where threads are more important
than the other pages.
I want links like (http://forum.ottawagolf.com/showthread.php) at the
top of my results. Instead, I am getting links like
(http://forums.roadbikereview.com/forumdisplay.php), which I would prefer at
a lower priority.
Thanks in advance.
--
View this message in context:
http://www.nabble.com/How-to-score-a-paticular-page-higher-than-the-other-pages-tf3957718.html#a11230107
Sent from the Nutch - User mailing list archive at Nabble.com.
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general