I'm pretty sure Google gives priority to the words appearing in the title and URL.
I believe section 4.2.5 says this here:
http://citeseer.nj.nec.com/cache/papers/cs/13017/http:zSzzSzwww-db.stanford.eduzSzpubzSzpaperszSzgoogle.pdf/brin98anatomy.pdf
from here:
http://citeseer.nj.nec.com/brin98anatomy.html

So you have to have Lucene store the title as a separate field.

This is then what you'd have if, like me, you boost the title by 5 and the URL by 2 (the caret is the "boost" operator):

+(title:george^5.0 url:george^2.0 contents:george)
+(title:bush^5.0 url:bush^2.0 contents:bush)
+(title:white^5.0 url:white^2.0 contents:white)
+(title:house^5.0 url:house^2.0 contents:house)

-----Original Message-----
From: Ian Lea [mailto:[EMAIL PROTECTED]]
Sent: Saturday, February 23, 2002 8:15 AM
To: Lucene Users List
Subject: Re: Googlifying lucene querys

+george +bush +white +house

--
Ian.

Jari Aarniala wrote:
>
> Hello,
>
> Despite the confusing subject ;) my question is simple. I'm just
> trying out Lucene for the first time and would like to know how one
> would go about implementing the search on the index with the same logic
> that Google uses.
> For example, if the user input is "george bush white house", how
> do I easily construct a query that searches ALL of the words above? If I
> have understood correctly, passing the search string above to the
> queryParser creates a query that searches for ANY of the words above.
>
> Thanks for any help,

--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
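
For what it's worth, the boosted query in the reply above can be generated from the raw user input with a few lines of plain Java. This is only a sketch under some assumptions: the class and method names (BoostedQueryBuilder, build) are made up for illustration, the field names (title, url, contents) and the boosts (5.0 and 2.0) are simply the ones from the example, and it does not escape query-syntax special characters that a user might type. The resulting string is what you would then hand to the query parser.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: it only assembles a query string.
class BoostedQueryBuilder {

    // Boost values taken from the example in the reply; tune for your index.
    private static final String TITLE_BOOST = "5.0";
    private static final String URL_BOOST = "2.0";

    /**
     * Builds one required (+) clause per whitespace-separated word.
     * Inside each clause the word may match in title, url, or contents,
     * with title and url hits boosted via the caret operator.
     */
    static String build(String userInput) {
        List<String> clauses = new ArrayList<>();
        for (String term : userInput.trim().split("\\s+")) {
            if (term.isEmpty()) {
                continue; // blank input produces no clauses
            }
            clauses.add("+(title:" + term + "^" + TITLE_BOOST
                    + " url:" + term + "^" + URL_BOOST
                    + " contents:" + term + ")");
        }
        return String.join(" ", clauses);
    }

    public static void main(String[] args) {
        System.out.println(build("george bush white house"));
    }
}
```

The leading + on each group is what makes every word required (the "ALL of the words" behaviour asked about), while the boosts only affect ranking among the documents that match.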