How to disable document boosting?

2010-11-05 Thread Matthias Paul
Hi, how can I disable document boosting? I'm indexing to Solr and don't need document boosts (as they boost html-pages much more than other content which I have in Solr). I found that I have the scoring-opic plugin registered, is this the problem? Thanks Matthias

Re: Updates of websites

2010-11-05 Thread Chris
Yes .. that looks good - there is a white list for enterprise searches. Sounds exactly as one part I need. How about the other? Is there a way of doing a diff between two versions? Do you know that? Am 05.11.2010 13:49, schrieb Eric Martin: I know urlfilter will allow you to specify domain

Re: Crawling some specific url avoiding other urls

2010-11-05 Thread Edward Drapkin
On 11/5/2010 10:37 PM, nitin hardeniya wrote: dear All, I am using nutch for crawling all the user reviews on a page of IMDB .the url will be http://www.imdb.com/title/tt1375666/usercomments http://www.imdb.com/title/tt1375666/usercomments?start=50 I want to crawl all these with only user