No, I meant if the crawling url lists have http://www.abc.com and http://www.bcc.com, and both urls contains the term "hello". bbc.com also has a link that references ccc.com which also contains the term "hello" but it is not part of the crawling url lists.
So when I do a search on "hello", will Nutch return abc.com, bcc.com and ccc.com in default? If so, how to force Nutch to return both abc.com and bcc.com without ccc.com? Thanks. Zaheed Haque wrote: > > Hi > > You mean show results from a site http://abc.com only. If so you need > to turn on your index-more and query-more plugins in nutch-site.xml > then you need to use query like site:http://abc.com +query term or > url: .. I think its site not sure. > > Cheers > > On 8/31/06, victor_emailbox <[EMAIL PROTECTED]> wrote: >> >> Hi, >> I enter 10 urls in the url crawling list. Nutch does its thing to >> fetch >> and index them. How to I force Nutch to return search results that >> belongs >> to the url list? e.g. if the url crawling list has only >> http://www.abc.com >> and http://www.bcc.com, then all search result should be under either >> abc.com or bbc.com, not ccc.com even if bbc.com contains links referring >> to >> ccc.com. >> >> Many thanks. >> -- >> View this message in context: >> http://www.nabble.com/How-to-Make-Nutch-Return-Search-Results-Belonged-to-the-Crawl-URL-List--tf2194391.html#a6072986 >> Sent from the Nutch - User forum at Nabble.com. >> >> > > -- View this message in context: http://www.nabble.com/How-to-Make-Nutch-Return-Search-Results-Belonged-to-the-Crawl-URL-List--tf2194391.html#a6073242 Sent from the Nutch - User forum at Nabble.com. ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
