Robin Haswell wrote:
> Hey there
>
> I'm having issues searching with my newly (vastly) expanded database.
> Could anyone shed any light on this? Basically, on a newly started
> server, I search for "test", and this appears in catalina.out:
>
> 2006-12-20 10:51:40,710 INFO  NutchBean - creating new bean
> 2006-12-20 10:51:40,725 INFO  NutchBean - opening merged index in
> crawl/index
> 2006-12-20 10:51:40,871 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/nutch/apache-tomcat-5.5/webapps/ROOT/WEB-INF/classes/common-terms.utf8
> 2006-12-20 10:51:40,880 INFO  NutchBean - opening segments in
> crawl/segments
> 2006-12-20 10:51:40,898 INFO  SummarizerFactory - Using the first
> summarizer extension found: Basic Summarizer
> 2006-12-20 10:51:40,901 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2006-12-20 10:51:40,907 INFO  NutchBean - query request from
> 195.166.60.2
> 2006-12-20 10:51:40,925 INFO  NutchBean - query: test
> 2006-12-20 10:51:40,925 INFO  NutchBean - lang: en
> 2006-12-20 10:51:40,974 INFO  NutchBean - searching for 20 raw hits
> 2006-12-20 10:52:13,306 ERROR [jsp] - Servlet.service() for servlet jsp
> threw exception
> java.lang.OutOfMemoryError: Java heap space
>   

This is the problem - you need to increase the heap space in your 
Tomcat. Since you expanded you index, the bigger index won't fit in the 
same heap space as before ... especially when you run searches that 
touch more of the index, parts of it need to be loaded into memory - so 
this problem may not occur for searches that return only few results.

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to