Hi,

I'm indexing 500 XML files, each about 150 MB, on an 8-CPU machine.

I'm wondering what the best strategy is for making maximum use of the machine's 
resources. I have tweaked the single-process indexer to index 5000 records (not 
files) in memory before writing out to disk.

Should I create an IndexThread class and share one IndexWriter object across 5 
threads, then monitor when one ends to start another, etc.? Or should I create 
separate indexes and then do a series of merges?
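
To make the first option concrete, here is a rough sketch of the shape I have in mind. It is not working indexer code: RecordSink is a made-up stand-in for the shared IndexWriter, the three-records-per-file loop is a placeholder for the real XML parsing, and it assumes the shared writer is safe to call from several threads, which I have not verified. A fixed thread pool takes care of starting the next file when a worker finishes.

```java
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical stand-in for the shared index writer; this is NOT Lucene's API.
// Assumption: implementations are safe to call from several threads at once.
interface RecordSink {
    void add(String record);
}

final class SharedWriterSketch {
    // Processes every file on a fixed pool of worker threads that share one sink.
    // Returns the total number of records handed to the sink.
    static int indexAll(List<String> files, RecordSink sink, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        AtomicInteger added = new AtomicInteger();
        for (String file : files) {
            // One task per file; an idle worker picks up the next queued file.
            pool.execute(() -> {
                // Placeholder for parsing: pretend each file yields three records.
                for (int i = 0; i < 3; i++) {
                    sink.add(file + "#" + i);
                    added.incrementAndGet();
                }
            });
        }
        pool.shutdown(); // accept no new tasks; queued files are still processed
        try {
            pool.awaitTermination(Long.MAX_VALUE, TimeUnit.NANOSECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while indexing", e);
        }
        return added.get();
    }
}
```

The second option would look the same except that each task would own its own sink, with a merge step after the pool drains.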

Any help would be appreciated.

Thanks,
Marc Dumontier
Bioinformatics Application Developer
Blueprint Initiative
Mount Sinai Hospital
Toronto
http://www.bind.ca
