Hi, I'm indexing 500 XML files each ~150Mb on an 8 CPU machine.
I'm wondering what the best strategy for making maximum use of resources is. I have the tweaked the single process indexer to index 5000 records (not files) in memory before writing out to disk. Should i create an IndexThread and share the IndexWriter object across 5 threads..then monitor when one ends to start another, etc. Or should i create difference indexes then to a series of merges. any help would be appreciated, thanks, Marc Dumontier Bioinformatics Application Developer Blueprint Initiative Mount Sinai Hospital Toronto http://www.bind.ca