I've been following the Lucene/Solr community for a long time and have finally found (or rather, taken) the time to start implementing some of my ideas for improving it; this will be my first proposed patch.
I'm working on some changes to the Collector API that should significantly improve the performance of some use cases. The change may have a negative effect on other use cases, including memory usage, though I doubt it. Of course, such an effect would only be measurable for larger index sizes.

My question is: how can I best test this? Is there a common dataset or index that is used to verify that patches do not degrade search performance? I can do some testing on my own Wikipedia index, of course, but I guess that aligning with the performance tools you are using would be better.

Thank you, keep up the good work,
Anne

-- 
Anne Veling
BeyondTrees.com
+31 6 50 969 170
@anneveling