On 4/22/07, Traut <[EMAIL PROTECTED]> wrote:
I'm working on some presentation for my co-workers about Lucene/Solr index/search process. The question is why lucene (especially Solr) is such a fast engine?
I think some of it is the nature of open source. You have people who care, contributing patches to make things faster. There's no such thing as "well this meets requirements, now lets move on to the next feature checkbox".
Has it something to do with index file formats (http://lucene.apache.org/java/docs/fileformats.html)?
That's certainly the foundation. The scorers above that need to be fast too.
or is it all about using reverse indexing strategy?
AFAIK, full-text search products generally use an inverted index. So Solr is fast because Lucene is fast (and getting faster). Solr also adds a lot of smart configurable caching, and very fast set implementations that speed up filtering and faceting. -Yonik