speed up indexing for example schema ------------------------------------ Key: SOLR-1859 URL: https://issues.apache.org/jira/browse/SOLR-1859 Project: Solr Issue Type: Task Components: Schema and Analysis Reporter: Robert Muir Assignee: Robert Muir Fix For: 3.1
The example schema should use the lucene core PorterStemmer (coded in Java by Martin Porter) instead of the Snowball one that is auto-generated code. Although we have sped up the Snowball stemmer, its still pretty slow and the example should be fast. Below is the output of ant test -Dtestcase=TestIndexingPerformance -Dargs="-server -Diter=100000" These results are consistent with large document indexing times that I have seen on large english collections with Lucene, we double indexing speed. {noformat} solr1.5branch: iter=100000 time=5841 throughput=17120 iter=100000 time=5839 throughput=17126 iter=100000 time=6017 throughput=16619 trunk (unpatched): iter=100000 time=4132 throughput=24201 iter=100000 time=4142 throughput=24142 iter=100000 time=4151 throughput=24090 trunk (patched) iter=100000 time=2998 throughput=33355 iter=100000 time=3021 throughput=33101 iter=100000 time=3006 throughput=33266 {noformat} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.