speed up indexing for example schema
------------------------------------
Key: SOLR-1859
URL: https://issues.apache.org/jira/browse/SOLR-1859
Project: Solr
Issue Type: Task
Components: Schema and Analysis
Reporter: Robert Muir
Assignee: Robert Muir
Fix For: 3.1
The example schema should use the lucene core PorterStemmer (coded in Java by
Martin Porter)
instead of the Snowball one that is auto-generated code.
Although we have sped up the Snowball stemmer, its still pretty slow and the
example should be fast.
Below is the output of ant test -Dtestcase=TestIndexingPerformance
-Dargs="-server -Diter=100000"
These results are consistent with large document indexing times that I have
seen on large english
collections with Lucene, we double indexing speed.
{noformat}
solr1.5branch:
iter=100000 time=5841 throughput=17120
iter=100000 time=5839 throughput=17126
iter=100000 time=6017 throughput=16619
trunk (unpatched):
iter=100000 time=4132 throughput=24201
iter=100000 time=4142 throughput=24142
iter=100000 time=4151 throughput=24090
trunk (patched)
iter=100000 time=2998 throughput=33355
iter=100000 time=3021 throughput=33101
iter=100000 time=3006 throughput=33266
{noformat}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.