speed up indexing for example schema
------------------------------------

                 Key: SOLR-1859
                 URL: https://issues.apache.org/jira/browse/SOLR-1859
             Project: Solr
          Issue Type: Task
          Components: Schema and Analysis
            Reporter: Robert Muir
            Assignee: Robert Muir
             Fix For: 3.1


The example schema should use the Lucene core PorterStemmer (coded in Java by
Martin Porter) instead of the Snowball one, which is auto-generated code.

Although we have sped up the Snowball stemmer, it's still pretty slow, and the
example should be fast.

Below is the output of:

ant test -Dtestcase=TestIndexingPerformance -Dargs="-server -Diter=100000"

These results are consistent with large document indexing times that I have
seen on large English collections with Lucene, where we double indexing speed.

{noformat}
solr1.5branch:
iter=100000 time=5841 throughput=17120
iter=100000 time=5839 throughput=17126
iter=100000 time=6017 throughput=16619

trunk (unpatched):
iter=100000 time=4132 throughput=24201
iter=100000 time=4142 throughput=24142
iter=100000 time=4151 throughput=24090

trunk (patched):
iter=100000 time=2998 throughput=33355
iter=100000 time=3021 throughput=33101
iter=100000 time=3006 throughput=33266
{noformat}
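
To put the three sets of runs side by side, here is a minimal sketch that averages the reported throughput per build and computes the relative speedups. It assumes that `time` is in milliseconds and that `throughput` is iterations per second, truncated to an integer; the issue does not state this, but the reported figures are consistent with it. The names `RUNS`, `throughput`, `mean_throughput` and `speedup` are illustrative only and are not part of the Solr test.

```python
# (time, reported throughput) pairs copied from the benchmark output above.
RUNS = {
    "solr1.5branch": [(5841, 17120), (5839, 17126), (6017, 16619)],
    "trunk (unpatched)": [(4132, 24201), (4142, 24142), (4151, 24090)],
    "trunk (patched)": [(2998, 33355), (3021, 33101), (3006, 33266)],
}
ITERS = 100000  # value passed as -Diter


def throughput(iters, time_ms):
    """Iterations per second, truncated.

    Assumption: time is in milliseconds (not stated in the issue).
    """
    return iters * 1000 // time_ms


def mean_throughput(build):
    """Average of the reported throughput figures for one build."""
    runs = RUNS[build]
    return sum(reported for _, reported in runs) / len(runs)


def speedup(new_build, old_build):
    """Ratio of mean throughput between two builds."""
    return mean_throughput(new_build) / mean_throughput(old_build)


if __name__ == "__main__":
    for build in RUNS:
        print(f"{build}: mean throughput {mean_throughput(build):.0f}")
    print(f"patched vs unpatched trunk: "
          f"{speedup('trunk (patched)', 'trunk (unpatched)'):.2f}x")
    print(f"patched trunk vs 1.5 branch: "
          f"{speedup('trunk (patched)', 'solr1.5branch'):.2f}x")
```

Under these assumptions, the patched trunk comes out at about 1.38x the unpatched trunk and about 1.96x the 1.5 branch on this test.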

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
