[ 
https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12852375#action_12852375
 ] 

Robert Muir commented on SOLR-1859:
-----------------------------------

Any objections? If not, I would like to commit this later today.

Thanks!

> speed up indexing for example schema
> ------------------------------------
>
>                 Key: SOLR-1859
>                 URL: https://issues.apache.org/jira/browse/SOLR-1859
>             Project: Solr
>          Issue Type: Task
>          Components: Schema and Analysis
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>             Fix For: 3.1
>
>         Attachments: SOLR-1859.patch
>
>
> The example schema should use the Lucene core PorterStemmer (coded in Java by
> Martin Porter) instead of the Snowball one, which is auto-generated code.
> Although we have sped up the Snowball stemmer, it's still pretty slow, and the
> example should be fast.
> Below is the output of:
> ant test -Dtestcase=TestIndexingPerformance -Dargs="-server -Diter=100000"
> These results are consistent with the large-document indexing times that I
> have seen on large English collections with Lucene, where we double indexing
> speed.
> {noformat}
> solr1.5branch:
> iter=100000 time=5841 throughput=17120
> iter=100000 time=5839 throughput=17126
> iter=100000 time=6017 throughput=16619
> trunk (unpatched):
> iter=100000 time=4132 throughput=24201
> iter=100000 time=4142 throughput=24142
> iter=100000 time=4151 throughput=24090
> trunk (patched):
> iter=100000 time=2998 throughput=33355
> iter=100000 time=3021 throughput=33101
> iter=100000 time=3006 throughput=33266
> {noformat}
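
For anyone comparing the numbers above, here is a minimal sketch of the arithmetic. It assumes that throughput is iterations per second, truncated to an integer, and that time is in milliseconds. That assumption reproduces all nine reported values, but it is inferred from the output, not taken from the test source. The function names are made up for illustration.

```python
# Timings in ms, copied from the results above; every run used iter=100000.
ITERATIONS = 100_000
RUNS_MS = {
    "solr1.5branch": [5841, 5839, 6017],
    "trunk (unpatched)": [4132, 4142, 4151],
    "trunk (patched)": [2998, 3021, 3006],
}


def throughput(iterations, elapsed_ms):
    """Assumed formula: iterations per second, truncated to an integer."""
    return iterations * 1000 // elapsed_ms


def mean_throughput(label):
    """Average throughput over the three runs recorded for one build."""
    times = RUNS_MS[label]
    return sum(throughput(ITERATIONS, t) for t in times) / len(times)


def speedup(new_label, old_label):
    """Ratio of mean throughputs between two builds."""
    return mean_throughput(new_label) / mean_throughput(old_label)


if __name__ == "__main__":
    for label in RUNS_MS:
        print(label, round(mean_throughput(label)))
    print("patched vs unpatched trunk:",
          round(speedup("trunk (patched)", "trunk (unpatched)"), 2))
    print("patched trunk vs 1.5 branch:",
          round(speedup("trunk (patched)", "solr1.5branch"), 2))
```

Under that assumption, patched trunk comes out at roughly 1.4x unpatched trunk and just under 2x the 1.5 branch on this test.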

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
