[
https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir updated SOLR-1859:
------------------------------
Attachment: SOLR-1859.patch
attached is a patch. I fixed every instance for general types like "text"
in every schema file i could find, including test ones, and commented-out
instances, too. All tests pass.
> speed up indexing for example schema
> ------------------------------------
>
> Key: SOLR-1859
> URL: https://issues.apache.org/jira/browse/SOLR-1859
> Project: Solr
> Issue Type: Task
> Components: Schema and Analysis
> Reporter: Robert Muir
> Assignee: Robert Muir
> Fix For: 3.1
>
> Attachments: SOLR-1859.patch
>
>
> The example schema should use the lucene core PorterStemmer (coded in Java by
> Martin Porter)
> instead of the Snowball one that is auto-generated code.
> Although we have sped up the Snowball stemmer, its still pretty slow and the
> example should be fast.
> Below is the output of ant test -Dtestcase=TestIndexingPerformance
> -Dargs="-server -Diter=100000"
> These results are consistent with large document indexing times that I have
> seen on large english
> collections with Lucene, we double indexing speed.
> {noformat}
> solr1.5branch:
> iter=100000 time=5841 throughput=17120
> iter=100000 time=5839 throughput=17126
> iter=100000 time=6017 throughput=16619
> trunk (unpatched):
> iter=100000 time=4132 throughput=24201
> iter=100000 time=4142 throughput=24142
> iter=100000 time=4151 throughput=24090
> trunk (patched)
> iter=100000 time=2998 throughput=33355
> iter=100000 time=3021 throughput=33101
> iter=100000 time=3006 throughput=33266
> {noformat}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.