+1

Perhaps we want a "schema.xml" and a "schema-kitchen-sink.xml".

While it is good that the default is fast, we also want to make sure everything has a functioning example somewhere.


On Mar 7, 2009, at 10:12 AM, Yonik Seeley wrote:

I've occasionally run across people going with another search engine
because it was faster at indexing.
The example schema that people may be using as a base to do their
benchmarking (with perhaps minimal modifications) is slow.
There are many people out there that check what's fastest first, and
*then* check if it is satisfactory to meet their needs in other areas.

With very simple synthetic test documents (just a few fields each) and
the CSV loader, I've personally seen the indexing rate go from
~330 docs/sec to ~3000 docs/sec when I removed the default field values,
term vectors, copyFields, etc. The default example schema should still be
able to show how something can be done, but that doesn't mean it needs
to be enabled by default.
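
As a rough illustration of the idea above (a sketch only, not taken from the
actual example schema; the field names and types are assumptions), the costly
features could stay in the schema as commented-out examples:

```xml
<!-- Illustrative sketch only; field names and types are assumed -->
<fields>
  <!-- Plain field: no default value, no term vectors -->
  <field name="name" type="text" indexed="true" stored="true"/>

  <!-- Term vectors kept as an example, but disabled:
  <field name="includes" type="text" indexed="true" stored="true"
         termVectors="true" termPositions="true" termOffsets="true"/>
  -->

  <!-- Default field value kept as an example, but disabled:
  <field name="timestamp" type="date" indexed="true" stored="true"
         default="NOW" multiValued="false"/>
  -->
</fields>

<!-- copyField kept as an example, but disabled:
<copyField source="name" dest="text"/>
-->
```

Anyone who needs one of these features would uncomment the matching
declaration, so the schema still documents how it is done without paying
the indexing cost by default.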

So what do people think about speeding up the default/example schema before 1.4?

-Yonik
http://www.lucidimagination.com
