You could build your index using Lucene directly and then point a
Solr instance at it once its built. My suspicion is that the
overhead of forming a document as an XML string and posting to Solr
via HTTP won't be that much different than indexing with Lucene
directly.
My largest Solr index is currently at 1.4M and it takes a max of 3ms
to add a document (according to Solr's console), most of them 1ms.
My single threaded indexer is indexing around 1000 documents per
minute, but I think I can get this number even faster by
parallelizing the indexer.
I'm curious what rates others are indexing at ???
Erik
On Feb 20, 2007, at 2:21 AM, Jack L wrote:
Hello,
I have a question about solr's performance of accepting
inserts and indexing. If I have 10 million documents that
I'd like to index, I suppose it will take some time to
submit them to solr. Is there any faster way to do this
than through the web interface?
--
Best regards,
Jack
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com