Hello All,
I am trying to find a better approach (performance-wise) to index
documents. The document count is roughly a million or more. At first
I thought of writing multiple threads that use CommonsHttpSolrServer
to submit documents. Later I found StreamingUpdateSolrServer, which
says we can forget about batching.

1) StreamingUpdateSolrServer takes a thread-count parameter. Does
that do exactly the same job as writing multiple threads that use
CommonsHttpSolrServer?

2) Also, is CommonsHttpSolrServer thread-safe?

3) In short, which of the two is better, and why: writing multiple
threads around CommonsHttpSolrServer, or adding every document
directly to StreamingUpdateSolrServer?

4) The queue size parameter: what would be a rough value for a
real-time application with a million+ documents to index?
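
To make questions 1 and 4 concrete, this is roughly how I picture the
thread count and queue size working together: a bounded queue that
the caller fills, drained by a fixed number of worker threads. It is
only a sketch using plain java.util.concurrent, not SolrJ code. The
class name, the indexAll method and the fake "doc-N" strings are all
hypothetical, and I am assuming (not asserting) that the streaming
server behaves along these lines.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;

public class QueueIndexingSketch {
    // Sentinel object telling a worker that no more documents will arrive.
    private static final String POISON = "__END__";

    // Pushes docCount fake documents through a bounded queue drained by
    // threadCount workers and returns how many documents the workers handled.
    static int indexAll(int docCount, int queueSize, int threadCount)
            throws InterruptedException {
        BlockingQueue<String> queue = new ArrayBlockingQueue<>(queueSize);
        AtomicInteger sent = new AtomicInteger();
        List<Thread> workers = new ArrayList<>();

        for (int i = 0; i < threadCount; i++) {
            Thread t = new Thread(() -> {
                try {
                    while (true) {
                        String doc = queue.take();
                        // Reference comparison on purpose: only the sentinel stops a worker.
                        if (doc == POISON) {
                            break;
                        }
                        // Stand-in for sending the document in an HTTP update request.
                        sent.incrementAndGet();
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
            t.start();
            workers.add(t);
        }

        // put() blocks while the queue is full, so queueSize caps how far
        // the producer can run ahead of the workers.
        for (int i = 0; i < docCount; i++) {
            queue.put("doc-" + i);
        }
        // One sentinel per worker so every thread terminates.
        for (int i = 0; i < threadCount; i++) {
            queue.put(POISON);
        }
        for (Thread t : workers) {
            t.join();
        }
        return sent.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(indexAll(1000, 100, 4)); // prints 1000
    }
}
```

If that picture is right, the queue size would mostly limit how many
documents are held in memory while waiting to be sent, rather than
how many can be indexed in total.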

I do not know these classes in depth, so please excuse anything I
have gotten wrong.

Thanks.
