But in that case, I assume Solr does a commit per document added.
Lets say I wanted to index a collection of 1 million pages, would it
take much longer if I comited at each insertion rather than comiting at
the end?
Daniel Shane
Grant Ingersoll wrote:
On Aug 13, 2009, at 10:33 AM, Daniel Shane wrote:
Does anyone have an idea on how I could check an index that is in the
process of being indexed (things added, things deleted) for the
uniquess of a given field *at the time I index a document* ?
Solr has de-duplication built-in at indexing time:
http://wiki.apache.org/solr/Deduplication
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org