Since this is a one-time problem, Have you thought of just dumping all the IDs and looking for dupes using sort and awk or something similar to that?
Michael Della Bitta Applications Developer o: +1 646 532 3062 | c: +1 917 477 7906 appinions inc. “The Science of Influence Marketing” 18 East 41st Street New York, NY 10017 t: @appinions <https://twitter.com/Appinions> | g+: plus.google.com/appinions<https://plus.google.com/u/0/b/112002776285509593336/112002776285509593336/posts> w: appinions.com <http://www.appinions.com/> On Tue, Jul 30, 2013 at 2:38 PM, Dotan Cohen <dotanco...@gmail.com> wrote: > On Tue, Jul 30, 2013 at 9:23 PM, Michael Della Bitta > <michael.della.bi...@appinions.com> wrote: > > Are you talking about the document's ID field? > > > > If so, you can't have duplicates... the latter document would overwrite > the > > earlier. > > > > If not, sorry for asking irrelevant questions. :) > > > > In Solr 4.1 we were using overwrite=false&allowDups=false in order to > discard the new document, not overwrite the extant document. We knew > at the time that the features were depreciated, and apparently > allowDups=false stopped working in 4.3. We are testing new solutions, > but we need to identify the dupes to get them out. > > -- > Dotan Cohen > > http://gibberish.co.il > http://what-is-what.com >