Since this is a one-time problem, have you thought of just dumping all the
IDs and looking for dupes using sort and awk or something similar?
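
As a rough illustration of that idea, here is a minimal Python sketch that does the same job as a sort/awk pipeline. It assumes the IDs have already been exported to a text file, one per line, by whatever means is convenient; the file name ids.txt and the function name find_duplicate_ids are made up for this example and do not come from Solr.

```python
import sys
from collections import Counter
from typing import Dict, Iterable


def find_duplicate_ids(lines: Iterable[str]) -> Dict[str, int]:
    """Return each ID that occurs more than once, mapped to its count.

    Assumes one ID per line; blank lines are ignored.
    """
    counts = Counter(line.strip() for line in lines if line.strip())
    return {doc_id: n for doc_id, n in counts.items() if n > 1}


# Hypothetical usage: python find_dupes.py ids.txt
if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], encoding="utf-8") as fh:
        # Print "count<TAB>id" for every ID seen more than once.
        for doc_id, n in sorted(find_duplicate_ids(fh).items()):
            print(f"{n}\t{doc_id}")
```

This holds every distinct ID in memory, which should be fine for a one-off check on a moderately sized index; for a very large dump, sorting the file on disk first may be the better route.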

Michael Della Bitta



On Tue, Jul 30, 2013 at 2:38 PM, Dotan Cohen <dotanco...@gmail.com> wrote:

> On Tue, Jul 30, 2013 at 9:23 PM, Michael Della Bitta
> <michael.della.bi...@appinions.com> wrote:
> > Are you talking about the document's ID field?
> >
> > If so, you can't have duplicates... the latter document would overwrite
> > the earlier.
> >
> > If not, sorry for asking irrelevant questions. :)
> >
>
> In Solr 4.1 we were using overwrite=false&allowDups=false in order to
> discard the new document, not overwrite the extant document. We knew
> at the time that the features were deprecated, and apparently
> allowDups=false stopped working in 4.3. We are testing new solutions,
> but we need to identify the dupes to get them out.
>
> --
> Dotan Cohen
>
> http://gibberish.co.il
> http://what-is-what.com
>
