[ 
https://issues.apache.org/jira/browse/SOLR-13320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16791318#comment-16791318
 ] 

Shalin Shekhar Mangar edited comment on SOLR-13320 at 3/13/19 5:35 AM:
-----------------------------------------------------------------------

If the definition of duplicate is just having the same id then that can also be 
done today using optimistic concurrency. Use {{_version_}} with a negative 
value. See 
https://lucene.apache.org/solr/guide/6_6/updating-parts-of-documents.html#UpdatingPartsofDocuments-OptimisticConcurrency

If duplicate depends on the content of the document then you need to use the 
SignatureUpdateProcessorFactory


was (Author: shalinmangar):
If the definition of duplicate is just having the same id then that can also be 
done today using optimistic concurrency. Use `_version_` with a negative value. 
See 
https://lucene.apache.org/solr/guide/6_6/updating-parts-of-documents.html#UpdatingPartsofDocuments-OptimisticConcurrency

If duplicate depends on the content of the document then you need to use the 
SignatureUpdateProcessorFactory

> add a param ignoreDuplicates=true to updates to not overwrite existing docs
> ---------------------------------------------------------------------------
>
>                 Key: SOLR-13320
>                 URL: https://issues.apache.org/jira/browse/SOLR-13320
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Noble Paul
>            Assignee: Noble Paul
>            Priority: Major
>
> Updates should have an option to ignore duplicate documents and drop them if 
> an option  {{ignoreDuplicates=true}} is specified



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to