Add support for hash based exact/near duplicate document handling
-----------------------------------------------------------------
Key: SOLR-799
URL: https://issues.apache.org/jira/browse/SOLR-799
Project: Solr
Issue Type: New Feature
Components: update
Reporter: Mark Miller
Priority: Minor
Hash-based duplicate document detection is efficient and allows for blocking
duplicates as well as field collapsing. Let's put it into Solr.
http://wiki.apache.org/solr/Deduplication
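
As a rough illustration of the idea, here is a minimal sketch of exact-duplicate
detection by hashing. It is not the proposed Solr implementation: the class
DedupSketch, its methods, and the choice of MD5 over a caller-supplied list of
fields are all assumptions made for the example. A document is reduced to a
signature, and a document whose signature has already been seen is blocked.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Solr.
class DedupSketch {
    private final List<String> fields;                 // fields that feed the signature
    private final Set<String> seen = new HashSet<>();  // signatures accepted so far

    DedupSketch(List<String> fields) {
        this.fields = fields;
    }

    // Exact-duplicate signature: MD5 over the chosen field values, hex-encoded.
    static String signature(Map<String, String> doc, List<String> fields) {
        try {
            MessageDigest md = MessageDigest.getInstance("MD5");
            for (String f : fields) {
                String v = doc.getOrDefault(f, "");
                md.update(v.getBytes(StandardCharsets.UTF_8));
                md.update((byte) 0); // separator so "ab"+"c" differs from "a"+"bc"
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // MD5 is required on every Java platform, so this is not expected.
            throw new IllegalStateException(e);
        }
    }

    // Blocking: returns true if the document is accepted, false if a document
    // with the same signature was accepted earlier.
    boolean accept(Map<String, String> doc) {
        return seen.add(signature(doc, fields));
    }
}
```

In this sketch, near-duplicate detection would mean swapping MD5 for a fuzzy
signature that maps similar text to the same value, and field collapsing would
mean storing the signature in a field and grouping on it instead of rejecting
the document.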