The simplest and maybe best approach is to use the edismax query parser: query all terms using the OR operator, and use the pf, pf2, and pf3 parameters to boost phrases so that the closest matches rank higher.
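As a rough sketch of what such a request might look like, the Python snippet below builds the edismax parameters for a passage of suspect text. The field name `content`, the core name `docs`, the boost values, the slop value, and the helper names are all assumptions made for illustration, not anything taken from a particular setup; only the parameter names (`defType`, `q.op`, `qf`, `pf`, `pf2`, `pf3`, `ps`, `ps2`, `ps3`) are real edismax parameters.

```python
from urllib.parse import urlencode


def build_edismax_params(text, field="content", slop=2):
    """Return Solr request parameters for an OR query with phrase boosts.

    `field`, `slop`, and the boost factors are illustrative assumptions;
    tune them against your own schema and data.
    """
    return {
        "defType": "edismax",   # use the extended dismax query parser
        "q": text,              # the passage to check, as plain terms
        "q.op": "OR",           # any term may match; boosts handle ranking
        "qf": field,            # field searched for individual terms
        "pf": f"{field}^10",    # boost docs matching the whole query as a phrase
        "pf2": f"{field}^3",    # boost docs matching adjacent word pairs
        "pf3": f"{field}^5",    # boost docs matching adjacent word triples
        "ps": slop,             # slop allowed for the pf phrase
        "ps2": slop,            # slop allowed for pf2 pairs
        "ps3": slop,            # slop allowed for pf3 triples
        "fl": "id,score",
        "rows": 10,
    }


def build_select_url(base_url, params):
    """Append the URL-encoded parameters to a core's /select handler."""
    return f"{base_url}/select?{urlencode(params)}"


if __name__ == "__main__":
    params = build_edismax_params("the quick brown fox jumps over the lazy dog")
    # Hypothetical local core; replace with your own Solr URL.
    print(build_select_url("http://localhost:8983/solr/docs", params))
```

Text containing query-syntax characters (quotes, colons, parentheses and so on) would probably need escaping or stripping before being sent as `q`.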
No need to do any special indexing. You can also tune the ps, ps2, and ps3 (phrase slop) parameters to loosen how closely the terms must sit together for a phrase to get boosted.

-- Jack Krupansky

On Mon, Aug 10, 2015 at 1:54 AM, Roshan Agarwal <ros...@siddhast.com> wrote:

> Dear All,
>
> Can any one let us know how to implement plagiarism Checker with solr,
> how to index content with shingles and what to send in queries
>
> Roshan
>
> --
>
> Siddhast Ip innovation (P) ltd
> 907 chandra vihar colony
> Jhansi-284002
> M:+919871549769
> M:+917376314900