Here's my config for the updateProcessor. It not uses another signature method but i've used TextProfileSignature as well and it works - sort of.
<updateRequestProcessorChain name="dedupe"> <processor class="org.apache.solr.update.processor.SignatureUpdateProcessorFactory"> <bool name="enabled">true</bool> <str name="signatureField">sig</str> <bool name="overwriteDupes">true</bool> <str name="fields">content</str> <str name="signatureClass">org.apache.solr.update.processor.Lookup3Signature</str> </processor> <processor class="solr.LogUpdateProcessorFactory" /> <processor class="solr.RunUpdateProcessorFactory" /> </updateRequestProcessorChain> Of course, you must define the updateProcessor in your requestHandler, it's commented out in mine at the moment. <requestHandler name="/update" class="solr.XmlUpdateRequestHandler"> <!-- <lst name="defaults"> <str name="update.processor">dedupe</str> </lst> --> </requestHandler> Also, i see you define minTokenLen = 3. Where does that come from? Haven't seen anything on the wiki specifying such a parameter. On Tuesday 08 June 2010 19:45:35 Neeb wrote: > Hey Andrew, > > Just wondering if you ever managed to run TextProfileSignature based > deduplication. I would appreciate it if you could send me the code fragment > for it from solrconfig. > > I have currently something like this, but not sure if I am doing it right: > > <updateRequestProcessorChain name="dedupe"> > <processor > class="org.apache.solr.update.processor.SignatureUpdateProcessorFactory"> > <bool name="enabled">true</bool> > <str name="signatureField">signature</str> > <bool name="overwriteDupes">true</bool> > <str name="fields">title,author,abstract</str> > <str > name="signatureClass">org.apache.solr.update.processor.TextProfileSignature > </str> <str name="minTokenLen">3</str> > </processor> > <processor class="solr.LogUpdateProcessorFactory" /> > <processor class="solr.RunUpdateProcessorFactory" /> > </updateRequestProcessorChain> > > -- > > Thanks in advance, > -Ali > Markus Jelsma - Technisch Architect - Buyways BV http://www.linkedin.com/in/markus17 050-8536620 / 06-50258350