Chris Hostetter wrote:
in the case where doc boosts and field boosts aren't used, it seems like it would be very easy to write a maintenance app that did something like...

    get instance of similarity based on input
    foreach fieldName in input {
      int[] termCounts = new int[maxDoc];
      foreach Term in TermEnum for field {
        foreach TermDoc on that Term {
          termCounts[td.doc()] += td.freq()
        }
      }
      foreach doc {
        IndexReader.setNorm(doc, fieldName,
          similarity.encodeNorm(similarity.lengthNorm(fieldName, termCounts[doc])))
      }
    }

...does anyone see anything wrong with the overall approach?
Looks good to me.

Doug
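For readers who want to try the counting step on its own, here is a minimal, self-contained sketch of the loop structure in the quoted pseudocode. It deliberately does not use the Lucene API: `NormRecalcSketch`, `Posting`, `countTerms`, and `recomputeNorms` are hypothetical names, the in-memory map stands in for walking a TermEnum and its TermDocs, and the length norm is assumed to be 1/sqrt(numTerms). The byte encoding (`encodeNorm`) and the write-back (`IndexReader.setNorm`) from the pseudocode are left out.

```java
import java.util.Map;

/** Stand-in sketch of the norm recalculation loop; does not call Lucene. */
final class NormRecalcSketch {

    /** One posting: a document number and the term's frequency in that document. */
    static final class Posting {
        final int doc;
        final int freq;

        Posting(int doc, int freq) {
            this.doc = doc;
            this.freq = freq;
        }
    }

    /**
     * Sums term frequencies per document for one field, mirroring the nested
     * "foreach Term / foreach TermDoc" loops in the pseudocode.
     */
    static int[] countTerms(Map<String, Posting[]> postingsByTerm, int maxDoc) {
        int[] termCounts = new int[maxDoc];
        for (Posting[] postings : postingsByTerm.values()) {
            for (Posting p : postings) {
                termCounts[p.doc] += p.freq;
            }
        }
        return termCounts;
    }

    /** Assumed length norm for this sketch: 1/sqrt(numTerms). */
    static float lengthNorm(int numTerms) {
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    /**
     * Recomputes an unencoded norm per document. Documents with no terms in
     * the field are left at 0.0f, a choice made only for this sketch.
     */
    static float[] recomputeNorms(Map<String, Posting[]> postingsByTerm, int maxDoc) {
        int[] termCounts = countTerms(postingsByTerm, maxDoc);
        float[] norms = new float[maxDoc];
        for (int doc = 0; doc < maxDoc; doc++) {
            if (termCounts[doc] > 0) {
                norms[doc] = lengthNorm(termCounts[doc]);
            }
        }
        return norms;
    }
}
```

As in the original proposal, this only makes sense when doc boosts and field boosts aren't used, since the recomputed value is derived from term counts alone.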
