On 15 Jun 2005, at 12:02, Bertrand Delacretaz wrote:
But can't we generate such a warning from a document attribute which says "this document has been imported automatically from a corpus which is known to contain a lot of cruft and needs to be cleaned up before we're happy about it" ?
I'm also leaning towards a boolean field which says reviewed=true|false for the document.
</Steven> -- Steven Noels http://outerthought.org/ Outerthought Open Source Java & XML stevenn at outerthought.org stevenn at apache.org