I have incorporated this requested change in a new patch that I attached to ticket https://issues.apache.org/jira/browse/MAHOUT-675.
It appears that the previous patch has already been applied. Should I repull the repo, make a new ticket, and create a new patch?

Thanks,
Chris

On Apr 18, 2011, at 1:54 PM, Ted Dunning wrote:

That sounds right to me. It might be plausible to blow an exception if a (configurable) large percentage of all documents have to be rejected. That is a minor improvement, though.

On Mon, Apr 18, 2011 at 10:52 AM, Christopher Jordan <[email protected]> wrote:

I believe, at least in my situation, a better approach is for the LuceneIterator to log a warning with the idField when it encounters a problem document and move onto the next one.
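
For reference, the behaviour discussed in this thread (log a warning with the document id, skip the problem document, and optionally fail if too many documents are rejected) could look roughly like the sketch below. This is not the code from the MAHOUT-675 patch and does not use the Lucene or Mahout APIs. The names `SkipBadDocs`, `Doc`, `collectVectors`, and `maxRejectedFraction` are hypothetical, and treating a "problem document" as one with a null term vector is an assumption made only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Logger;

// Illustrative sketch only: hypothetical names, not the Mahout LuceneIterator.
final class SkipBadDocs {
    private static final Logger LOG = Logger.getLogger(SkipBadDocs.class.getName());

    /** Hypothetical stand-in for an indexed document. */
    static final class Doc {
        final String id;           // plays the role of the idField value
        final double[] termVector; // null models a "problem document" (assumption)

        Doc(String id, double[] termVector) {
            this.id = id;
            this.termVector = termVector;
        }
    }

    /**
     * Collects the vectors of all usable documents. Problem documents are
     * logged with their id and skipped. If the fraction of rejected documents
     * exceeds maxRejectedFraction, an IllegalStateException is thrown.
     */
    static List<double[]> collectVectors(Iterable<Doc> docs, double maxRejectedFraction) {
        List<double[]> vectors = new ArrayList<>();
        int total = 0;
        int rejected = 0;
        for (Doc doc : docs) {
            total++;
            if (doc.termVector == null) {
                rejected++;
                LOG.warning("Skipping document with no term vector, id=" + doc.id);
                continue; // move on to the next document instead of failing
            }
            vectors.add(doc.termVector);
        }
        // The configurable threshold suggested in the reply above.
        if (total > 0 && (double) rejected / total > maxRejectedFraction) {
            throw new IllegalStateException(
                rejected + " of " + total + " documents were rejected");
        }
        return vectors;
    }
}
```

In this sketch the threshold is checked once after the full pass, which keeps the loop simple. A real iterator would probably need to check it incrementally, since it does not know the total document count up front.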
