Hello.
On Thu, 11 Sep 2014 14:29:49 -0400, Evan Ward wrote:
Hi,
A while ago I brought up the idea of adding residual editing (aka data editing, outlier rejection, robust regression) to our non-linear least squares implementations.[1] As the name suggests, the idea is to de-weight observations that don't match the user's model. There are several ways to do this, including choosing a fixed cutoff, a fixed standard-deviation cutoff, or reducing a residual's weight based on its magnitude.[2]
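To make the three schemes concrete, here is a small sketch in Java. The class and method names are my own for illustration; they are not part of the Commons Math API, and the magnitude-based scheme is shown here as a Huber-style weight, which is one common choice among several.

```java
/** Sketch of three residual-editing schemes (hypothetical names, not Commons Math API). */
public class ResidualEditing {

    /** Fixed cutoff: weight 0 if |r| exceeds the cutoff, else 1. */
    public static double fixedCutoffWeight(double residual, double cutoff) {
        return Math.abs(residual) > cutoff ? 0.0 : 1.0;
    }

    /** Fixed standard-deviation cutoff: reject residuals beyond k sigma. */
    public static double sigmaCutoffWeight(double residual, double sigma, double k) {
        return Math.abs(residual) > k * sigma ? 0.0 : 1.0;
    }

    /** Magnitude-based de-weighting, here a Huber-style weight: full weight
     *  inside the threshold, decaying as threshold/|r| outside it. */
    public static double huberWeight(double residual, double threshold) {
        double a = Math.abs(residual);
        return a <= threshold ? 1.0 : threshold / a;
    }
}
```

The first two schemes are hard accept/reject decisions; the third de-weights smoothly, which tends to behave better when residuals sit near the boundary.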
However we add the data editing feature, I think it will cause backward incompatibilities with the released API. I've outlined below the two options I see. I'm open to other ideas as well.
1. Replace edited residuals with 0's in the residual vector and Jacobian (i.e. apply a 0 weight). This has the advantage of being simple to implement, and our existing optimizers are already able to handle it. The downside is evident when the user tries to obtain the number of residuals that were edited: it is hard to tell the difference between an edited residual, an a priori zero weight, and a model evaluation where the residual and gradient are, in fact, zero. We can provide easy access to the number of edited residuals by adding a method to the Evaluation interface. (This is what I implemented in the patch in the original thread.) Now that the code has been released, though, this would cause a backward incompatibility for some advanced users. Most users will likely use the included factory and builder methods to define their LeastSquaresProblem (LSP), and those users would not be affected by the change. Only users that provide a custom implementation of LSP.Evaluation would be affected.
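A toy sketch of option 1 (hypothetical helper class, not the actual LSP.Evaluation interface): edited residuals are zeroed in place, and the count that the new Evaluation method would expose is tracked alongside.

```java
/** Option 1 sketch: zero out edited residuals and track the count
 *  (hypothetical helper, not the actual LSP.Evaluation interface). */
public class ZeroWeightEdit {
    private final double[] residuals;
    private int editedCount;

    public ZeroWeightEdit(double[] residuals) {
        this.residuals = residuals.clone();
    }

    /** Zero every residual whose magnitude exceeds the cutoff; the same
     *  zeroing would be applied row-wise to the Jacobian. */
    public void edit(double cutoff) {
        editedCount = 0;
        for (int i = 0; i < residuals.length; i++) {
            if (Math.abs(residuals[i]) > cutoff) {
                residuals[i] = 0.0;
                editedCount++;
            }
        }
    }

    /** The extra accessor that option 1 would add to the Evaluation interface. */
    public int getEditedCount() {
        return editedCount;
    }

    public double[] getResiduals() {
        return residuals.clone();
    }
}
```

Without the extra accessor, a zero in getResiduals() is ambiguous, which is exactly the downside described above.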
2. Remove edited residuals from the residual vector and Jacobian, so that the resulting vector and matrix have fewer rows. The advantage here is that the user can compare the length of the residual vector in the Optimum to the number of observations in the LSP to determine the number of edited residuals. The problem is that returning vectors/matrices with different sizes from LSP.evaluate() would violate the contract. Additionally, we would have to modify our existing optimizers to deal with the variable lengths. For GaussNewton the modification would be small, but for LevenbergMarquardt I would likely have to re-write it, since I don't understand the code (not for lack of trying :P ). Users that implement LeastSquaresOptimizer would likely have to modify their code as well.
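A toy sketch of option 2 (again a hypothetical helper, not Commons Math code): edited rows are dropped, so the returned vector is shorter, and the edited count falls out of the length difference.

```java
import java.util.Arrays;

/** Option 2 sketch: drop edited rows so the returned residual vector (and,
 *  analogously, each corresponding Jacobian row) shrinks. Hypothetical
 *  helper, not Commons Math code. */
public class RowDropEdit {

    /** Return a new, shorter residual vector with edited rows removed. */
    public static double[] dropEdited(double[] residuals, double cutoff) {
        double[] kept = new double[residuals.length];
        int n = 0;
        for (double r : residuals) {
            if (Math.abs(r) <= cutoff) {
                kept[n++] = r;
            }
        }
        return Arrays.copyOf(kept, n);
    }
}
```

The number of edited residuals is then simply residuals.length minus the returned length, but every consumer of the vector/matrix must now cope with a size that varies between evaluations.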
To summarize: in both cases, users that only use the provided [math] classes would not have to modify their code, while users that provide custom implementations of [math] interfaces would. I would like to get this feature wrapped up for the next release. Please let me know if you have a preference for either implementation and whether there are any other issues I should consider.
Compatibility breaks cannot occur in minor releases.
The next major release should not occur before deprecated classes are all replaced. [I'm thinking about the optimizers, for which the fluent API should be implemented based on your design of NLLS.]
It would be nice to recode the whole "LevenbergMarquardtOptimizer" in full OO Java, but it should be implemented and tested before any new feature is added to the mix.
Do I understand correctly that in the "robust" fit, the weights are modified during the optimization? If so, would the algorithms still be "standard"?
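For context on that question: the usual formulation of such robust fits is iteratively reweighted least squares (IRLS), where the weights are recomputed from the current residuals before each standard weighted least-squares step. A toy one-parameter illustration (my own example with Huber-style weights, not Commons Math code):

```java
/** Toy IRLS sketch: robust location estimate with Huber-style weights
 *  (illustration only, not Commons Math code). */
public class IrlsDemo {

    public static double robustMean(double[] data, double threshold, int iters) {
        double est = 0.0;
        for (double d : data) {
            est += d;                            // start from the plain mean
        }
        est /= data.length;
        for (int it = 0; it < iters; it++) {
            double num = 0.0;
            double den = 0.0;
            for (double d : data) {
                double r = d - est;              // residual at current estimate
                double a = Math.abs(r);
                double w = a <= threshold ? 1.0 : threshold / a;  // Huber weight
                num += w * d;
                den += w;
            }
            est = num / den;                     // standard weighted LS step
        }
        return est;
    }
}
```

Each inner step is a perfectly standard weighted least-squares solve; only the outer loop that refreshes the weights is non-standard, which is one way to answer the question above.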
At first sight, I'd avoid modifying the sizes of the input data (option 2); from an API usage viewpoint, I imagine that user code would require additional "length" tests. Couldn't the problem you mention in option 1 disappear by having different methods that return the a priori weights and the modified weights?
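That suggestion could look something like the following sketch (hypothetical class and method names, not a proposed Commons Math signature): with both weight sets exposed, an edited residual is distinguishable from an a priori zero weight by comparison.

```java
/** Sketch of exposing both weight sets so edited residuals can be
 *  identified by comparison (hypothetical names, not Commons Math API). */
public class WeightPair {
    private final double[] aprioriWeights;
    private final double[] modifiedWeights;

    public WeightPair(double[] apriori, double[] modified) {
        this.aprioriWeights = apriori.clone();
        this.modifiedWeights = modified.clone();
    }

    public double[] getAprioriWeights() {
        return aprioriWeights.clone();
    }

    public double[] getModifiedWeights() {
        return modifiedWeights.clone();
    }

    /** A residual was edited iff its a priori weight was non-zero but the
     *  editor set its modified weight to zero. */
    public int countEdited() {
        int n = 0;
        for (int i = 0; i < aprioriWeights.length; i++) {
            if (aprioriWeights[i] != 0.0 && modifiedWeights[i] == 0.0) {
                n++;
            }
        }
        return n;
    }
}
```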
Best regards,
Gilles
Best Regards,
Evan
[1] http://markmail.org/message/e53nago3swvu3t52
https://issues.apache.org/jira/browse/MATH-1105
[2] http://www.mathworks.com/help/curvefit/removing-outliers.html
http://www.mathworks.com/help/curvefit/least-squares-fitting.html
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@commons.apache.org
For additional commands, e-mail: dev-h...@commons.apache.org