[ https://issues.apache.org/jira/browse/MAHOUT-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15906388#comment-15906388 ]
Saikat Kanjilal commented on MAHOUT-1929:
-----------------------------------------

Ok, more progress: I decided to follow the strategy that ScalaTion takes in implementing GLM (https://github.com/scalation/scalation/blob/master/src/main/scala/scalation/analytics/par/GLM.scala). So far I have:

1) Added a set of apply functions that reuse our existing infrastructure to compute the models that already exist, namely Linear Regression and Ordinary Least Squares
2) Placed this code inside GlmModel for now

Code is here: https://github.com/skanjila/mahout/blob/mahout-1929/math-scala/src/main/scala/org/apache/mahout/math/algorithms/regression/GlmModel.scala

I would really appreciate a look from all interested parties identified in the above comments before I get too much further :). Next steps include:

1) Unit tests for the apply functions
2) Figure out a creative way to tie the other functions into the apply API infrastructure:

    def fit(drmX: DrmLike[K], drmTarget: DrmLike[K], hyperparameters: (Symbol, Any)*): GlmModel[K]
    def setStandardHyperparameters(hyperparameters: Map[Symbol, Any] = Map('foo -> None)): Unit
    def calculateStandardError[M[K] <: GlmModel[K]](X: DrmLike[K], drmTarget: DrmLike[K], drmXtXinv: Matrix, model: M[K]): M[K]
    def modelPostprocessing[M[K] <: GlmModel[K]](model: M[K], X: DrmLike[K], drmTarget: DrmLike[K], drmXtXinv: Matrix): M[K]

Here are my thoughts on the above. My philosophy is that we keep as much as possible of the existing infrastructure that [~rawkintrevo] has put in place, so these functions may simply need to be implemented inside each of the derived classes. I would like to see the apply function eventually do the following:

1) Compute the model
2) Train the model
3) Do the prediction
4) Build quality measures around each of the models

Thoughts?
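To make the four responsibilities listed above for the apply function concrete, here is a minimal, dependency-free sketch of the pattern: a companion-object apply that fits a model and attaches a quality measure, with prediction exposed on the returned model. It is only an illustration under stated assumptions. The names SimpleOlsModel, intercept, slope, rSquared and predict are hypothetical, it works on plain Seq[Double] rather than DrmLike[K], and it does not reflect the actual GlmModel code in the branch linked above.

```scala
// Hypothetical sketch only: none of these names belong to Mahout or ScalaTion.
// Single-feature ordinary least squares on in-memory sequences.
final case class SimpleOlsModel(intercept: Double, slope: Double, rSquared: Double) {
  // Prediction for one observation, using the fitted coefficients.
  def predict(x: Double): Double = intercept + slope * x
}

object SimpleOlsModel {
  // apply fits the coefficients and computes R^2 in a single call.
  def apply(xs: Seq[Double], ys: Seq[Double]): SimpleOlsModel = {
    require(xs.length == ys.length && xs.length >= 2, "need at least two paired observations")
    val n = xs.length.toDouble
    val meanX = xs.sum / n
    val meanY = ys.sum / n

    // Closed-form least-squares estimates for y = intercept + slope * x.
    val sxx = xs.map(x => (x - meanX) * (x - meanX)).sum
    require(sxx > 0.0, "x values must not all be identical")
    val sxy = xs.zip(ys).map { case (x, y) => (x - meanX) * (y - meanY) }.sum
    val slope = sxy / sxx
    val intercept = meanY - slope * meanX

    // Quality measure: R^2 = 1 - SS_res / SS_tot.
    val ssRes = xs.zip(ys).map { case (x, y) =>
      val residual = y - (intercept + slope * x)
      residual * residual
    }.sum
    val ssTot = ys.map(y => (y - meanY) * (y - meanY)).sum
    // Assumption: a constant target is treated as a perfect fit.
    val rSquared = if (ssTot == 0.0) 1.0 else 1.0 - ssRes / ssTot

    new SimpleOlsModel(intercept, slope, rSquared)
  }
}
```

A caller would write something like `val model = SimpleOlsModel(xs, ys)` followed by `model.predict(5.0)`. In the real code the fitting step would presumably delegate to the existing fit infrastructure rather than reimplement it, which is the open design question raised above.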
> Add Generalized Linear Models
> -----------------------------
>
>                 Key: MAHOUT-1929
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1929
>             Project: Mahout
>          Issue Type: Wish
>          Components: Algorithms
>   Affects Versions: 0.13.1
>            Reporter: Trevor Grant
>
> Implement Generalized Linear Models (GLM)
> https://en.wikipedia.org/wiki/Generalized_linear_model

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)