[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14630940#comment-14630940 ]
Xiangrui Meng commented on SPARK-8518: -------------------------------------- [~yanboliang] Thanks for working the design doc! I hope that you enjoyed the process. I'm not super familiar with survival models. The simplest one I know is the censored log-linear formulation resulting from proportional hazard models. The purpose of this JIRA is not to support all survival models, but support one that is most commonly used and easy to parallel in Spark. So I think the design doc also needs to answer the following: 1. Which algorithm is the most popular one? 2. What is the size of the model? 3. How do the algorithms fit into Spark? Are they easy to be parallelized? 4. What is the complexity? Also CC [~rams]. > Log-linear models for survival analysis > --------------------------------------- > > Key: SPARK-8518 > URL: https://issues.apache.org/jira/browse/SPARK-8518 > Project: Spark > Issue Type: New Feature > Components: ML > Reporter: Xiangrui Meng > Assignee: Yanbo Liang > Original Estimate: 168h > Remaining Estimate: 168h > > We want to add basic log-linear models for survival analysis. The > implementation should match the result from R's survival package > (http://cran.r-project.org/web/packages/survival/index.html). > Design doc from [~yanboliang]: > https://docs.google.com/document/d/1fLtB0sqg2HlfqdrJlNHPhpfXO0Zb2_avZrxiVoPEs0E/pub -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org