[jira] [Commented] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small

2016-07-18 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15382382#comment-15382382 ] Daniel Siegmann commented on SPARK-14464: - Nick, thanks for pointing out this iss

[jira] [Commented] (SPARK-732) Recomputation of RDDs may result in duplicated accumulator updates

2014-11-30 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14229277#comment-14229277 ] Daniel Siegmann commented on SPARK-732: --- This is very disappointing. Essentially, Spa

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2014-09-15 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14134101#comment-14134101 ] Daniel Siegmann commented on SPARK-2620: I have tested the case in spark-shell on

[jira] [Commented] (SPARK-732) Recomputation of RDDs may result in duplicated accumulator updates

2014-06-12 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029496#comment-14029496 ] Daniel Siegmann commented on SPARK-732: --- Any update on this issue? As it currently st

[jira] [Commented] (SPARK-2115) Stage kill link is too close to stage details link

2014-06-12 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029808#comment-14029808 ] Daniel Siegmann commented on SPARK-2115: While I agree with this, I'd also like to

[jira] [Commented] (SPARK-2620) case class cannot be used as key for reduce

2014-07-22 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14070543#comment-14070543 ] Daniel Siegmann commented on SPARK-2620: I have confirmed this on Spark 1.0.1 as w

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2016-01-18 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15105772#comment-15105772 ] Daniel Siegmann commented on SPARK-4105: I had this happen in Spark 1.5.0. It only

[jira] [Commented] (SPARK-14033) Merging Estimator, Model, & Transformer

2016-03-22 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15206455#comment-15206455 ] Daniel Siegmann commented on SPARK-14033: - To me, the semantics of this proposal

[jira] [Commented] (SPARK-14033) Merging Estimator & Model

2016-03-24 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15210772#comment-15210772 ] Daniel Siegmann commented on SPARK-14033: - Another thing to consider is that, as

[jira] [Created] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small

2016-04-07 Thread Daniel Siegmann (JIRA)
Daniel Siegmann created SPARK-14464: --- Summary: Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small Key: SPARK-14464 URL: https://issues.apache.org/jira/

[jira] [Updated] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small

2016-04-07 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Siegmann updated SPARK-14464: Description: When training (a.k.a. fitting) org.apache.spark.ml.classification.LogisticReg

[jira] [Commented] (SPARK-14464) Logistic regression performs poorly for very large vectors, even when the number of non-zero features is small

2016-04-07 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15230990#comment-15230990 ] Daniel Siegmann commented on SPARK-14464: - I am working on a fix. I have used it

[jira] [Commented] (SPARK-3928) Support wildcard matches on Parquet files

2015-05-07 Thread Daniel Siegmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533296#comment-14533296 ] Daniel Siegmann commented on SPARK-3928: Yes, passing multiple paths as varargs is