[jira] [Commented] (SPARK-20133) User guide for spark.ml.stat.ChiSquareTest

2017-03-30 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15949418#comment-15949418 ] Benjamin Fradet commented on SPARK-20133: - Can I take this one? > User guide for

[jira] [Created] (SPARK-20097) Fix visibility discrepancy with numInstances and degreesOfFreedom in LR and GLR

2017-03-25 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-20097: --- Summary: Fix visibility discrepancy with numInstances and degreesOfFreedom in LR and GLR Key: SPARK-20097 URL: https://issues.apache.org/jira/browse/SPARK-20097

[jira] [Commented] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-10-27 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15611926#comment-15611926 ] Benjamin Fradet commented on SPARK-16857: - I was wondering why a KMeansEvaluator computing the

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-28 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305275#comment-15305275 ] Benjamin Fradet commented on SPARK-15581: - Thanks, we should maybe add it to the roadmap, don't

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-27 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304869#comment-15304869 ] Benjamin Fradet commented on SPARK-15581: - [~josephkb] Just out of curiosity: I don't see any

[jira] [Commented] (SPARK-15200) Add documentaion and examples for GaussianMixture

2016-05-08 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15275524#comment-15275524 ] Benjamin Fradet commented on SPARK-15200: - Whoops, didn't see it linked to 15101 > Add

[jira] [Commented] (SPARK-15200) Add documentaion and examples for GaussianMixture

2016-05-07 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15275227#comment-15275227 ] Benjamin Fradet commented on SPARK-15200: - I've started working on this > Add documentaion and

[jira] [Created] (SPARK-15200) Add documentaion and examples for GaussianMixture

2016-05-07 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-15200: --- Summary: Add documentaion and examples for GaussianMixture Key: SPARK-15200 URL: https://issues.apache.org/jira/browse/SPARK-15200 Project: Spark

[jira] [Commented] (SPARK-14985) Update LinearRegression, LogisticRegression summary internals to handle model copy

2016-04-30 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15265261#comment-15265261 ] Benjamin Fradet commented on SPARK-14985: - I'll take this one if you guys don't mind. > Update

[jira] [Commented] (SPARK-14817) ML 2.0 QA: Programming guide update and migration guide

2016-04-22 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254196#comment-15254196 ] Benjamin Fradet commented on SPARK-14817: - Count me in! > ML 2.0 QA: Programming guide update

[jira] [Commented] (SPARK-14570) Log instrumentation in Random forests

2016-04-20 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250067#comment-15250067 ] Benjamin Fradet commented on SPARK-14570: - I'll take this one if you guys don't mind. > Log

[jira] [Commented] (SPARK-14730) Expose ColumnPruner as feature transformer

2016-04-20 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250050#comment-15250050 ] Benjamin Fradet commented on SPARK-14730: - [~jlaskowski], [~yanboliang] are one of you guys

[jira] [Created] (SPARK-12983) Correct metrics.properties.template

2016-01-25 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-12983: --- Summary: Correct metrics.properties.template Key: SPARK-12983 URL: https://issues.apache.org/jira/browse/SPARK-12983 Project: Spark Issue Type:

[jira] [Closed] (SPARK-12858) Remove duplicated code in metrics

2016-01-24 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Fradet closed SPARK-12858. --- Resolution: Not A Problem > Remove duplicated code in metrics >

[jira] [Created] (SPARK-12858) Remove duplicated code in metrics

2016-01-16 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-12858: --- Summary: Remove duplicated code in metrics Key: SPARK-12858 URL: https://issues.apache.org/jira/browse/SPARK-12858 Project: Spark Issue Type:

[jira] [Commented] (SPARK-9716) BinaryClassificationEvaluator should accept Double prediction column

2015-12-24 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070992#comment-15070992 ] Benjamin Fradet commented on SPARK-9716: Somewhat related, I think `RegressionEvaluator` should

[jira] [Comment Edited] (SPARK-9716) BinaryClassificationEvaluator should accept Double prediction column

2015-12-24 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070992#comment-15070992 ] Benjamin Fradet edited comment on SPARK-9716 at 12/24/15 1:19 PM: --

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-24 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15071062#comment-15071062 ] Benjamin Fradet commented on SPARK-12247: - The [PR|https://github.com/apache/spark/pull/10411]

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-23 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15069366#comment-15069366 ] Benjamin Fradet commented on SPARK-12247: - Yup, I was thinking of keeping only the rmse

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-22 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15067726#comment-15067726 ] Benjamin Fradet commented on SPARK-12247: - [~thunterdb] Do you think I should also include the

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-21 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15067021#comment-15067021 ] Benjamin Fradet commented on SPARK-12247: - Ok thanks, I'll rework the examples accordingly. >

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15065417#comment-15065417 ] Benjamin Fradet commented on SPARK-12247: - By the way, should we repurpose

[jira] [Commented] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15065344#comment-15065344 ] Benjamin Fradet commented on SPARK-12247: - I've started working on this. > Documentation for

[jira] [Commented] (SPARK-9716) BinaryClassificationEvaluator should accept Double prediction column

2015-12-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15065342#comment-15065342 ] Benjamin Fradet commented on SPARK-9716: [~lkhamsurenl] Are you working on it or can I take over?

[jira] [Created] (SPARK-12368) Better doc for the binary classification evaluator setMetricName method

2015-12-16 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-12368: --- Summary: Better doc for the binary classification evaluator setMetricName method Key: SPARK-12368 URL: https://issues.apache.org/jira/browse/SPARK-12368

[jira] [Commented] (SPARK-12368) Better doc for the binary classification evaluator setMetricName method

2015-12-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15060167#comment-15060167 ] Benjamin Fradet commented on SPARK-12368: - I've started working on this. > Better doc for the

[jira] [Updated] (SPARK-12368) Better doc for the binary classification evaluator' metricName

2015-12-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Fradet updated SPARK-12368: Summary: Better doc for the binary classification evaluator' metricName (was: Better doc

[jira] [Commented] (SPARK-7425) spark.ml Predictor should support other numeric types for label

2015-12-12 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15054664#comment-15054664 ] Benjamin Fradet commented on SPARK-7425: Is there anyone working on this? Because I'm considering

[jira] [Commented] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-10 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15051698#comment-15051698 ] Benjamin Fradet commented on SPARK-12217: - Sorry [~srowen], my bad, I wanted to duplicate the

[jira] [Commented] (SPARK-9059) Update Python Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-12-09 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049116#comment-15049116 ] Benjamin Fradet commented on SPARK-9059: There is a Python code snippet like the Java and Scala

[jira] [Comment Edited] (SPARK-9059) Update Python Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-12-09 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049116#comment-15049116 ] Benjamin Fradet edited comment on SPARK-9059 at 12/10/15 6:49 AM: -- There

[jira] [Commented] (SPARK-9059) Update Python Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-12-08 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15048177#comment-15048177 ] Benjamin Fradet commented on SPARK-9059: Hi [~neelesh77], I know the documentation has been

[jira] [Created] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-08 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-12217: --- Summary: Document invalid handling for StringIndexer Key: SPARK-12217 URL: https://issues.apache.org/jira/browse/SPARK-12217 Project: Spark Issue

[jira] [Commented] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-08 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15047545#comment-15047545 ] Benjamin Fradet commented on SPARK-12217: - I've started working on this. > Document invalid

[jira] [Commented] (SPARK-12159) Add user guide section for IndexToString transformer

2015-12-05 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15043605#comment-15043605 ] Benjamin Fradet commented on SPARK-12159: - I've started working on this. > Add user guide

[jira] [Created] (SPARK-11902) Unhandled case in VectorAssembler#transform

2015-11-21 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-11902: --- Summary: Unhandled case in VectorAssembler#transform Key: SPARK-11902 URL: https://issues.apache.org/jira/browse/SPARK-11902 Project: Spark Issue

[jira] [Commented] (SPARK-9002) KryoSerializer initialization does not include 'Array[Int]'

2015-07-22 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14636832#comment-14636832 ] Benjamin Fradet commented on SPARK-9002: [~rake] are you planning on opening a PR?

[jira] [Commented] (SPARK-9057) Add Scala, Java and Python example to show DStream.transform

2015-07-21 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635945#comment-14635945 ] Benjamin Fradet commented on SPARK-9057: One thing that would be interesting as

[jira] [Commented] (SPARK-9059) Update Python Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-07-21 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14635941#comment-14635941 ] Benjamin Fradet commented on SPARK-9059: I have a version with the updated doc

[jira] [Commented] (SPARK-9059) Update Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-07-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14630984#comment-14630984 ] Benjamin Fradet commented on SPARK-9059: Agreed. Update Direct Kafka Word count

[jira] [Commented] (SPARK-9059) Update Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-07-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14631142#comment-14631142 ] Benjamin Fradet commented on SPARK-9059: We could also demonstrate restarting from

[jira] [Commented] (SPARK-9059) Update Direct Kafka Word count examples to show the use of HasOffsetRanges

2015-07-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14629770#comment-14629770 ] Benjamin Fradet commented on SPARK-9059: I've started working on this. Update

[jira] [Commented] (SPARK-8575) Deprecate callUDF in favor of udf

2015-06-23 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14598353#comment-14598353 ] Benjamin Fradet commented on SPARK-8575: I've started working on this issue.

[jira] [Updated] (SPARK-8575) Deprecate callUDF in favor of udf

2015-06-23 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Fradet updated SPARK-8575: --- Description: Follow-up of [SPARK-8356|https://issues.apache.org/jira/browse/SPARK-8356] to

[jira] [Commented] (SPARK-8115) Remove TestData

2015-06-21 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14595052#comment-14595052 ] Benjamin Fradet commented on SPARK-8115: I've started working on this. Remove

[jira] [Commented] (SPARK-8478) Harmonize UDF-related code to use uniformly UDF instead of Udf

2015-06-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593368#comment-14593368 ] Benjamin Fradet commented on SPARK-8478: As discussed on

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593362#comment-14593362 ] Benjamin Fradet commented on SPARK-8356: I'll create a separate JIRA for

[jira] [Created] (SPARK-8478) Harmonize UDF-related code to use uniformly UDF instead of Udf

2015-06-19 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-8478: -- Summary: Harmonize UDF-related code to use uniformly UDF instead of Udf Key: SPARK-8478 URL: https://issues.apache.org/jira/browse/SPARK-8478 Project: Spark

[jira] [Comment Edited] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-19 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14593362#comment-14593362 ] Benjamin Fradet edited comment on SPARK-8356 at 6/19/15 12:02 PM:

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590491#comment-14590491 ] Benjamin Fradet commented on SPARK-8356: Somewhat related, about being coherent,

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590513#comment-14590513 ] Benjamin Fradet commented on SPARK-8356: OK, I'll make sure Udf disappears, should

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590478#comment-14590478 ] Benjamin Fradet commented on SPARK-8356: [~marmbrus] Are we sure {{callUDF}} is

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-17 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590521#comment-14590521 ] Benjamin Fradet commented on SPARK-8356: Ok, thanks a lot for your pointers.

[jira] [Created] (SPARK-8399) Overlap between histograms and axis' name in Spark Streaming UI

2015-06-16 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-8399: -- Summary: Overlap between histograms and axis' name in Spark Streaming UI Key: SPARK-8399 URL: https://issues.apache.org/jira/browse/SPARK-8399 Project: Spark

[jira] [Commented] (SPARK-8399) Overlap between histograms and axis' name in Spark Streaming UI

2015-06-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588358#comment-14588358 ] Benjamin Fradet commented on SPARK-8399: I'll submit a patch shortly. Overlap

[jira] [Commented] (SPARK-8356) Reconcile callUDF and callUdf

2015-06-16 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14588579#comment-14588579 ] Benjamin Fradet commented on SPARK-8356: I've started working on this issue.

[jira] [Created] (SPARK-7255) spark.streaming.kafka.maxRetries not documented

2015-04-29 Thread Benjamin Fradet (JIRA)
Benjamin Fradet created SPARK-7255: -- Summary: spark.streaming.kafka.maxRetries not documented Key: SPARK-7255 URL: https://issues.apache.org/jira/browse/SPARK-7255 Project: Spark Issue

[jira] [Commented] (SPARK-7255) spark.streaming.kafka.maxRetries not documented

2015-04-29 Thread Benjamin Fradet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14520266#comment-14520266 ] Benjamin Fradet commented on SPARK-7255: Otherwise, I'd be glad to add it to the