[jira] [Resolved] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-09-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-6548. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 6297

[jira] [Created] (SPARK-10578) pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column

2015-09-12 Thread Karen Yin-Yee Ng (JIRA)
Karen Yin-Yee Ng created SPARK-10578: Summary: pyspark.ml.classification.RandomForestClassifer does not return `rawPrediction` column Key: SPARK-10578 URL: https://issues.apache.org/jira/browse/SPARK-10578

[jira] [Commented] (SPARK-10401) spark-submit --unsupervise

2015-09-12 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742252#comment-14742252 ] Sanket Reddy commented on SPARK-10401: -- I would like to work on it > spark-submit --unsupervise >

[jira] [Updated] (SPARK-10429) MutableProjection should evaluate all expressions first and then update the mutable row

2015-09-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-10429: - Assignee: Wenchen Fan > MutableProjection should evaluate all expressions first and then update the >

[jira] [Comment Edited] (SPARK-10429) MutableProjection should evaluate all expressions first and then update the mutable row

2015-09-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742251#comment-14742251 ] Yin Huai edited comment on SPARK-10429 at 9/12/15 10:50 PM: [Here |

[jira] [Comment Edited] (SPARK-10429) MutableProjection should evaluate all expressions first and then update the mutable row

2015-09-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742251#comment-14742251 ] Yin Huai edited comment on SPARK-10429 at 9/12/15 10:37 PM: [Here |

[jira] [Issue Comment Deleted] (SPARK-10401) spark-submit --unsupervise

2015-09-12 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sanket Reddy updated SPARK-10401: - Comment: was deleted (was: I would like to work on it) > spark-submit --unsupervise >

[jira] [Resolved] (SPARK-10330) Use SparkHadoopUtil TaskAttemptContext reflection methods in more places

2015-09-12 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-10330. Resolution: Fixed Assignee: Josh Rosen Fix Version/s: 1.6.0 Fixed by my PR for

[jira] [Commented] (SPARK-10429) MutableProjection should evaluate all expressions first and then update the mutable row

2015-09-12 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742251#comment-14742251 ] Yin Huai commented on SPARK-10429: -- [Here |

[jira] [Commented] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-09-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742203#comment-14742203 ] Reynold Xin commented on SPARK-6548: [~davies] the patch that's merged is not using the new aggregate

[jira] [Commented] (SPARK-10557) Publish Spark 1.5.0 on Maven central

2015-09-12 Thread Marko Asplund (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742209#comment-14742209 ] Marko Asplund commented on SPARK-10557: --- thanks! (y) > Publish Spark 1.5.0 on Maven central >

[jira] [Commented] (SPARK-9720) spark.ml Identifiable types should have UID in toString methods

2015-09-12 Thread Bertrand Dechoux (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742161#comment-14742161 ] Bertrand Dechoux commented on SPARK-9720: - The pull request can be merged. > spark.ml

[jira] [Commented] (SPARK-10576) Move .java files out of src/main/scala

2015-09-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742280#comment-14742280 ] Patrick Wendell commented on SPARK-10576: - FWIW - seems to me like moving them into /java makes

[jira] [Created] (SPARK-10579) Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns

2015-09-12 Thread Narine Kokhlikyan (JIRA)
Narine Kokhlikyan created SPARK-10579: - Summary: Extend statistical functions: Add Cardinality/Quantiles/Quartiles/Median in Statistics , e.g. for columns Key: SPARK-10579 URL:

[jira] [Created] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-12 Thread Vinod KC (JIRA)
Vinod KC created SPARK-10575: Summary: Wrap RDD.takeSample with scope Key: SPARK-10575 URL: https://issues.apache.org/jira/browse/SPARK-10575 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10575: Assignee: (was: Apache Spark) > Wrap RDD.takeSample with scope >

[jira] [Commented] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14741961#comment-14741961 ] Apache Spark commented on SPARK-10575: -- User 'vinodkc' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10575) Wrap RDD.takeSample with scope

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10575: Assignee: Apache Spark > Wrap RDD.takeSample with scope > --

[jira] [Updated] (SPARK-10566) SnappyCompressionCodec init exception handling masks important error information

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10566: -- Assignee: Daniel Imfeld > SnappyCompressionCodec init exception handling masks important error >

[jira] [Commented] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-12 Thread shimizu yoshihiro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14741922#comment-14741922 ] shimizu yoshihiro commented on SPARK-10518: --- [~mengxr] Thank you for review on github. Here is

[jira] [Resolved] (SPARK-10566) SnappyCompressionCodec init exception handling masks important error information

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10566. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull

[jira] [Updated] (SPARK-10554) Potential NPE with ShutdownHook

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10554: -- Assignee: Nithin Asokan > Potential NPE with ShutdownHook > --- > >

[jira] [Resolved] (SPARK-10554) Potential NPE with ShutdownHook

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10554. --- Resolution: Fixed Fix Version/s: 1.5.1 1.6.0 Issue resolved by pull

[jira] [Commented] (SPARK-10568) Error thrown in stopping one component in SparkContext.stop() doesn't allow other components to be stopped

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14741978#comment-14741978 ] Sean Owen commented on SPARK-10568: --- Yeah I can imagine some relatively painless Scala code that

[jira] [Updated] (SPARK-10568) Error thrown in stopping one component in SparkContext.stop() doesn't allow other components to be stopped

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10568: -- Priority: Minor (was: Major) > Error thrown in stopping one component in SparkContext.stop() doesn't

[jira] [Resolved] (SPARK-10547) Streamline / improve style of Java API tests

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10547. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8706

[jira] [Updated] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10518: -- Assignee: shimizu yoshihiro > Update code examples in spark.ml user guide to use LIBSVM data source

[jira] [Created] (SPARK-10576) Move .java files out of src/main/scala

2015-09-12 Thread Sean Owen (JIRA)
Sean Owen created SPARK-10576: - Summary: Move .java files out of src/main/scala Key: SPARK-10576 URL: https://issues.apache.org/jira/browse/SPARK-10576 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-10538) java.lang.NegativeArraySizeException during join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742018#comment-14742018 ] Maciej Bryński commented on SPARK-10538: OK. I managed to isolate the problem. I have two

[jira] [Commented] (SPARK-9610) Class and instance weighting for ML

2015-09-12 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742064#comment-14742064 ] Nickolay Yakushev commented on SPARK-9610: -- Thanks for reply. > Class and instance weighting for

[jira] [Comment Edited] (SPARK-10576) Move .java files out of src/main/scala

2015-09-12 Thread Kiran Lonikar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742066#comment-14742066 ] Kiran Lonikar edited comment on SPARK-10576 at 9/12/15 2:13 PM: thats

[jira] [Commented] (SPARK-10576) Move .java files out of src/main/scala

2015-09-12 Thread Kiran Lonikar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742066#comment-14742066 ] Kiran Lonikar commented on SPARK-10576: --- thats right, the intent is to find out if there is any

[jira] [Commented] (SPARK-9213) Improve regular expression performance (via joni)

2015-09-12 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742070#comment-14742070 ] Yadong Qi commented on SPARK-9213: -- [~rxin] I'm working on this, and already have a pull request as you

[jira] [Comment Edited] (SPARK-9213) Improve regular expression performance (via joni)

2015-09-12 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742070#comment-14742070 ] Yadong Qi edited comment on SPARK-9213 at 9/12/15 2:29 PM: --- [~rxin] I'm working

[jira] [Comment Edited] (SPARK-9213) Improve regular expression performance (via joni)

2015-09-12 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742070#comment-14742070 ] Yadong Qi edited comment on SPARK-9213 at 9/12/15 2:41 PM: --- [~rxin] I'm working

[jira] [Comment Edited] (SPARK-9213) Improve regular expression performance (via joni)

2015-09-12 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742070#comment-14742070 ] Yadong Qi edited comment on SPARK-9213 at 9/12/15 2:42 PM: --- [~rxin] I'm working

[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742036#comment-14742036 ] Maciej Bryński commented on SPARK-10577: [~rxin] I can find broadcast in functions.scala. Is it

[jira] [Comment Edited] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742036#comment-14742036 ] Maciej Bryński edited comment on SPARK-10577 at 9/12/15 12:29 PM: --

[jira] [Comment Edited] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742042#comment-14742042 ] Maciej Bryński edited comment on SPARK-10577 at 9/12/15 12:47 PM: -- Same

[jira] [Assigned] (SPARK-10222) More thoroughly deprecate Bagel in favor of GraphX

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10222: Assignee: Apache Spark (was: Sean Owen) > More thoroughly deprecate Bagel in favor of

[jira] [Commented] (SPARK-6350) Make mesosExecutorCores configurable in mesos "fine-grained" mode

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742049#comment-14742049 ] Apache Spark commented on SPARK-6350: - User 'dragos' has created a pull request for this issue:

[jira] [Created] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
Maciej Bryński created SPARK-10577: -- Summary: [PySpark, SQL] DataFrame hint for broadcast join Key: SPARK-10577 URL: https://issues.apache.org/jira/browse/SPARK-10577 Project: Spark Issue

[jira] [Comment Edited] (SPARK-10538) java.lang.NegativeArraySizeException during join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742018#comment-14742018 ] Maciej Bryński edited comment on SPARK-10538 at 9/12/15 12:24 PM: -- OK.

[jira] [Comment Edited] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742042#comment-14742042 ] Maciej Bryński edited comment on SPARK-10577 at 9/12/15 12:46 PM: -- Same

[jira] [Comment Edited] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742042#comment-14742042 ] Maciej Bryński edited comment on SPARK-10577 at 9/12/15 12:46 PM: -- Same

[jira] [Updated] (SPARK-10222) More thoroughly deprecate Bagel in favor of GraphX

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10222: -- Assignee: Sean Owen Target Version/s: 1.6.0 (was: 2+) Priority: Minor (was:

[jira] [Commented] (SPARK-10577) [PySpark, SQL] DataFrame hint for broadcast join

2015-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742042#comment-14742042 ] Maciej Bryński commented on SPARK-10577: Same without Hive support. Py4JJavaError: An error

[jira] [Commented] (SPARK-10222) More thoroughly deprecate Bagel in favor of GraphX

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14742043#comment-14742043 ] Apache Spark commented on SPARK-10222: -- User 'srowen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10222) More thoroughly deprecate Bagel in favor of GraphX

2015-09-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10222: Assignee: Sean Owen (was: Apache Spark) > More thoroughly deprecate Bagel in favor of

[jira] [Updated] (SPARK-9014) Allow Python spark API to use built-in exponential operator

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9014: - Assignee: Alexey Grishchenko > Allow Python spark API to use built-in exponential operator >

[jira] [Updated] (SPARK-7942) Receiver's life cycle is inconsistent with streaming job.

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7942: - Assignee: Tathagata Das > Receiver's life cycle is inconsistent with streaming job. >

[jira] [Updated] (SPARK-10572) Investigate the contentions bewteen tasks in the same executor

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10572: -- Component/s: Spark Core Scheduler > Investigate the contentions bewteen tasks in the

[jira] [Updated] (SPARK-4879) Missing output partitions after job completes with speculative execution

2015-09-12 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4879: - Target Version/s: 1.3.0 (was: 1.0.3, 1.1.2, 1.2.2, 1.3.0) Labels: (was: backport-needed)