[jira] [Commented] (SPARK-18821) Bisecting k-means wrapper in SparkR

2017-01-02 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794322#comment-15794322 ] Miao Wang commented on SPARK-18821: --- Start it now. ETA within one week. > Bisecting k-means wrapper in

[jira] [Updated] (SPARK-16872) Include Gaussian Naive Bayes Classifier

2017-01-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16872: - Description: I implemented Gaussian NB according to scikit-learn's {{GaussianNB}}. In GaussianNB

[jira] [Updated] (SPARK-16872) Include Gaussian Naive Bayes Classifier

2017-01-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16872: - Description: I implement Gaussian NB according to scikit-learn's {{GaussianNB}}. In GaussianNB

[jira] [Commented] (SPARK-19035) rand() function in case when cause failed

2017-01-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794221#comment-15794221 ] Wenchen Fan commented on SPARK-19035: - This is because the `rand()` in GROUP BY and in `SELECT` are

[jira] [Resolved] (SPARK-18998) Add a cbo conf to switch between default statistics and cbo estimated statistics

2017-01-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18998. - Resolution: Fixed Assignee: Zhenhua Wang Fix Version/s: 2.2.0 > Add a cbo conf

[jira] [Commented] (SPARK-18994) worker clean up app directory block the heartbeat sending

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794100#comment-15794100 ] Apache Spark commented on SPARK-18994: -- User 'liujianhuiouc' has created a pull request for this

[jira] [Commented] (SPARK-18994) worker clean up app directory block the heartbeat sending

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794090#comment-15794090 ] Apache Spark commented on SPARK-18994: -- User 'liujianhuiouc' has created a pull request for this

[jira] [Resolved] (SPARK-19029) Remove databaseName from SimpleCatalogRelation

2017-01-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19029. - Resolution: Fixed Fix Version/s: 2.2.0 > Remove databaseName from SimpleCatalogRelation

[jira] [Assigned] (SPARK-19055) SparkSession initialization will be associated with invalid SparkContext when new SparkContext is created to replace stopped SparkContext

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19055: Assignee: Apache Spark > SparkSession initialization will be associated with invalid

[jira] [Assigned] (SPARK-19055) SparkSession initialization will be associated with invalid SparkContext when new SparkContext is created to replace stopped SparkContext

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19055: Assignee: (was: Apache Spark) > SparkSession initialization will be associated with

[jira] [Commented] (SPARK-19055) SparkSession initialization will be associated with invalid SparkContext when new SparkContext is created to replace stopped SparkContext

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794029#comment-15794029 ] Apache Spark commented on SPARK-19055: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-19035) rand() function in case when cause failed

2017-01-02 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15794015#comment-15794015 ] Xiao Li commented on SPARK-19035: - This is a bug. We are fixing it in another JIRA, SPARK-18969 > rand()

[jira] [Created] (SPARK-19055) SparkSession initialization will be associated with invalid SparkContext when new SparkContext is created to replace stopped SparkContext

2017-01-02 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-19055: --- Summary: SparkSession initialization will be associated with invalid SparkContext when new SparkContext is created to replace stopped SparkContext Key: SPARK-19055 URL:

[jira] [Commented] (SPARK-19054) Eliminate extra pass in NB

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793987#comment-15793987 ] Apache Spark commented on SPARK-19054: -- User 'zhengruifeng' has created a pull request for this

[jira] [Assigned] (SPARK-19054) Eliminate extra pass in NB

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19054: Assignee: (was: Apache Spark) > Eliminate extra pass in NB >

[jira] [Assigned] (SPARK-19054) Eliminate extra pass in NB

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19054: Assignee: Apache Spark > Eliminate extra pass in NB > -- > >

[jira] [Created] (SPARK-19054) Eliminate extra pass in NB

2017-01-02 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-19054: Summary: Eliminate extra pass in NB Key: SPARK-19054 URL: https://issues.apache.org/jira/browse/SPARK-19054 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18757) Models in Pyspark support column setters

2017-01-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793971#comment-15793971 ] zhengruifeng commented on SPARK-18757: -- OK, I will follow your guides. > Models in Pyspark support

[jira] [Commented] (SPARK-10078) Vector-free L-BFGS

2017-01-02 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793770#comment-15793770 ] Debasish Das commented on SPARK-10078: -- [~mengxr] [~dlwh] is it possible to implement VL-BFGS as

[jira] [Comment Edited] (SPARK-10078) Vector-free L-BFGS

2017-01-02 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793760#comment-15793760 ] Debasish Das edited comment on SPARK-10078 at 1/3/17 12:26 AM: --- Ideally

[jira] [Commented] (SPARK-10078) Vector-free L-BFGS

2017-01-02 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793760#comment-15793760 ] Debasish Das commented on SPARK-10078: -- Ideally feature partitioning should be automatically

[jira] [Commented] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2017-01-02 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793668#comment-15793668 ] Dongjoon Hyun commented on SPARK-18857: --- Thank you for testing and confirming! > SparkSQL

[jira] [Commented] (SPARK-10078) Vector-free L-BFGS

2017-01-02 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793488#comment-15793488 ] Seth Hendrickson commented on SPARK-10078: -- [~yanboliang] I was a bit confused by the following

[jira] [Created] (SPARK-19053) Supporting multiple evaluation metrics in DataFrame-based API: discussion

2017-01-02 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19053: - Summary: Supporting multiple evaluation metrics in DataFrame-based API: discussion Key: SPARK-19053 URL: https://issues.apache.org/jira/browse/SPARK-19053

[jira] [Updated] (SPARK-19036) Merging dealyed micro batches

2017-01-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19036: - Component/s: (was: Structured Streaming) DStreams > Merging dealyed micro

[jira] [Updated] (SPARK-19041) Fix code snippet compilation issues in Structured Streaming Programming Guide

2017-01-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-19041: - Fix Version/s: 2.1.1 > Fix code snippet compilation issues in Structured Streaming Programming

[jira] [Commented] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15793189#comment-15793189 ] Apache Spark commented on SPARK-18922: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Updated] (SPARK-18922) Fix more resource-closing-related and path-related test failures in identified ones on Windows

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18922: -- Fix Version/s: (was: 2.2.0) > Fix more resource-closing-related and path-related test failures in

[jira] [Updated] (SPARK-19051) test_hivecontext (pyspark.sql.tests.HiveSparkSubmitTests) fails in python/run-tests

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19051: -- Target Version/s: (was: 2.0.1) Labels: (was: test-failure) Fix Version/s:

[jira] [Updated] (SPARK-19022) Fix tests dependent on OS due to different newline characters

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19022: -- Assignee: Hyukjin Kwon > Fix tests dependent on OS due to different newline characters >

[jira] [Resolved] (SPARK-19022) Fix tests dependent on OS due to different newline characters

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19022. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16433

[jira] [Updated] (SPARK-18379) Make the parallelism of parallelPartitionDiscovery configurable.

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18379: -- Target Version/s: (was: 2.0.3, 2.1.0) Fix Version/s: 2.1.1 > Make the parallelism of

[jira] [Resolved] (SPARK-19002) Check pep8 against all the python scripts

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19002. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16405

[jira] [Updated] (SPARK-19002) Check pep8 against all the python scripts

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19002: -- Assignee: Hyukjin Kwon > Check pep8 against all the python scripts >

[jira] [Updated] (SPARK-19041) Fix code snippet compilation issues in Structured Streaming Programming Guide

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-19041: -- Assignee: Liwei Lin > Fix code snippet compilation issues in Structured Streaming Programming Guide >

[jira] [Resolved] (SPARK-19041) Fix code snippet compilation issues in Structured Streaming Programming Guide

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-19041. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16442

[jira] [Commented] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2017-01-02 Thread Danilo Ascione (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792934#comment-15792934 ] Danilo Ascione commented on SPARK-18948: I understand. I'll close the PR. Thank you > Add Mean

[jira] [Assigned] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19052: Assignee: Apache Spark > the rest api don't support multiple standby masters on

[jira] [Assigned] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19052: Assignee: (was: Apache Spark) > the rest api don't support multiple standby masters

[jira] [Commented] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster

2017-01-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792932#comment-15792932 ] Apache Spark commented on SPARK-19052: -- User 'hustfxj' has created a pull request for this issue:

[jira] [Created] (SPARK-19052) the rest api don't support multiple standby masters on standalone cluster

2017-01-02 Thread hustfxj (JIRA)
hustfxj created SPARK-19052: --- Summary: the rest api don't support multiple standby masters on standalone cluster Key: SPARK-19052 URL: https://issues.apache.org/jira/browse/SPARK-19052 Project: Spark

[jira] [Commented] (SPARK-18199) Support appending to Parquet files

2017-01-02 Thread Soubhik Chakraborty (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792773#comment-15792773 ] Soubhik Chakraborty commented on SPARK-18199: - Thanks for pointing to the PR. In such case,

[jira] [Commented] (SPARK-18991) Change ContextCleaner.referenceBuffer to ConcurrentHashMap to make it faster

2017-01-02 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792707#comment-15792707 ] Prashant Sharma commented on SPARK-18991: - Thank you for the fix! > Change

[jira] [Commented] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2017-01-02 Thread vishal agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792731#comment-15792731 ] vishal agrawal commented on SPARK-18857: Thanks. It's working fine now for our scenario. >

[jira] [Commented] (SPARK-18301) VectorAssembler does not support StructTypes

2017-01-02 Thread Steffen Herbold (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792615#comment-15792615 ] Steffen Herbold commented on SPARK-18301: - I think whether this is a bug or a feature request depends

[jira] [Commented] (SPARK-19045) irrelevant warning when creating a checkpoint dir

2017-01-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15792530#comment-15792530 ] Sean Owen commented on SPARK-19045: --- I get it, but it's not worth this complexity to check these cases