[jira] [Assigned] (SPARK-16321) Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16321: Assignee: (was: Apache Spark) > Spark 2.0 performance drop vs Spark 1.6 when reading p

[jira] [Commented] (SPARK-16321) Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404676#comment-15404676 ] Apache Spark commented on SPARK-16321: -- User 'maver1ck' has created a pull request f

[jira] [Resolved] (SPARK-16858) Removal of TestHiveSharedState

2016-08-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16858. - Resolution: Fixed Assignee: Xiao Li Fix Version/s: 2.1.0 > Removal of TestHiveSha

[jira] [Created] (SPARK-16861) Refactor PySpark accumulator API to be on top of AccumulatorV2 API

2016-08-02 Thread holdenk (JIRA)
holdenk created SPARK-16861: --- Summary: Refactor PySpark accumulator API to be on top of AccumulatorV2 API Key: SPARK-16861 URL: https://issues.apache.org/jira/browse/SPARK-16861 Project: Spark Iss

[jira] [Assigned] (SPARK-16861) Refactor PySpark accumulator API to be on top of AccumulatorV2 API

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16861: Assignee: Apache Spark > Refactor PySpark accumulator API to be on top of AccumulatorV2 AP

[jira] [Commented] (SPARK-16861) Refactor PySpark accumulator API to be on top of AccumulatorV2 API

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404891#comment-15404891 ] Apache Spark commented on SPARK-16861: -- User 'holdenk' has created a pull request fo

[jira] [Assigned] (SPARK-16861) Refactor PySpark accumulator API to be on top of AccumulatorV2 API

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16861: Assignee: (was: Apache Spark) > Refactor PySpark accumulator API to be on top of Accum

[jira] [Resolved] (SPARK-16796) Visible passwords on Spark environment page

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16796. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14409 [https://github.co

[jira] [Assigned] (SPARK-16671) Merge variable substitution code in core and SQL

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16671: Assignee: (was: Apache Spark) > Merge variable substitution code in core and SQL > ---

[jira] [Assigned] (SPARK-16671) Merge variable substitution code in core and SQL

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16671: Assignee: Apache Spark > Merge variable substitution code in core and SQL > --

[jira] [Commented] (SPARK-16671) Merge variable substitution code in core and SQL

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405022#comment-15405022 ] Apache Spark commented on SPARK-16671: -- User 'vanzin' has created a pull request for

[jira] [Updated] (SPARK-15541) SparkContext.stop throws error

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15541: -- Fix Version/s: 2.1.0 2.0.1 1.6.3 > SparkContext.stop throws error

[jira] [Assigned] (SPARK-16700) StructType doesn't accept Python dicts anymore

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16700: Assignee: Apache Spark > StructType doesn't accept Python dicts anymore >

[jira] [Assigned] (SPARK-16700) StructType doesn't accept Python dicts anymore

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-16700: -- Assignee: Davies Liu > StructType doesn't accept Python dicts anymore > --

[jira] [Assigned] (SPARK-16700) StructType doesn't accept Python dicts anymore

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16700: Assignee: (was: Apache Spark) > StructType doesn't accept Python dicts anymore > -

[jira] [Commented] (SPARK-16700) StructType doesn't accept Python dicts anymore

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405043#comment-15405043 ] Davies Liu commented on SPARK-16700: Sent PR https://github.com/apache/spark/pull/144

[jira] [Commented] (SPARK-16700) StructType doesn't accept Python dicts anymore

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405044#comment-15405044 ] Apache Spark commented on SPARK-16700: -- User 'davies' has created a pull request for

[jira] [Commented] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-08-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405050#comment-15405050 ] Xusen Yin commented on SPARK-16857: --- Using CrossValidator with KMeans should be support

[jira] [Commented] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405055#comment-15405055 ] Sean Owen commented on SPARK-16857: --- I don't think that in general it makes sense to us

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread Sean Zhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405057#comment-15405057 ] Sean Zhong commented on SPARK-16320: [~maver1ck] Did you use the test case in this ji

[jira] [Commented] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-08-02 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405069#comment-15405069 ] Xusen Yin commented on SPARK-16857: --- I agree the cluster assignments could be arbitrary

[jira] [Commented] (SPARK-16610) When writing ORC files, orc.compress should not be overridden if users do not set "compression" in the options

2016-08-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405071#comment-15405071 ] Hyukjin Kwon commented on SPARK-16610: -- One thought is, we might have to document th

[jira] [Commented] (SPARK-14453) Remove SPARK_JAVA_OPTS environment variable

2016-08-02 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405112#comment-15405112 ] Saisai Shao commented on SPARK-14453: - If you want to fix this issue, it would be bet

[jira] [Created] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-02 Thread Tejas Patil (JIRA)
Tejas Patil created SPARK-16862: --- Summary: Configurable buffer size in `UnsafeSorterSpillReader` Key: SPARK-16862 URL: https://issues.apache.org/jira/browse/SPARK-16862 Project: Spark Issue Typ

[jira] [Created] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-16863: Summary: ProbabilisticClassifier.fit check threshoulds' length Key: SPARK-16863 URL: https://issues.apache.org/jira/browse/SPARK-16863 Project: Spark Issue T

[jira] [Assigned] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16863: Assignee: (was: Apache Spark) > ProbabilisticClassifier.fit check threshoulds' length

[jira] [Assigned] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16863: Assignee: Apache Spark > ProbabilisticClassifier.fit check threshoulds' length > -

[jira] [Commented] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405182#comment-15405182 ] Apache Spark commented on SPARK-16863: -- User 'zhengruifeng' has created a pull reque

[jira] [Created] (SPARK-16864) Comprehensive version info

2016-08-02 Thread jay vyas (JIRA)
jay vyas created SPARK-16864: Summary: Comprehensive version info Key: SPARK-16864 URL: https://issues.apache.org/jira/browse/SPARK-16864 Project: Spark Issue Type: Improvement Repor

[jira] [Commented] (SPARK-14387) Exceptions thrown when querying ORC tables

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405192#comment-15405192 ] Apache Spark commented on SPARK-14387: -- User 'rajeshbalamohan' has created a pull re

[jira] [Created] (SPARK-16865) A file-based end-to-end SQL query suite

2016-08-02 Thread Peter Lee (JIRA)
Peter Lee created SPARK-16865: - Summary: A file-based end-to-end SQL query suite Key: SPARK-16865 URL: https://issues.apache.org/jira/browse/SPARK-16865 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-16866) Basic infrastructure for file-based SQL end-to-end tests

2016-08-02 Thread Peter Lee (JIRA)
Peter Lee created SPARK-16866: - Summary: Basic infrastructure for file-based SQL end-to-end tests Key: SPARK-16866 URL: https://issues.apache.org/jira/browse/SPARK-16866 Project: Spark Issue Type

[jira] [Assigned] (SPARK-16866) Basic infrastructure for file-based SQL end-to-end tests

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16866: Assignee: (was: Apache Spark) > Basic infrastructure for file-based SQL end-to-end tes

[jira] [Commented] (SPARK-16866) Basic infrastructure for file-based SQL end-to-end tests

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405203#comment-15405203 ] Apache Spark commented on SPARK-16866: -- User 'petermaxlee' has created a pull reques

[jira] [Assigned] (SPARK-16866) Basic infrastructure for file-based SQL end-to-end tests

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16866: Assignee: Apache Spark > Basic infrastructure for file-based SQL end-to-end tests > --

[jira] [Commented] (SPARK-16495) Add ADMM optimizer in mllib package

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405217#comment-15405217 ] Apache Spark commented on SPARK-16495: -- User 'ZunwenYou' has created a pull request

[jira] [Assigned] (SPARK-16853) Analysis error for DataSet typed selection

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16853: Assignee: Apache Spark > Analysis error for DataSet typed selection >

[jira] [Commented] (SPARK-16853) Analysis error for DataSet typed selection

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405220#comment-15405220 ] Apache Spark commented on SPARK-16853: -- User 'clockfly' has created a pull request f

[jira] [Assigned] (SPARK-16853) Analysis error for DataSet typed selection

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16853: Assignee: (was: Apache Spark) > Analysis error for DataSet typed selection > -

[jira] [Assigned] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16862: Assignee: Apache Spark > Configurable buffer size in `UnsafeSorterSpillReader` > -

[jira] [Commented] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405245#comment-15405245 ] Apache Spark commented on SPARK-16862: -- User 'tejasapatil' has created a pull reques

[jira] [Assigned] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16862: Assignee: (was: Apache Spark) > Configurable buffer size in `UnsafeSorterSpillReader`

[jira] [Commented] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405276#comment-15405276 ] Sean Owen commented on SPARK-16863: --- Wait, how is this different from SPARK-16851? I do

[jira] [Commented] (SPARK-7146) Should ML sharedParams be a public API?

2016-08-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405300#comment-15405300 ] Nicholas Chammas commented on SPARK-7146: - A quick update from a PySpark user: I a

[jira] [Comment Edited] (SPARK-7146) Should ML sharedParams be a public API?

2016-08-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405300#comment-15405300 ] Nicholas Chammas edited comment on SPARK-7146 at 8/3/16 4:45 AM: ---

[jira] [Commented] (SPARK-16863) ProbabilisticClassifier.fit check threshoulds' length

2016-08-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405333#comment-15405333 ] zhengruifeng commented on SPARK-16863: -- [SPARK-16851] add checking for {{Probabilis

[jira] [Commented] (SPARK-16864) Comprehensive version info

2016-08-02 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405421#comment-15405421 ] Jagadeesan A S commented on SPARK-16864: Hi : I was looking at this JIRA and foun

[jira] [Created] (SPARK-16867) createTable and alterTable in ExternalCatalog should not take db

2016-08-02 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-16867: --- Summary: createTable and alterTable in ExternalCatalog should not take db Key: SPARK-16867 URL: https://issues.apache.org/jira/browse/SPARK-16867 Project: Spark

[jira] [Commented] (SPARK-16867) createTable and alterTable in ExternalCatalog should not take db

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15405440#comment-15405440 ] Apache Spark commented on SPARK-16867: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-16867) createTable and alterTable in ExternalCatalog should not take db

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16867: Assignee: Wenchen Fan (was: Apache Spark) > createTable and alterTable in ExternalCatalog

[jira] [Assigned] (SPARK-16867) createTable and alterTable in ExternalCatalog should not take db

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16867: Assignee: Apache Spark (was: Wenchen Fan) > createTable and alterTable in ExternalCatalog
