[jira] [Resolved] (SPARK-16734) Make sure examples in all language bindings are consistent

2016-08-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16734. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Make sure examples in al

[jira] [Commented] (SPARK-16842) Concern about disallowing user-given schema for Parquet and ORC

2016-08-02 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403567#comment-15403567 ] Cheng Lian commented on SPARK-16842: First of all, the cost of schema discovery can b

[jira] [Commented] (SPARK-16842) Concern about disallowing user-given schema for Parquet and ORC

2016-08-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403573#comment-15403573 ] Hyukjin Kwon commented on SPARK-16842: -- Thank you both for great explanations. I wil

[jira] [Commented] (SPARK-16831) CrossValidator reports incorrect avgMetrics

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403576#comment-15403576 ] Apache Spark commented on SPARK-16831: -- User 'pkch' has created a pull request for t

[jira] [Assigned] (SPARK-16831) CrossValidator reports incorrect avgMetrics

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16831: Assignee: (was: Apache Spark) > CrossValidator reports incorrect avgMetrics >

[jira] [Closed] (SPARK-16842) Concern about disallowing user-given schema for Parquet and ORC

2016-08-02 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon closed SPARK-16842. Resolution: Not A Problem > Concern about disallowing user-given schema for Parquet and ORC > -

[jira] [Assigned] (SPARK-16831) CrossValidator reports incorrect avgMetrics

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16831: Assignee: Apache Spark > CrossValidator reports incorrect avgMetrics > ---

[jira] [Commented] (SPARK-16832) CrossValidator and TrainValidationSplit are not random without seed

2016-08-02 Thread Max Moroz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403585#comment-15403585 ] Max Moroz commented on SPARK-16832: --- [~bryanc] If this is intentional, then this is a d

[jira] [Commented] (SPARK-16834) TrainValildationSplit and direct evaluation produce different scores

2016-08-02 Thread Max Moroz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403586#comment-15403586 ] Max Moroz commented on SPARK-16834: --- [~sowen] Unfortunately, it's not related to the av

[jira] [Commented] (SPARK-16846) read.csv() option: "inferSchema" don't work

2016-08-02 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403592#comment-15403592 ] Yuming Wang commented on SPARK-16846: - You may need to be remove -schema-. The follow

[jira] [Updated] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16851: - Description: {code} val path = "./spark-2.0.0-bin-hadoop2.7/data/mllib/sample_multiclass_classif

[jira] [Created] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-16851: Summary: Incorrect threshould length in 'setThresholds()' evoke Exception Key: SPARK-16851 URL: https://issues.apache.org/jira/browse/SPARK-16851 Project: Spark

[jira] [Updated] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16851: - Description: {code} val path = "./spark-2.0.0-bin-hadoop2.7/data/mllib/sample_multiclass_classif

[jira] [Updated] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16851: - Description: {code} val path = "./spark-2.0.0-bin-hadoop2.7/data/mllib/sample_multiclass_classif

[jira] [Created] (SPARK-16852) RejectedExecutionException when exit at some times

2016-08-02 Thread Weizhong (JIRA)
Weizhong created SPARK-16852: Summary: RejectedExecutionException when exit at some times Key: SPARK-16852 URL: https://issues.apache.org/jira/browse/SPARK-16852 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403677#comment-15403677 ] Apache Spark commented on SPARK-16851: -- User 'zhengruifeng' has created a pull reque

[jira] [Assigned] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16851: Assignee: (was: Apache Spark) > Incorrect threshould length in 'setThresholds()' evoke

[jira] [Assigned] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16851: Assignee: Apache Spark > Incorrect threshould length in 'setThresholds()' evoke Exception

[jira] [Created] (SPARK-16853) Analysis error for DataSet typed selection

2016-08-02 Thread Sean Zhong (JIRA)
Sean Zhong created SPARK-16853: -- Summary: Analysis error for DataSet typed selection Key: SPARK-16853 URL: https://issues.apache.org/jira/browse/SPARK-16853 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15541) SparkContext.stop throws error

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403794#comment-15403794 ] Apache Spark commented on SPARK-15541: -- User 'maver1ck' has created a pull request f

[jira] [Created] (SPARK-16854) mapWithState Support for Python

2016-08-02 Thread Boaz (JIRA)
Boaz created SPARK-16854: Summary: mapWithState Support for Python Key: SPARK-16854 URL: https://issues.apache.org/jira/browse/SPARK-16854 Project: Spark Issue Type: Task Components: PySpar

[jira] [Resolved] (SPARK-16850) Improve error message for greatest/least

2016-08-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16850. - Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14453 [https://githu

[jira] [Updated] (SPARK-16850) Improve error message for greatest/least

2016-08-02 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-16850: Assignee: Peter Lee > Improve error message for greatest/least > --

[jira] [Created] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-16855: --- Summary: move Greatest and Least from conditionalExpressions.scala to arithmetic.scala Key: SPARK-16855 URL: https://issues.apache.org/jira/browse/SPARK-16855 Project:

[jira] [Commented] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403825#comment-15403825 ] Apache Spark commented on SPARK-16855: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16855: Assignee: Apache Spark (was: Wenchen Fan) > move Greatest and Least from conditionalExpre

[jira] [Assigned] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16855: Assignee: Wenchen Fan (was: Apache Spark) > move Greatest and Least from conditionalExpre

[jira] [Closed] (SPARK-3692) RBF Kernel implementation to SVM

2016-08-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath closed SPARK-3692. - Resolution: Duplicate > RBF Kernel implementation to SVM > > >

[jira] [Commented] (SPARK-8779) Add documentation for Python's FP-growth

2016-08-02 Thread Jagadeesan A S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403985#comment-15403985 ] Jagadeesan A S commented on SPARK-8779: --- Hi : I was looking at this JIRA and found a

[jira] [Commented] (SPARK-16840) Please save the aggregate term frequencies as part of the NaiveBayesModel

2016-08-02 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404006#comment-15404006 ] Barry Becker commented on SPARK-16840: -- I tried adding the aggregated data to the mo

[jira] [Commented] (SPARK-16802) joins.LongToUnsafeRowMap crashes with ArrayIndexOutOfBoundsException

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404009#comment-15404009 ] Sean Owen commented on SPARK-16802: --- Do you have a particular fix that addressed it [~w

[jira] [Resolved] (SPARK-8779) Add documentation for Python's FP-growth

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8779. -- Resolution: Duplicate Yes, I think I'm going to treat that as addressing this issue. > Add documentatio

[jira] [Commented] (SPARK-16854) mapWithState Support for Python

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404017#comment-15404017 ] Sean Owen commented on SPARK-16854: --- I think there was some opinion to not implement al

[jira] [Commented] (SPARK-16852) RejectedExecutionException when exit at some times

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404020#comment-15404020 ] Sean Owen commented on SPARK-16852: --- This is an effect rather than a cause. Can you say

[jira] [Commented] (SPARK-16832) CrossValidator and TrainValidationSplit are not random without seed

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404027#comment-15404027 ] Sean Owen commented on SPARK-16832: --- Hm, yeah I really think the default should be "Non

[jira] [Updated] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-16851: Assignee: zhengruifeng > Incorrect threshould length in 'setThresholds()' evoke Exception > --

[jira] [Resolved] (SPARK-16851) Incorrect threshould length in 'setThresholds()' evoke Exception

2016-08-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-16851. - Resolution: Fixed Fix Version/s: 2.1.0 > Incorrect threshould length in 'setThresholds()'

[jira] [Commented] (SPARK-16667) Spark driver executor dont release unused memory

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404070#comment-15404070 ] Luis Angel Hernández Acosta commented on SPARK-16667: - The problem is

[jira] [Resolved] (SPARK-16558) examples/mllib/LDAExample should use MLVector instead of MLlib Vector

2016-08-02 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-16558. - Resolution: Fixed Assignee: Xusen Yin Fix Version/s: 2.1.0 2.0.

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404114#comment-15404114 ] Maciej Bryński commented on SPARK-16320: [~clockfly] I tested your patch. Results

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Attachment: (was: spark1.6-ui.png) > Spark 2.0 slower than 1.6 when querying nested colum

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Attachment: spark2-ui.png spark1.6-ui.png > Spark 2.0 slower than 1.6 when qu

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Attachment: spark1.6-ui.png > Spark 2.0 slower than 1.6 when querying nested columns > --

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Attachment: spark1.6-ui.png > Spark 2.0 slower than 1.6 when querying nested columns > --

[jira] [Updated] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-16320: --- Attachment: (was: spark1.6-ui.png) > Spark 2.0 slower than 1.6 when querying nested colum

[jira] [Commented] (SPARK-16667) Spark driver executor dont release unused memory

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404140#comment-15404140 ] Sean Owen commented on SPARK-16667: --- That's not what you're reporting above though. You

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404114#comment-15404114 ] Maciej Bryński edited comment on SPARK-16320 at 8/2/16 3:05 PM: ---

[jira] [Commented] (SPARK-16852) RejectedExecutionException when exit at some times

2016-08-02 Thread Weizhong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404166#comment-15404166 ] Weizhong commented on SPARK-16852: -- I run 2T tpcds, and some times will print the stack.

[jira] [Updated] (SPARK-14453) Remove SPARK_JAVA_OPTS environment variable

2016-08-02 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-14453: Summary: Remove SPARK_JAVA_OPTS environment variable (was: Consider removing SPARK_JAVA_OP

[jira] [Updated] (SPARK-14453) Remove SPARK_JAVA_OPTS environment variable

2016-08-02 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-14453: Issue Type: Task (was: Bug) > Remove SPARK_JAVA_OPTS environment variable > --

[jira] [Commented] (SPARK-14453) Remove SPARK_JAVA_OPTS environment variable

2016-08-02 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404175#comment-15404175 ] Jacek Laskowski commented on SPARK-14453: - Is anyone working on it? Just found fe

[jira] [Updated] (SPARK-12650) No means to specify Xmx settings for SparkSubmit in cluster deploy mode for Spark on YARN

2016-08-02 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-12650: Summary: No means to specify Xmx settings for SparkSubmit in cluster deploy mode for Spark

[jira] [Updated] (SPARK-12650) No means to specify Xmx settings for spark-submit in cluster deploy mode for Spark on YARN

2016-08-02 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Laskowski updated SPARK-12650: Summary: No means to specify Xmx settings for spark-submit in cluster deploy mode for Spark

[jira] [Commented] (SPARK-16854) mapWithState Support for Python

2016-08-02 Thread Boaz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404217#comment-15404217 ] Boaz commented on SPARK-16854: -- IMHO, streaming in python would be incomplete without mapWit

[jira] [Resolved] (SPARK-15541) SparkContext.stop throws error

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15541. --- Resolution: Resolved Assignee: Maciej Bryński Resolved by https://github.com/apache/spark/pull/

[jira] [Updated] (SPARK-16520) Link executors to corresponding worker pages

2016-08-02 Thread Tao Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Lin updated SPARK-16520: Priority: Major (was: Minor) > Link executors to corresponding worker pages >

[jira] [Created] (SPARK-16856) Link application summary page and detail page to the master page

2016-08-02 Thread Tao Lin (JIRA)
Tao Lin created SPARK-16856: --- Summary: Link application summary page and detail page to the master page Key: SPARK-16856 URL: https://issues.apache.org/jira/browse/SPARK-16856 Project: Spark Issue

[jira] [Resolved] (SPARK-16822) Support latex in scaladoc with MathJax

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16822. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14438 [https://github.co

[jira] [Updated] (SPARK-16822) Support latex in scaladoc with MathJax

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16822: -- Assignee: Shuai Lin > Support latex in scaladoc with MathJax > -- >

[jira] [Resolved] (SPARK-16837) TimeWindow incorrectly drops slideDuration in constructors

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16837. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull request

[jira] [Updated] (SPARK-16837) TimeWindow incorrectly drops slideDuration in constructors

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16837: -- Assignee: Tom Magrino > TimeWindow incorrectly drops slideDuration in constructors > --

[jira] [Resolved] (SPARK-16835) LinearRegression LogisticRegression AFTSuvivalRegression should unpersist input training data when exception throws

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16835. --- Resolution: Won't Fix > LinearRegression LogisticRegression AFTSuvivalRegression should unpersist >

[jira] [Assigned] (SPARK-16856) Link application summary page and detail page to the master page

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16856: Assignee: Apache Spark > Link application summary page and detail page to the master page

[jira] [Assigned] (SPARK-16856) Link application summary page and detail page to the master page

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16856: Assignee: (was: Apache Spark) > Link application summary page and detail page to the m

[jira] [Commented] (SPARK-16856) Link application summary page and detail page to the master page

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404331#comment-15404331 ] Apache Spark commented on SPARK-16856: -- User 'nblintao' has created a pull request f

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404361#comment-15404361 ] Michael Allman commented on SPARK-16320: [~maver1ck] I'm having trouble reproduci

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404369#comment-15404369 ] Yin Huai edited comment on SPARK-16320 at 8/2/16 4:52 PM: -- [~mav

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404369#comment-15404369 ] Yin Huai commented on SPARK-16320: -- Can you also try https://github.com/apache/spark/pul

[jira] [Resolved] (SPARK-16062) PySpark SQL python-only UDTs don't work well

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-16062. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull reque

[jira] [Resolved] (SPARK-15989) PySpark SQL python-only UDTs don't support nested types

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15989. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull reque

[jira] [Resolved] (SPARK-16836) Hive date/time function error

2016-08-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16836. - Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.1.0

[jira] [Created] (SPARK-16857) CrossValidator and KMeans throws IllegalArgumentException

2016-08-02 Thread Ryan Claussen (JIRA)
Ryan Claussen created SPARK-16857: - Summary: CrossValidator and KMeans throws IllegalArgumentException Key: SPARK-16857 URL: https://issues.apache.org/jira/browse/SPARK-16857 Project: Spark I

[jira] [Updated] (SPARK-16850) Improve error message for greatest/least

2016-08-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16850: Fix Version/s: 2.0.1 > Improve error message for greatest/least > -

[jira] [Resolved] (SPARK-16816) Add documentation to create JavaSparkContext from SparkSession

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16816. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14436 [https://github.co

[jira] [Updated] (SPARK-16816) Add documentation to create JavaSparkContext from SparkSession

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16816: -- Assignee: sandeep purohit > Add documentation to create JavaSparkContext from SparkSession > --

[jira] [Updated] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-15639: --- Target Version/s: 2.0.1 > Try to push down filter at RowGroups level for parquet reader > ---

[jira] [Updated] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-08-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-15639: --- Priority: Blocker (was: Major) > Try to push down filter at RowGroups level for parquet reader > ---

[jira] [Created] (SPARK-16858) Removal of TestHiveSharedState

2016-08-02 Thread Xiao Li (JIRA)
Xiao Li created SPARK-16858: --- Summary: Removal of TestHiveSharedState Key: SPARK-16858 URL: https://issues.apache.org/jira/browse/SPARK-16858 Project: Spark Issue Type: Improvement Compon

[jira] [Assigned] (SPARK-16858) Removal of TestHiveSharedState

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16858: Assignee: Apache Spark > Removal of TestHiveSharedState > -- >

[jira] [Commented] (SPARK-16858) Removal of TestHiveSharedState

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404464#comment-15404464 ] Apache Spark commented on SPARK-16858: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-16858) Removal of TestHiveSharedState

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16858: Assignee: (was: Apache Spark) > Removal of TestHiveSharedState > -

[jira] [Updated] (SPARK-16796) Visible passwords on Spark environment page

2016-08-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16796: -- Assignee: Artur > Visible passwords on Spark environment page > ---

[jira] [Resolved] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-16855. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > move Greatest and Least

[jira] [Updated] (SPARK-16855) move Greatest and Least from conditionalExpressions.scala to arithmetic.scala

2016-08-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16855: Fix Version/s: (was: 2.0.1) > move Greatest and Least from conditionalExpressions.scala to arit

[jira] [Created] (SPARK-16859) History Server storage information is missing

2016-08-02 Thread Andrey Ivanov (JIRA)
Andrey Ivanov created SPARK-16859: - Summary: History Server storage information is missing Key: SPARK-16859 URL: https://issues.apache.org/jira/browse/SPARK-16859 Project: Spark Issue Type: B

[jira] [Resolved] (SPARK-6399) Code compiled against 1.3.0 may not run against older Spark versions

2016-08-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-6399. --- Resolution: Won't Fix I think at this point it's pretty clear we won't do anything here. > Co

[jira] [Updated] (SPARK-16859) History Server storage information is missing

2016-08-02 Thread Andrey Ivanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrey Ivanov updated SPARK-16859: -- Description: It looks like job history storage tab in history server is broken for completed j

[jira] [Commented] (SPARK-16838) Add PMML export for ML KMeans in PySpark

2016-08-02 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404591#comment-15404591 ] Gayathri Murali commented on SPARK-16838: - I can work on this > Add PMML export

[jira] [Resolved] (SPARK-16787) SparkContext.addFile() should not fail if called twice with the same file

2016-08-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-16787. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull reque

[jira] [Commented] (SPARK-16802) joins.LongToUnsafeRowMap crashes with ArrayIndexOutOfBoundsException

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404611#comment-15404611 ] Apache Spark commented on SPARK-16802: -- User 'davies' has created a pull request for

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404619#comment-15404619 ] Maciej Bryński commented on SPARK-16320: Yes. That's it. With this PR Spark 2.0 i

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404620#comment-15404620 ] Maciej Bryński commented on SPARK-16320: I think that problem is already resolved

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404646#comment-15404646 ] Maciej Bryński commented on SPARK-16320: [~michael], [~yhuai] I think this is sma

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404646#comment-15404646 ] Maciej Bryński edited comment on SPARK-16320 at 8/2/16 7:37 PM: ---

[jira] [Comment Edited] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404646#comment-15404646 ] Maciej Bryński edited comment on SPARK-16320 at 8/2/16 7:38 PM: ---

[jira] [Commented] (SPARK-16321) Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

2016-08-02 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404652#comment-15404652 ] Xiao Li commented on SPARK-16321: - Can you set `spark.sql.parquet.enableVectorizedReader

[jira] [Created] (SPARK-16860) UDT Stringification Incorrect in PySpark

2016-08-02 Thread Vladimir Feinberg (JIRA)
Vladimir Feinberg created SPARK-16860: - Summary: UDT Stringification Incorrect in PySpark Key: SPARK-16860 URL: https://issues.apache.org/jira/browse/SPARK-16860 Project: Spark Issue Type

[jira] [Commented] (SPARK-16321) Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

2016-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404654#comment-15404654 ] Maciej Bryński commented on SPARK-16321: [~smilegator] spark.sql.parquet.filterPu

[jira] [Assigned] (SPARK-16321) Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16321: Assignee: Apache Spark > Spark 2.0 performance drop vs Spark 1.6 when reading parquet file

[jira] [Commented] (SPARK-16320) Spark 2.0 slower than 1.6 when querying nested columns

2016-08-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404675#comment-15404675 ] Apache Spark commented on SPARK-16320: -- User 'maver1ck' has created a pull request f

  1   2   >