[jira] [Commented] (SPARK-19992) spark-submit on deployment-mode cluster

2017-03-23 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939867#comment-15939867 ] Saisai Shao commented on SPARK-19992: - Sorry I cannot give you valid suggestions without knowing your

[jira] [Commented] (SPARK-19992) spark-submit on deployment-mode cluster

2017-03-23 Thread narendra maru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939857#comment-15939857 ] narendra maru commented on SPARK-19992: --- Thanks Sean and Saisai for your reply. I followed the same

[jira] [Commented] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939852#comment-15939852 ] Takeshi Yamamuro commented on SPARK-20073: -- yea, you need an alias for key; {code} scala> val

[jira] [Updated] (SPARK-19612) Tests failing with timeout

2017-03-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-19612: --- Affects Version/s: (was: 2.1.1) 2.2.0 > Tests failing with

[jira] [Reopened] (SPARK-19612) Tests failing with timeout

2017-03-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout reopened SPARK-19612: This seems to be back: saw two recently:

[jira] [Commented] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Everett Anderson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939844#comment-15939844 ] Everett Anderson commented on SPARK-20073: -- [~maropu] Hi! Thanks for taking a look. What's the

[jira] [Commented] (SPARK-20068) Twenty-two column coalesce has pool performance when codegen is open

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939836#comment-15939836 ] Takeshi Yamamuro commented on SPARK-20068: -- If you can, you'd better use a newer Spark. I

[jira] [Commented] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939831#comment-15939831 ] Takeshi Yamamuro commented on SPARK-20073: -- If you use `===` instead of `<=>`, you get a warning

[jira] [Comment Edited] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939822#comment-15939822 ] Takeshi Yamamuro edited comment on SPARK-20073 at 3/24/17 5:16 AM: --- I

[jira] [Comment Edited] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939822#comment-15939822 ] Takeshi Yamamuro edited comment on SPARK-20073 at 3/24/17 5:15 AM: --- I

[jira] [Commented] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939822#comment-15939822 ] Takeshi Yamamuro commented on SPARK-20073: -- I think this is the known issue and you need to

[jira] [Commented] (SPARK-20068) Twenty-two column coalesce has pool performance when codegen is open

2017-03-23 Thread QQShu1 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939820#comment-15939820 ] QQShu1 commented on SPARK-20068: @Takeshi Yamamuro thanks for your answer. V2.1 doesn't have this issue.

[jira] [Assigned] (SPARK-19959) df[java.lang.Long].collect throws NullPointerException if df includes null

2017-03-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-19959: --- Assignee: Kazuaki Ishizaki > df[java.lang.Long].collect throws NullPointerException if df

[jira] [Resolved] (SPARK-19959) df[java.lang.Long].collect throws NullPointerException if df includes null

2017-03-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-19959. - Resolution: Fixed Fix Version/s: 2.2.0 2.0.3 2.1.1

[jira] [Updated] (SPARK-17204) Spark 2.0 off heap RDD persistence with replication factor 2 leads to in-memory data corruption

2017-03-23 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17204: Fix Version/s: 2.0.3 > Spark 2.0 off heap RDD persistence with replication factor 2 leads to >

[jira] [Assigned] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20043: Assignee: (was: Apache Spark) > CrossValidatorModel loader does not recognize

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939774#comment-15939774 ] Apache Spark commented on SPARK-20043: -- User 'facaiy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20043: Assignee: Apache Spark > CrossValidatorModel loader does not recognize impurity "Gini"

[jira] [Assigned] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20009: Assignee: (was: Apache Spark) > Use user-friendly DDL formats for defining a schema

[jira] [Commented] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939748#comment-15939748 ] Apache Spark commented on SPARK-20009: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20009) Use user-friendly DDL formats for defining a schema in user-facing APIs

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20009: Assignee: Apache Spark > Use user-friendly DDL formats for defining a schema in

[jira] [Commented] (SPARK-14083) Analyze JVM bytecode and turn closures into Catalyst expressions

2017-03-23 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939625#comment-15939625 ] Liang-Chi Hsieh commented on SPARK-14083: - [~maropu] Thanks! That's great! > Analyze JVM

[jira] [Created] (SPARK-20079) Re registration of AM hangs spark cluster in yarn-client mode

2017-03-23 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-20079: --- Summary: Re registration of AM hangs spark cluster in yarn-client mode Key: SPARK-20079 URL: https://issues.apache.org/jira/browse/SPARK-20079 Project: Spark

[jira] [Commented] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-03-23 Thread Nathan Howell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939572#comment-15939572 ] Nathan Howell commented on SPARK-19641: --- Please pick it up if you have cycles and want to take it

[jira] [Comment Edited] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accept

2017-03-23 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939571#comment-15939571 ] Yan Facai (颜发才) edited comment on SPARK-20043 at 3/24/17 2:15 AM: -- The

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939571#comment-15939571 ] Yan Facai (颜发才) commented on SPARK-20043: - The bug can be reproduced by: ```scala test("cross

[jira] [Commented] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-03-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939570#comment-15939570 ] Hyukjin Kwon commented on SPARK-19641: -- Not sure, but IMHO it doesn't sound like a super urgent one because

[jira] [Issue Comment Deleted] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are

2017-03-23 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Facai (颜发才) updated SPARK-20043: Comment: was deleted (was: [~zsellami] could you give an example of your code? I try to

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939560#comment-15939560 ] Yan Facai (颜发才) commented on SPARK-20043: - [~zsellami] could you give an example of your code? I

[jira] [Commented] (SPARK-19970) Table owner should be USER instead of PRINCIPAL in kerberized clusters

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939551#comment-15939551 ] Apache Spark commented on SPARK-19970: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Resolved] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19636. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17108

[jira] [Commented] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-03-23 Thread Nathan Howell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939547#comment-15939547 ] Nathan Howell commented on SPARK-19641: --- [~hyukjin.kwon], I'm super busy through next Tuesday. I

[jira] [Commented] (SPARK-20043) CrossValidatorModel loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-23 Thread 颜发才
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939533#comment-15939533 ] Yan Facai (颜发才) commented on SPARK-20043: - Perhaps it's better to convert impurity Type after

[jira] [Commented] (SPARK-19641) JSON schema inference in DROPMALFORMED mode produces incorrect schema

2017-03-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939530#comment-15939530 ] Hyukjin Kwon commented on SPARK-19641: -- [~NathanHowell], I just happened to revisit this. Are you

[jira] [Resolved] (SPARK-19625) Authorization Support(on all operations not only DDL) in Spark Sql version 2.1.0

2017-03-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19625. -- Resolution: Duplicate It sounds like a duplicate of SPARK-8321. Both JIRAs that the PRs indicate

[jira] [Resolved] (SPARK-19567) Support some Schedulable variables immutability and access

2017-03-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-19567. Resolution: Fixed Assignee: Eren Avsarogullari Fix Version/s: 2.2.0 >

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-03-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939511#comment-15939511 ] Kazuaki Ishizaki commented on SPARK-19372: -- I implemented the code to take care of it, and am

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-03-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939480#comment-15939480 ] Hyukjin Kwon commented on SPARK-19372: -- I have seen this too before. > Code generation for Filter

[jira] [Resolved] (SPARK-10849) Allow user to specify database column type for data frame fields when writing data to jdbc data sources.

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-10849. - Resolution: Fixed Assignee: Suresh Thalamati Fix Version/s: 2.2.0 > Allow user to

[jira] [Resolved] (SPARK-11300) Support for string length when writing to JDBC

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-11300. - Resolution: Duplicate > Support for string length when writing to JDBC >

[jira] [Commented] (SPARK-19496) to_date with format has weird behavior

2017-03-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939508#comment-15939508 ] Josh Rosen commented on SPARK-19496: Let's make sure to document this clearly in the release notes. I

[jira] [Updated] (SPARK-19496) to_date with format has weird behavior

2017-03-23 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-19496: --- Labels: release-notes (was: ) > to_date with format has weird behavior >

[jira] [Reopened] (SPARK-10101) Spark JDBC writer mapping String to TEXT or VARCHAR

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reopened SPARK-10101: - > Spark JDBC writer mapping String to TEXT or VARCHAR > ---

[jira] [Resolved] (SPARK-10101) Spark JDBC writer mapping String to TEXT or VARCHAR

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-10101. - Resolution: Duplicate Fix Version/s: 2.2.0 > Spark JDBC writer mapping String to TEXT or VARCHAR

[jira] [Issue Comment Deleted] (SPARK-10101) Spark JDBC writer mapping String to TEXT or VARCHAR

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-10101: Comment: was deleted (was: This has been resolved in the master. If you still hit any bug, please open a

[jira] [Updated] (SPARK-19868) conflict TasksetManager lead to spark stopped

2017-03-23 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-19868: --- Target Version/s: 2.2.0 > conflict TasksetManager lead to spark stopped >

[jira] [Commented] (SPARK-20078) Mesos executor configurability for task name and labels

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939436#comment-15939436 ] Apache Spark commented on SPARK-20078: -- User 'kalvinnchau' has created a pull request for this

[jira] [Assigned] (SPARK-20078) Mesos executor configurability for task name and labels

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20078: Assignee: (was: Apache Spark) > Mesos executor configurability for task name and

[jira] [Assigned] (SPARK-20078) Mesos executor configurability for task name and labels

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20078: Assignee: Apache Spark > Mesos executor configurability for task name and labels >

[jira] [Created] (SPARK-20078) Mesos executor configurability for task name and labels

2017-03-23 Thread Kalvin Chau (JIRA)
Kalvin Chau created SPARK-20078: --- Summary: Mesos executor configurability for task name and labels Key: SPARK-20078 URL: https://issues.apache.org/jira/browse/SPARK-20078 Project: Spark Issue

[jira] [Updated] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-23 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-20050: Description: I use Kafka 0.10 DirectStream with properties 'enable.auto.commit=false' and call

[jira] [Updated] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-23 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-20050: Description: I use Kafka 0.10 DirectStream with properties 'enable.auto.commit=false' and call

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-03-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939347#comment-15939347 ] Xiao Li commented on SPARK-13333: - [~josephkb] In the SQL specification, the set operations are merging

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939323#comment-15939323 ] Joseph K. Bradley commented on SPARK-13333: --- [~smilegator] I wouldn't call that result "right."

[jira] [Assigned] (SPARK-7200) Tungsten test suites should fail if memory leak is detected

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7200: --- Assignee: Apache Spark > Tungsten test suites should fail if memory leak is detected >

[jira] [Assigned] (SPARK-7200) Tungsten test suites should fail if memory leak is detected

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7200: --- Assignee: (was: Apache Spark) > Tungsten test suites should fail if memory leak is

[jira] [Commented] (SPARK-7200) Tungsten test suites should fail if memory leak is detected

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939318#comment-15939318 ] Apache Spark commented on SPARK-7200: - User 'jsoltren' has created a pull request for this issue:

[jira] [Updated] (SPARK-19970) Table owner should be USER instead of PRINCIPAL in kerberized clusters

2017-03-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-19970: --- Fix Version/s: 2.1.1 > Table owner should be USER instead of PRINCIPAL in kerberized

[jira] [Created] (SPARK-20077) Documentation for ml.stats.Correlation

2017-03-23 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-20077: -- Summary: Documentation for ml.stats.Correlation Key: SPARK-20077 URL: https://issues.apache.org/jira/browse/SPARK-20077 Project: Spark Issue Type:

[jira] [Created] (SPARK-20076) Python interface for ml.stats.Correlation

2017-03-23 Thread Timothy Hunter (JIRA)
Timothy Hunter created SPARK-20076: -- Summary: Python interface for ml.stats.Correlation Key: SPARK-20076 URL: https://issues.apache.org/jira/browse/SPARK-20076 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-19876) Add OneTime trigger executor

2017-03-23 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-19876. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17219

[jira] [Created] (SPARK-20075) Support classifier, packaging in Maven coordinates

2017-03-23 Thread Sean Owen (JIRA)
Sean Owen created SPARK-20075: - Summary: Support classifier, packaging in Maven coordinates Key: SPARK-20075 URL: https://issues.apache.org/jira/browse/SPARK-20075 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18364) Expose metrics for YarnShuffleService

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939194#comment-15939194 ] Apache Spark commented on SPARK-18364: -- User 'ash211' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18364) Expose metrics for YarnShuffleService

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18364: Assignee: Apache Spark > Expose metrics for YarnShuffleService >

[jira] [Assigned] (SPARK-18364) Expose metrics for YarnShuffleService

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18364: Assignee: (was: Apache Spark) > Expose metrics for YarnShuffleService >

[jira] [Commented] (SPARK-18085) Better History Server scalability for many / large applications

2017-03-23 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939191#comment-15939191 ] Marcelo Vanzin commented on SPARK-18085: I'm also keeping the current activity at

[jira] [Updated] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Everett Anderson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Everett Anderson updated SPARK-20073: - Description: It appears that if you try to join tables A and B when B is derived from A

[jira] [Commented] (SPARK-10816) EventTime based sessionization

2017-03-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939156#comment-15939156 ] Michael Armbrust commented on SPARK-10816: -- Just a quick note for people interested in this

[jira] [Commented] (SPARK-19372) Code generation for Filter predicate including many OR conditions exceeds JVM method size limit

2017-03-23 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939072#comment-15939072 ] Andrew Ash commented on SPARK-19372: I've seen this as well on parquet files. > Code generation for

[jira] [Created] (SPARK-20074) Make buffer size in unsafe external sorter configurable

2017-03-23 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-20074: --- Summary: Make buffer size in unsafe external sorter configurable Key: SPARK-20074 URL: https://issues.apache.org/jira/browse/SPARK-20074 Project: Spark Issue

[jira] [Comment Edited] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-23 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938988#comment-15938988 ] Sasaki Toru edited comment on SPARK-20050 at 3/23/17 6:45 PM: -- Thank you for

[jira] [Commented] (SPARK-20050) Kafka 0.10 DirectStream doesn't commit last processed batch's offset when graceful shutdown

2017-03-23 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938988#comment-15938988 ] Sasaki Toru commented on SPARK-20050: - Thank you for your comment, but I can't understand your

[jira] [Updated] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19791: -- Target Version/s: 2.2.0 > Add doc and example for fpgrowth >

[jira] [Updated] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19791: -- Shepherd: Joseph K. Bradley > Add doc and example for fpgrowth >

[jira] [Assigned] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19791: - Assignee: yuhao yang > Add doc and example for fpgrowth >

[jira] [Updated] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Everett Anderson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Everett Anderson updated SPARK-20073: - Labels: correctness (was: ) > Unexpected Cartesian product when using eqNullSafe in

[jira] [Commented] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion

2017-03-23 Thread Travis Hegner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938813#comment-15938813 ] Travis Hegner commented on SPARK-15176: --- I'd like to have this feature available as well for a

[jira] [Commented] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Everett Anderson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938810#comment-15938810 ] Everett Anderson commented on SPARK-20073: -- With the local master in spark-shell and cross joins

[jira] [Created] (SPARK-20073) Unexpected Cartesian product when using eqNullSafe in join with a derived table

2017-03-23 Thread Everett Anderson (JIRA)
Everett Anderson created SPARK-20073: Summary: Unexpected Cartesian product when using eqNullSafe in join with a derived table Key: SPARK-20073 URL: https://issues.apache.org/jira/browse/SPARK-20073

[jira] [Resolved] (SPARK-20066) Add explicit SecurityManager(SparkConf) constructor for backwards compatibility with Java

2017-03-23 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Grover resolved SPARK-20066. - Resolution: Won't Fix > Add explicit SecurityManager(SparkConf) constructor for backwards >

[jira] [Updated] (SPARK-19591) Add sample weights to decision trees

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19591: -- Shepherd: Joseph K. Bradley > Add sample weights to decision trees >

[jira] [Assigned] (SPARK-19591) Add sample weights to decision trees

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19591: - Assignee: Seth Hendrickson > Add sample weights to decision trees >

[jira] [Assigned] (SPARK-19981) Sort-Merge join inserts shuffles when joining dataframes with aliased columns

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19981: Assignee: Apache Spark > Sort-Merge join inserts shuffles when joining dataframes with

[jira] [Commented] (SPARK-19981) Sort-Merge join inserts shuffles when joining dataframes with aliased columns

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938718#comment-15938718 ] Apache Spark commented on SPARK-19981: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19981) Sort-Merge join inserts shuffles when joining dataframes with aliased columns

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19981: Assignee: (was: Apache Spark) > Sort-Merge join inserts shuffles when joining

[jira] [Commented] (SPARK-19803) Flaky BlockManagerProactiveReplicationSuite tests

2017-03-23 Thread Shubham Chopra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938676#comment-15938676 ] Shubham Chopra commented on SPARK-19803: Any feedback on the PR -

[jira] [Updated] (SPARK-20037) impossible to set kafka offsets using kafka 0.10 and spark 2.0.0

2017-03-23 Thread Daniel Nuriyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Nuriyev updated SPARK-20037: --- Attachment: offsets.png > impossible to set kafka offsets using kafka 0.10 and spark 2.0.0 >

[jira] [Commented] (SPARK-20037) impossible to set kafka offsets using kafka 0.10 and spark 2.0.0

2017-03-23 Thread Daniel Nuriyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938600#comment-15938600 ] Daniel Nuriyev commented on SPARK-20037: This is an exception from partition 1 of another topic:

[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-23 Thread chris snow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938477#comment-15938477 ] chris snow commented on SPARK-20072: Fair enough. Though this did cause me some grief - I had read

[jira] [Commented] (SPARK-20071) StringIndexer overflows Kryo serialization buffer when run on column with many long distinct values

2017-03-23 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938475#comment-15938475 ] Barry Becker commented on SPARK-20071: -- Yes. I agree. I wanted to report the issue, but wasn't sure

[jira] [Commented] (SPARK-20068) Twenty-two column coalesce has pool performance when codegen is open

2017-03-23 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938471#comment-15938471 ] Takeshi Yamamuro commented on SPARK-20068: -- Have you tried v2.1? The latest also has the same

[jira] [Commented] (SPARK-20036) impossible to read a whole kafka topic using kafka 0.10 and spark 2.0.0

2017-03-23 Thread Daniel Nuriyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938472#comment-15938472 ] Daniel Nuriyev commented on SPARK-20036: To provide more info I am attaching the pom.xml and the

[jira] [Updated] (SPARK-20036) impossible to read a whole kafka topic using kafka 0.10 and spark 2.0.0

2017-03-23 Thread Daniel Nuriyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Nuriyev updated SPARK-20036: --- Attachment: Main.java pom.xml > impossible to read a whole kafka topic using

[jira] [Commented] (SPARK-19716) Dataset should allow by-name resolution for struct type elements in array

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938438#comment-15938438 ] Apache Spark commented on SPARK-19716: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19716) Dataset should allow by-name resolution for struct type elements in array

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19716: Assignee: Apache Spark > Dataset should allow by-name resolution for struct type elements

[jira] [Assigned] (SPARK-19716) Dataset should allow by-name resolution for struct type elements in array

2017-03-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19716: Assignee: (was: Apache Spark) > Dataset should allow by-name resolution for struct

[jira] [Commented] (SPARK-20072) Clarify ALS-WR documentation

2017-03-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15938415#comment-15938415 ] Sean Owen commented on SPARK-20072: --- I don't think those two wordings differ meaningfully? I think

[jira] [Created] (SPARK-20072) Clarify ALS-WR documentation

2017-03-23 Thread chris snow (JIRA)
chris snow created SPARK-20072: -- Summary: Clarify ALS-WR documentation Key: SPARK-20072 URL: https://issues.apache.org/jira/browse/SPARK-20072 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-20071) StringIndexer overflows Kryo serialization buffer when run on column with many long distinct values

2017-03-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20071: -- Issue Type: Improvement (was: Bug) Not a bug, right? You can effect some of this yourself with

[jira] [Closed] (SPARK-19728) PythonUDF with multiple parents shouldn't be pushed down when used as a predicate

2017-03-23 Thread Maciej Szymkiewicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Szymkiewicz closed SPARK-19728. -- Resolution: Fixed Fix Version/s: 2.2.0 > PythonUDF with multiple parents
