[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2016-02-23 Thread Evan Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160340#comment-15160340 ] Evan Chan commented on SPARK-13219: --- [~smilegator] [~doodlegum] Guys, let me explain

[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2016-02-23 Thread Evan Chan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160333#comment-15160333 ] Evan Chan commented on SPARK-13219: --- Sorry, could you explain how SPARK-12957 affects t

[jira] [Commented] (SPARK-13467) abstract python function to simplify pyspark code

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160330#comment-15160330 ] Apache Spark commented on SPARK-13467: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-13467) abstract python function to simplify pyspark code

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13467: Assignee: (was: Apache Spark) > abstract python function to simplify pyspark code > --

[jira] [Assigned] (SPARK-13467) abstract python function to simplify pyspark code

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13467: Assignee: Apache Spark > abstract python function to simplify pyspark code > -

[jira] [Created] (SPARK-13467) abstract python function to simplify pyspark code

2016-02-23 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-13467: --- Summary: abstract python function to simplify pyspark code Key: SPARK-13467 URL: https://issues.apache.org/jira/browse/SPARK-13467 Project: Spark Issue Type: I

[jira] [Assigned] (SPARK-13466) Don't introduce redundant project with colum pruning rule

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13466: Assignee: Apache Spark > Don't introduce redundant project with colum pruning rule > -

[jira] [Commented] (SPARK-13466) Don't introduce redundant project with colum pruning rule

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160289#comment-15160289 ] Apache Spark commented on SPARK-13466: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-13466) Don't introduce redundant project with colum pruning rule

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13466: Assignee: (was: Apache Spark) > Don't introduce redundant project with colum pruning r

[jira] [Created] (SPARK-13466) Don't introduce redundant project with colum pruning rule

2016-02-23 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-13466: --- Summary: Don't introduce redundant project with colum pruning rule Key: SPARK-13466 URL: https://issues.apache.org/jira/browse/SPARK-13466 Project: Spark

[jira] [Comment Edited] (SPARK-12997) Use cast expression to perform type cast in csv

2016-02-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160270#comment-15160270 ] Hyukjin Kwon edited comment on SPARK-12997 at 2/24/16 6:43 AM:

[jira] [Commented] (SPARK-13450) SortMergeJoin will OOM when join rows have lot of same keys

2016-02-23 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160271#comment-15160271 ] Hong Shen commented on SPARK-13450: --- A join has a lot of rows with the same key. > Sor

[jira] [Commented] (SPARK-12997) Use cast expression to perform type cast in csv

2016-02-23 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160270#comment-15160270 ] Hyukjin Kwon commented on SPARK-12997: -- If I got this correctly, I think the issue i

[jira] [Assigned] (SPARK-13465) Add a task failure listener to TaskContext

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13465: Assignee: Apache Spark (was: Reynold Xin) > Add a task failure listener to TaskContext >

[jira] [Commented] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160260#comment-15160260 ] Xiao Li commented on SPARK-13307: - You need to check the plan and check the join type it

[jira] [Commented] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160256#comment-15160256 ] Xiao Li commented on SPARK-13307: - First, I am not sure if usage of broadcastjoin makes s

[jira] [Assigned] (SPARK-13465) Add a task failure listener to TaskContext

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13465: Assignee: Reynold Xin (was: Apache Spark) > Add a task failure listener to TaskContext >

[jira] [Commented] (SPARK-13465) Add a task failure listener to TaskContext

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160251#comment-15160251 ] Apache Spark commented on SPARK-13465: -- User 'rxin' has created a pull request for t

[jira] [Updated] (SPARK-13465) Add a task failure listener to TaskContext

2016-02-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13465: Description: TaskContext supports task completion callback, which gets called regardless of task fa

[jira] [Created] (SPARK-13465) Add a task failure listener to TaskContext

2016-02-23 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-13465: --- Summary: Add a task failure listener to TaskContext Key: SPARK-13465 URL: https://issues.apache.org/jira/browse/SPARK-13465 Project: Spark Issue Type: New Feat

[jira] [Updated] (SPARK-13358) Retrieve grep path when doing Benchmark

2016-02-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13358: Assignee: Liang-Chi Hsieh > Retrieve grep path when doing Benchmark > -

[jira] [Closed] (SPARK-12907) Use BitSet to represent null fields in ColumnVector

2016-02-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki closed SPARK-12907. Resolution: Won't Fix > Use BitSet to represent null fields in ColumnVector > -

[jira] [Resolved] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13431. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11331 [https://github.

[jira] [Commented] (SPARK-13219) Pushdown predicate propagation in SparkSQL with join

2016-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160187#comment-15160187 ] Xiao Li commented on SPARK-13219: - Thank you! Could you hold it now? I think we can do it

[jira] [Commented] (SPARK-12532) Join-key Pushdown via Predicate Transitivity

2016-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160186#comment-15160186 ] Xiao Li commented on SPARK-12532: - Thank you! Could you hold it now? I think we can do it

[jira] [Commented] (SPARK-13421) Make output of a SparkPlan configurable

2016-02-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160157#comment-15160157 ] Kazuaki Ishizaki commented on SPARK-13421: -- I am willing to work for and contrib

[jira] [Commented] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-23 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160151#comment-15160151 ] Xiao Li commented on SPARK-13445: - I will take a look at this. Thanks! > Seleting "data"

[jira] [Commented] (SPARK-1199) Type mismatch in Spark shell when using case class defined in shell

2016-02-23 Thread Prashant Sharma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160147#comment-15160147 ] Prashant Sharma commented on SPARK-1199: Did you try the :paste option ? > Type m

[jira] [Commented] (SPARK-13358) Retrieve grep path when doing Benchmark

2016-02-23 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160145#comment-15160145 ] Liang-Chi Hsieh commented on SPARK-13358: - [~davies] Can you help update the Assi

[jira] [Assigned] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13464: Assignee: Apache Spark > Fix failed test test_reduce_by_key_and_window_with_none_invFunc i

[jira] [Assigned] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13464: Assignee: (was: Apache Spark) > Fix failed test test_reduce_by_key_and_window_with_non

[jira] [Commented] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160136#comment-15160136 ] Apache Spark commented on SPARK-13464: -- User 'viirya' has created a pull request for

[jira] [Created] (SPARK-13464) Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming

2016-02-23 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-13464: --- Summary: Fix failed test test_reduce_by_key_and_window_with_none_invFunc in pyspark/streaming Key: SPARK-13464 URL: https://issues.apache.org/jira/browse/SPARK-13464

[jira] [Commented] (SPARK-12878) Dataframe fails with nested User Defined Types

2016-02-23 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160119#comment-15160119 ] Jakob Odersky commented on SPARK-12878: --- I just tried your example and get a slight

[jira] [Updated] (SPARK-8369) Support dependency jar and files on HDFS in standalone cluster mode

2016-02-23 Thread Dong Lei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lei updated SPARK-8369: Description: Currently, in standalone cluster mode, spark can take care of the app-jar whether the app-jar

[jira] [Commented] (SPARK-8369) Support dependency jar and files on HDFS in standalone cluster mode

2016-02-23 Thread Dong Lei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15160097#comment-15160097 ] Dong Lei commented on SPARK-8369: - Hi Jim, I was using standalone cluster mode. I will add

[jira] [Resolved] (SPARK-13376) Improve column pruning

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13376. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11256 [https://github.

[jira] [Assigned] (SPARK-13092) Track constraints in ExpressionSet

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13092: Assignee: Michael Armbrust (was: Apache Spark) > Track constraints in ExpressionSet > ---

[jira] [Commented] (SPARK-13092) Track constraints in ExpressionSet

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159988#comment-15159988 ] Apache Spark commented on SPARK-13092: -- User 'marmbrus' has created a pull request f

[jira] [Assigned] (SPARK-13092) Track constraints in ExpressionSet

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13092: Assignee: Apache Spark (was: Michael Armbrust) > Track constraints in ExpressionSet > ---

[jira] [Assigned] (SPARK-13092) Track constraints in ExpressionSet

2016-02-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-13092: Assignee: Michael Armbrust (was: Sameer Agarwal) > Track constraints in Expressio

[jira] [Created] (SPARK-13463) Support Column pruning for Dataset logical plan

2016-02-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-13463: -- Summary: Support Column pruning for Dataset logical plan Key: SPARK-13463 URL: https://issues.apache.org/jira/browse/SPARK-13463 Project: Spark Issue Type: Impro

[jira] [Assigned] (SPARK-13441) NullPointerException when either HADOOP_CONF_DIR or YARN_CONF_DIR is not readable

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13441: Assignee: Apache Spark > NullPointerException when either HADOOP_CONF_DIR or YARN_CONF_DIR

[jira] [Commented] (SPARK-13441) NullPointerException when either HADOOP_CONF_DIR or YARN_CONF_DIR is not readable

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159954#comment-15159954 ] Apache Spark commented on SPARK-13441: -- User 'chtyim' has created a pull request for

[jira] [Assigned] (SPARK-13441) NullPointerException when either HADOOP_CONF_DIR or YARN_CONF_DIR is not readable

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13441: Assignee: (was: Apache Spark) > NullPointerException when either HADOOP_CONF_DIR or YA

[jira] [Comment Edited] (SPARK-12422) Binding Spark Standalone Master to public IP fails

2016-02-23 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159904#comment-15159904 ] Jakob Odersky edited comment on SPARK-12422 at 2/24/16 12:16 AM: --

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2016-02-23 Thread Oscar D. Lara Yejas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159928#comment-15159928 ] Oscar D. Lara Yejas commented on SPARK-9325: Hi, folks. I have created a PR f

[jira] [Assigned] (SPARK-9325) Support `collect` on DataFrame columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9325: --- Assignee: Apache Spark > Support `collect` on DataFrame columns > ---

[jira] [Assigned] (SPARK-9325) Support `collect` on DataFrame columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9325: --- Assignee: (was: Apache Spark) > Support `collect` on DataFrame columns >

[jira] [Commented] (SPARK-9325) Support `collect` on DataFrame columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159917#comment-15159917 ] Apache Spark commented on SPARK-9325: - User 'olarayej' has created a pull request for

[jira] [Updated] (SPARK-11381) Replace example code in mllib-linear-methods.md using include_example

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11381: -- Shepherd: Xusen Yin > Replace example code in mllib-linear-methods.md using include_example > -

[jira] [Resolved] (SPARK-10759) Missing Python code example in model selection user guide

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10759. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11240 [https://g

[jira] [Commented] (SPARK-12422) Binding Spark Standalone Master to public IP fails

2016-02-23 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159904#comment-15159904 ] Jakob Odersky commented on SPARK-12422: --- This blocker issue is quite old now, can y

[jira] [Commented] (SPARK-12877) TrainValidationSplit is missing in pyspark.ml.tuning

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159884#comment-15159884 ] Apache Spark commented on SPARK-12877: -- User 'JeremyNixon' has created a pull reques

[jira] [Assigned] (SPARK-12877) TrainValidationSplit is missing in pyspark.ml.tuning

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12877: Assignee: Apache Spark > TrainValidationSplit is missing in pyspark.ml.tuning > --

[jira] [Assigned] (SPARK-12877) TrainValidationSplit is missing in pyspark.ml.tuning

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12877: Assignee: (was: Apache Spark) > TrainValidationSplit is missing in pyspark.ml.tuning >

[jira] [Resolved] (SPARK-13011) K-means wrapper in SparkR

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-13011. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11124 [https://g

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159862#comment-15159862 ] Stavros Kontopoulos commented on SPARK-13431: - cool! > Maven build fails due

[jira] [Updated] (SPARK-13410) unionAll AnalysisException with DataFrames containing UDT columns.

2016-02-23 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13410: Fix Version/s: 1.6.1 > unionAll AnalysisException with DataFrames containing UDT columns. > ---

[jira] [Updated] (SPARK-13461) Duplicated example code merge and cleanup

2016-02-23 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-13461: -- Description: Merge duplicated code after we finishing the example code substitution. Duplications incl

[jira] [Updated] (SPARK-13461) Duplicated example code merge and cleanup

2016-02-23 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-13461: -- Description: Merge duplicated code after we finishing the example code substitution. Duplications incl

[jira] [Commented] (SPARK-13390) Java Spark createDataFrame with List parameter bug

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159832#comment-15159832 ] Apache Spark commented on SPARK-13390: -- User 'zsxwing' has created a pull request fo

[jira] [Resolved] (SPARK-13373) Generate code for sort merge join

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13373. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11248 [https://github.

[jira] [Commented] (SPARK-12877) TrainValidationSplit is missing in pyspark.ml.tuning

2016-02-23 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159822#comment-15159822 ] Xusen Yin commented on SPARK-12877: --- Yes, we lack TrainValidationSplit in Python. Go ah

[jira] [Created] (SPARK-13462) Vector serialization error in example code of ModelSelectionViaTrainValidationSplitExample and JavaModelSelectionViaTrainValidationSplitExample

2016-02-23 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-13462: - Summary: Vector serialization error in example code of ModelSelectionViaTrainValidationSplitExample and JavaModelSelectionViaTrainValidationSplitExample Key: SPARK-13462 URL: https://i

[jira] [Created] (SPARK-13461) Duplicated example code merge and cleanup

2016-02-23 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-13461: - Summary: Duplicated example code merge and cleanup Key: SPARK-13461 URL: https://issues.apache.org/jira/browse/SPARK-13461 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-10759) Missing Python code example in model selection user guide

2016-02-23 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-10759: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-11337 > Missing Python code example in m

[jira] [Commented] (SPARK-12922) Implement gapply() on DataFrame in SparkR

2016-02-23 Thread Narine Kokhlikyan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159736#comment-15159736 ] Narine Kokhlikyan commented on SPARK-12922: --- Thanks for your quick response [~s

[jira] [Updated] (SPARK-13445) Seleting "data" with window function does not work unless aliased (using PARTITION BY)

2016-02-23 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-13445: - Priority: Critical (was: Major) > Seleting "data" with window function does not work unl

[jira] [Commented] (SPARK-13410) unionAll AnalysisException with DataFrames containing UDT columns.

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159719#comment-15159719 ] Apache Spark commented on SPARK-13410: -- User 'damnMeddlingKid' has created a pull re

[jira] [Commented] (SPARK-6761) Approximate quantile

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159699#comment-15159699 ] Apache Spark commented on SPARK-6761: - User 'mengxr' has created a pull request for th

[jira] [Assigned] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13431: Assignee: (was: Apache Spark) > Maven build fails due to: Method code too large! in Ca

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159682#comment-15159682 ] Davies Liu commented on SPARK-13431: https://github.com/apache/spark/pull/11331 > Ma

[jira] [Assigned] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13431: Assignee: Apache Spark > Maven build fails due to: Method code too large! in Catalyst > --

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159684#comment-15159684 ] Apache Spark commented on SPARK-13431: -- User 'davies' has created a pull request for

[jira] [Commented] (SPARK-5992) Locality Sensitive Hashing (LSH) for MLlib

2016-02-23 Thread Karl Higley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159679#comment-15159679 ] Karl Higley commented on SPARK-5992: I've been working on [a Spark package for approxi

[jira] [Commented] (SPARK-13410) unionAll AnalysisException with DataFrames containing UDT columns.

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159669#comment-15159669 ] Apache Spark commented on SPARK-13410: -- User 'damnMeddlingKid' has created a pull re

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159670#comment-15159670 ] Davies Liu commented on SPARK-13431: I'd like to split ExpressionParser.g, or we can'

[jira] [Commented] (SPARK-13459) Separate Alive and Dead Executors in Executor Totals Table

2016-02-23 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159657#comment-15159657 ] Alex Bozarth commented on SPARK-13459: -- That PR isn't for this Jira, I mixed up my J

[jira] [Commented] (SPARK-13306) Initial implementation for uncorrelated scalar subquery

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159658#comment-15159658 ] Apache Spark commented on SPARK-13306: -- User 'hvanhovell' has created a pull request

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159660#comment-15159660 ] Herman van Hovell commented on SPARK-13431: --- Submitted PR for revert: https://g

[jira] [Assigned] (SPARK-13241) add long--formatted timestamps to org.apache.spark.status.api.v1.ApplicationAttemptInfo

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13241: Assignee: (was: Apache Spark) > add long--formatted timestamps to > org.apache.spark.

[jira] [Commented] (SPARK-13241) add long--formatted timestamps to org.apache.spark.status.api.v1.ApplicationAttemptInfo

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159653#comment-15159653 ] Apache Spark commented on SPARK-13241: -- User 'ajbozarth' has created a pull request

[jira] [Assigned] (SPARK-13241) add long--formatted timestamps to org.apache.spark.status.api.v1.ApplicationAttemptInfo

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13241: Assignee: Apache Spark > add long--formatted timestamps to > org.apache.spark.status.api.

[jira] [Assigned] (SPARK-13459) Separate Alive and Dead Executors in Executor Totals Table

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13459: Assignee: (was: Apache Spark) > Separate Alive and Dead Executors in Executor Totals T

[jira] [Assigned] (SPARK-13459) Separate Alive and Dead Executors in Executor Totals Table

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13459: Assignee: Apache Spark > Separate Alive and Dead Executors in Executor Totals Table >

[jira] [Commented] (SPARK-13459) Separate Alive and Dead Executors in Executor Totals Table

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159651#comment-15159651 ] Apache Spark commented on SPARK-13459: -- User 'ajbozarth' has created a pull request

[jira] [Commented] (SPARK-13431) Maven build fails due to: Method code too large! in Catalyst

2016-02-23 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159649#comment-15159649 ] Kazuaki Ishizaki commented on SPARK-13431: -- I identified why this problem occurs

[jira] [Commented] (SPARK-12313) getPartitionsByFilter doesnt handle predicates on all / multiple Partition Columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159642#comment-15159642 ] Apache Spark commented on SPARK-12313: -- User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-12313) getPartitionsByFilter doesnt handle predicates on all / multiple Partition Columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12313: Assignee: (was: Apache Spark) > getPartitionsByFilter doesnt handle predicates on all

[jira] [Assigned] (SPARK-12313) getPartitionsByFilter doesnt handle predicates on all / multiple Partition Columns

2016-02-23 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12313: Assignee: Apache Spark > getPartitionsByFilter doesnt handle predicates on all / multiple

[jira] [Commented] (SPARK-13460) Applying Encoding methods to GraphX's Internal storage structure

2016-02-23 Thread Adnan Haider (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159641#comment-15159641 ] Adnan Haider commented on SPARK-13460: -- I am currently working on submitting a pull

[jira] [Commented] (SPARK-13307) TPCDS query 66 degraded by 30% in 1.6.0 compared to 1.4.1

2016-02-23 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15159636#comment-15159636 ] JESSE CHEN commented on SPARK-13307: I tuned up the autoBroadcastJoinThreshold to 100

[jira] [Resolved] (SPARK-13329) Considering output for statistics of logical plan

2016-02-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13329. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11210 [https://github.

[jira] [Updated] (SPARK-11381) Replace example code in mllib-linear-methods.md using include_example

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11381: -- Target Version/s: 2.0.0 Component/s: MLlib > Replace example code in mllib-linear-meth

[jira] [Updated] (SPARK-11381) Replace example code in mllib-linear-methods.md using include_example

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-11381: -- Assignee: Dongjoon Hyun > Replace example code in mllib-linear-methods.md using include_example

[jira] [Updated] (SPARK-10759) Missing Python code example in model selection user guide

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10759: -- Shepherd: Xusen Yin Assignee: Jeremy (was: Apache Spark) > Missing Python code example in

[jira] [Updated] (SPARK-10759) Missing Python code example in model selection user guide

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10759: -- Target Version/s: 2.0.0 Component/s: ML > Missing Python code example in model selecti

[jira] [Updated] (SPARK-10759) Missing Python code example in model selection user guide

2016-02-23 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10759: -- Summary: Missing Python code example in model selection user guide (was: Missing Python code e

[jira] [Created] (SPARK-13460) Applying Encoding methods to GraphX's Internal storage structure

2016-02-23 Thread Adnan Haider (JIRA)
Adnan Haider created SPARK-13460: Summary: Applying Encoding methods to GraphX's Internal storage structure Key: SPARK-13460 URL: https://issues.apache.org/jira/browse/SPARK-13460 Project: Spark

  1   2   3   >