[jira] [Commented] (SPARK-11337) Make example code in user guide testable

2015-10-28 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979884#comment-14979884 ] Xusen Yin commented on SPARK-11337: --- Sure will do it later. > Make example code in use

[jira] [Commented] (SPARK-6190) create LargeByteBuffer abstraction for eliminating 2GB limit on blocks

2015-10-28 Thread hotdog (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979886#comment-14979886 ] hotdog commented on SPARK-6190: --- is there any progress? > create LargeByteBuffer abstractio

[jira] [Created] (SPARK-11398) misleading dialect conf at the start of spark-sql

2015-10-28 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-11398: Summary: misleading dialect conf at the start of spark-sql Key: SPARK-11398 URL: https://issues.apache.org/jira/browse/SPARK-11398 Project: Spark Issue Type:

[jira] [Created] (SPARK-11399) Include_example should support labels to cut out different parts in one example code

2015-10-28 Thread Xusen Yin (JIRA)
Xusen Yin created SPARK-11399: - Summary: Include_example should support labels to cut out different parts in one example code Key: SPARK-11399 URL: https://issues.apache.org/jira/browse/SPARK-11399 Projec

[jira] [Updated] (SPARK-11399) Include_example should support labels to cut out different parts in one example code

2015-10-28 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-11399: -- Description: There are many small examples that do not need to create a single example file. Take the

[jira] [Assigned] (SPARK-11398) misleading dialect conf at the start of spark-sql

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11398: Assignee: Apache Spark > misleading dialect conf at the start of spark-sql > -

[jira] [Commented] (SPARK-11398) misleading dialect conf at the start of spark-sql

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979916#comment-14979916 ] Apache Spark commented on SPARK-11398: -- User 'wzhfy' has created a pull request for

[jira] [Assigned] (SPARK-11394) PostgreDialect cannot handle BYTE types

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11394: Assignee: (was: Apache Spark) > PostgreDialect cannot handle BYTE types >

[jira] [Commented] (SPARK-11394) PostgreDialect cannot handle BYTE types

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979915#comment-14979915 ] Apache Spark commented on SPARK-11394: -- User 'maropu' has created a pull request for

[jira] [Assigned] (SPARK-11398) misleading dialect conf at the start of spark-sql

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11398: Assignee: (was: Apache Spark) > misleading dialect conf at the start of spark-sql > --

[jira] [Assigned] (SPARK-11394) PostgreDialect cannot handle BYTE types

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11394: Assignee: Apache Spark > PostgreDialect cannot handle BYTE types > ---

[jira] [Created] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join

2015-10-28 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-11400: --- Summary: BroadcastNestedLoopJoin should support LeftSemi join Key: SPARK-11400 URL: https://issues.apache.org/jira/browse/SPARK-11400 Project: Spark Is

[jira] [Commented] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979920#comment-14979920 ] Apache Spark commented on SPARK-11400: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11400: Assignee: (was: Apache Spark) > BroadcastNestedLoopJoin should support LeftSemi join >

[jira] [Assigned] (SPARK-11400) BroadcastNestedLoopJoin should support LeftSemi join

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11400: Assignee: Apache Spark > BroadcastNestedLoopJoin should support LeftSemi join > --

[jira] [Commented] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979924#comment-14979924 ] Yanbo Liang commented on SPARK-9836: OK, I will try to finish it before the end of the

[jira] [Updated] (SPARK-11398) misleading dialect conf at the start of spark-sql

2015-10-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-11398: - Description: When we start bin/spark-sql, the default context is HiveContext, and the correspond

[jira] [Commented] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14979935#comment-14979935 ] Cheng Lian commented on SPARK-11376: No, {{GenerateColumnAccessor}} only exist in mas

[jira] [Commented] (SPARK-11045) Contributing Receiver based Low Level Kafka Consumer from Spark-Packages to Apache Spark Project

2015-10-28 Thread Dibyendu Bhattacharya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977874#comment-14977874 ] Dibyendu Bhattacharya commented on SPARK-11045: --- hi [~tdas] , let me know w

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10517: --- Affects Version/s: (was: 1.5.0) 1.5.1 > Console "Output" field is

[jira] [Commented] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977890#comment-14977890 ] Maciej Bryński commented on SPARK-10517: No. I think that output field is always

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977891#comment-14977891 ] Sean Owen commented on SPARK-11332: --- They need to be a Contributor in JIRA. I can add t

[jira] [Commented] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977893#comment-14977893 ] DB Tsai commented on SPARK-11332: - Thanks. Please help me to add him as a contributor. >

[jira] [Created] (SPARK-11365) consolidate aggregates for summary statistics in weighted least squares

2015-10-28 Thread holdenk (JIRA)
holdenk created SPARK-11365: --- Summary: consolidate aggregates for summary statistics in weighted least squares Key: SPARK-11365 URL: https://issues.apache.org/jira/browse/SPARK-11365 Project: Spark

[jira] [Resolved] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai resolved SPARK-11332. - Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9325 [https://github.com/apa

[jira] [Commented] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977940#comment-14977940 ] Apache Spark commented on SPARK-11246: -- User 'xwu0226' has created a pull request fo

[jira] [Assigned] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11246: Assignee: Apache Spark > [1.5] Table cache for Parquet broken in 1.5 > ---

[jira] [Assigned] (SPARK-11246) [1.5] Table cache for Parquet broken in 1.5

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11246: Assignee: (was: Apache Spark) > [1.5] Table cache for Parquet broken in 1.5 >

[jira] [Commented] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977961#comment-14977961 ] Cheng Lian commented on SPARK-11103: Quoted from my reply on the user list: For 1: T

[jira] [Updated] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11103: --- Assignee: Hyukjin Kwon > Filter applied on Merged Parquet shema with new column fail with > (java.la

[jira] [Comment Edited] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977961#comment-14977961 ] Cheng Lian edited comment on SPARK-11103 at 10/28/15 8:32 AM: -

[jira] [Updated] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11103: --- Target Version/s: 1.5.3, 1.6.0 (was: 1.6.0) > Filter applied on Merged Parquet shema with new column

[jira] [Commented] (SPARK-2089) With YARN, preferredNodeLocalityData isn't honored

2015-10-28 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977969#comment-14977969 ] Saisai Shao commented on SPARK-2089: Hi [~pwendell], [~mridulm80], [~sandyr] and [~lia

[jira] [Assigned] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11103: Assignee: Apache Spark (was: Hyukjin Kwon) > Filter applied on Merged Parquet shema with

[jira] [Assigned] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11103: Assignee: Hyukjin Kwon (was: Apache Spark) > Filter applied on Merged Parquet shema with

[jira] [Commented] (SPARK-11103) Filter applied on Merged Parquet shema with new column fail with (java.lang.IllegalArgumentException: Column [column_name] was not found in schema!)

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14977976#comment-14977976 ] Apache Spark commented on SPARK-11103: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-10-28 Thread Koert Kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978000#comment-14978000 ] Koert Kuipers commented on SPARK-3655: -- spark-sorted (https://github.com/tresata/spar

[jira] [Created] (SPARK-11366) binary functions (methods) such as rlike in pyspark.sql.column only accept strings but should also accept another Column

2015-10-28 Thread Jose Antonio (JIRA)
Jose Antonio created SPARK-11366: Summary: binary functions (methods) such as rlike in pyspark.sql.column only accept strings but should also accept another Column Key: SPARK-11366 URL: https://issues.apache.org/j

[jira] [Commented] (SPARK-11167) Incorrect type resolution on heterogeneous data structures

2015-10-28 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978007#comment-14978007 ] Sun Rui commented on SPARK-11167: - It seems time-consuming and not desirable to going thr

[jira] [Created] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-11367: --- Summary: Python LinearRegression should support setting solver Key: SPARK-11367 URL: https://issues.apache.org/jira/browse/SPARK-11367 Project: Spark Issue Typ

[jira] [Created] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
Maciej Bryński created SPARK-11368: -- Summary: Spark scan all partitions when using Python UDF and filter over partitioned column is given Key: SPARK-11368 URL: https://issues.apache.org/jira/browse/SPARK-11368

[jira] [Commented] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978037#comment-14978037 ] Apache Spark commented on SPARK-11367: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11367: Assignee: Apache Spark > Python LinearRegression should support setting solver > -

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11367: Description: Python ML LinearRegression should support setting solver("auto", "normal", "l-bfgs")

[jira] [Assigned] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11367: Assignee: (was: Apache Spark) > Python LinearRegression should support setting solver

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11367: Description: SPARK-10668 has provided WeightedLeastSquares solver("normal") in LinearRegression wit

[jira] [Updated] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11368: --- Description: Hi, I think this is huge performance bug. I created parquet file partitioned by

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11367: Description: SPARK-10668 has provide WeightedLeastSquares solver("normal") in LinearRegression with

[jira] [Updated] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11368: --- Description: Hi, I think this is huge performance bug. I created parquet file partitioned by

[jira] [Updated] (SPARK-11367) Python LinearRegression should support setting solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-11367: Description: SPARK-10668 has provided WeightedLeastSquares solver("normal") in LinearRegression wit

[jira] [Created] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-11369: --- Summary: SparkR glm should support setting standardize Key: SPARK-11369 URL: https://issues.apache.org/jira/browse/SPARK-11369 Project: Spark Issue Type: Impro

[jira] [Created] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-11370: --- Summary: fix a bug in GroupedIterator and create unit test for it Key: SPARK-11370 URL: https://issues.apache.org/jira/browse/SPARK-11370 Project: Spark Issue

[jira] [Commented] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978125#comment-14978125 ] Apache Spark commented on SPARK-11370: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11370: Assignee: (was: Apache Spark) > fix a bug in GroupedIterator and create unit test for

[jira] [Assigned] (SPARK-11370) fix a bug in GroupedIterator and create unit test for it

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11370: Assignee: Apache Spark > fix a bug in GroupedIterator and create unit test for it > --

[jira] [Commented] (SPARK-9836) Provide R-like summary statistics for ordinary least squares via normal equation solver

2015-10-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978149#comment-14978149 ] Yanbo Liang commented on SPARK-9836: I will work on it. > Provide R-like summary stat

[jira] [Created] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Ted Yu (JIRA)
Ted Yu created SPARK-11371: -- Summary: Make "mean" an alias for "avg" operator Key: SPARK-11371 URL: https://issues.apache.org/jira/browse/SPARK-11371 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Ted Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated SPARK-11371: --- Attachment: spark-11371-v1.patch > Make "mean" an alias for "avg" operator >

[jira] [Created] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
Pravin Gadakh created SPARK-11372: - Summary: custom UDAF with StringType throws java.lang.ClassCastException Key: SPARK-11372 URL: https://issues.apache.org/jira/browse/SPARK-11372 Project: Spark

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses one StringType column as intermediate b

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses one StringType column as intermediate b

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses StringType column as intermediate buffe

[jira] [Updated] (SPARK-11372) custom UDAF with StringType throws java.lang.ClassCastException

2015-10-28 Thread Pravin Gadakh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pravin Gadakh updated SPARK-11372: -- Description: Consider following custom UDAF which uses StringType column as intermediate buffe

[jira] [Updated] (SPARK-11332) WeightedLeastSquares should use ml features generic Instance class instead of private

2015-10-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11332: -- Assignee: Nakul Jindal > WeightedLeastSquares should use ml features generic Instance class instead of

[jira] [Assigned] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11369: Assignee: Apache Spark > SparkR glm should support setting standardize > -

[jira] [Commented] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978256#comment-14978256 ] Apache Spark commented on SPARK-11369: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-11369) SparkR glm should support setting standardize

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11369: Assignee: (was: Apache Spark) > SparkR glm should support setting standardize > --

[jira] [Updated] (SPARK-11317) YARN HBase token code shouldn't swallow invocation target exceptions

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-11317: --- Affects Version/s: 1.5.1 > YARN HBase token code shouldn't swallow invocation target exceptio

[jira] [Updated] (SPARK-11317) YARN HBase token code shouldn't swallow invocation target exceptions

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-11317: --- Description: As with SPARK-11265; the HBase token retrieval code of SPARK-6918 1. swallows e

[jira] [Created] (SPARK-11373) Add metrics to the History Server and providers

2015-10-28 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-11373: -- Summary: Add metrics to the History Server and providers Key: SPARK-11373 URL: https://issues.apache.org/jira/browse/SPARK-11373 Project: Spark Issue Typ

[jira] [Commented] (SPARK-11373) Add metrics to the History Server and providers

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978322#comment-14978322 ] Steve Loughran commented on SPARK-11373: # This has tangible benefit for the SPAR

[jira] [Created] (SPARK-11374) skip.header.line.count is ignored in HiveContext

2015-10-28 Thread Daniel Haviv (JIRA)
Daniel Haviv created SPARK-11374: Summary: skip.header.line.count is ignored in HiveContext Key: SPARK-11374 URL: https://issues.apache.org/jira/browse/SPARK-11374 Project: Spark Issue Type:

[jira] [Created] (SPARK-11375) History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders

2015-10-28 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-11375: -- Summary: History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders Key: SPARK-11375 URL: https://issues.apache.org/jira/browse/SPARK-113

[jira] [Commented] (SPARK-11375) History Server "no histories" message to be dynamically generated by ApplicationHistoryProviders

2015-10-28 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978329#comment-14978329 ] Steve Loughran commented on SPARK-11375: This could be implemented with a new met

[jira] [Resolved] (SPARK-11313) Implement cogroup

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-11313. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9324 [http

[jira] [Updated] (SPARK-11313) Implement cogroup

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-11313: - Assignee: Wenchen Fan > Implement cogroup > - > > Key: SP

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-10-28 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978406#comment-14978406 ] Michael Armbrust commented on SPARK-11303: -- I picked it into branch-1.5, but I'm

[jira] [Comment Edited] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978412#comment-14978412 ] Maciej Bryński edited comment on SPARK-11368 at 10/28/15 1:27 PM: -

[jira] [Commented] (SPARK-11368) Spark scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978412#comment-14978412 ] Maciej Bryński commented on SPARK-11368: Problem exists only when using Pyspark.

[jira] [Updated] (SPARK-11368) Spark shouldn't scan all partitions when using Python UDF and filter over partitioned column is given

2015-10-28 Thread Maciej Bryński (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-11368: --- Summary: Spark shouldn't scan all partitions when using Python UDF and filter over partitione

[jira] [Assigned] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11371: Assignee: (was: Apache Spark) > Make "mean" an alias for "avg" operator >

[jira] [Commented] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978482#comment-14978482 ] Apache Spark commented on SPARK-11371: -- User 'ted-yu' has created a pull request for

[jira] [Created] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-11376: -- Summary: Invalid generated Java code in GenerateColumnAccessor Key: SPARK-11376 URL: https://issues.apache.org/jira/browse/SPARK-11376 Project: Spark Issue Type:

[jira] [Created] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-11377: Summary: withNewChildren should not convert StructType to Seq Key: SPARK-11377 URL: https://issues.apache.org/jira/browse/SPARK-11377 Project: Spark

[jira] [Assigned] (SPARK-11371) Make "mean" an alias for "avg" operator

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11371: Assignee: Apache Spark > Make "mean" an alias for "avg" operator > ---

[jira] [Assigned] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11377: Assignee: Michael Armbrust (was: Apache Spark) > withNewChildren should not convert Struc

[jira] [Commented] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978505#comment-14978505 ] Apache Spark commented on SPARK-11377: -- User 'marmbrus' has created a pull request f

[jira] [Assigned] (SPARK-11377) withNewChildren should not convert StructType to Seq

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11377: Assignee: Apache Spark (was: Michael Armbrust) > withNewChildren should not convert Struc

[jira] [Commented] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978509#comment-14978509 ] Apache Spark commented on SPARK-11376: -- User 'liancheng' has created a pull request

[jira] [Assigned] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11376: Assignee: Apache Spark (was: Cheng Lian) > Invalid generated Java code in GenerateColumnA

[jira] [Assigned] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11376: Assignee: Cheng Lian (was: Apache Spark) > Invalid generated Java code in GenerateColumnA

[jira] [Commented] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978528#comment-14978528 ] Saif Addin Ellafi commented on SPARK-11330: --- Hello Cheng Hao, and thank you ver

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Attachment: bug_reproduce.zip Json dataset folder > Filter operation on StringType aft

[jira] [Created] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Nick Evans (JIRA)
Nick Evans created SPARK-11378: -- Summary: StreamingContext.awaitTerminationOrTimeout does not return Key: SPARK-11378 URL: https://issues.apache.org/jira/browse/SPARK-11378 Project: Spark Issue

[jira] [Updated] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saif Addin Ellafi updated SPARK-11330: -- Attachment: bug_reproduce.zip > Filter operation on StringType after groupBy PERSISTED

[jira] [Assigned] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11378: Assignee: (was: Apache Spark) > StreamingContext.awaitTerminationOrTimeout does not re

[jira] [Updated] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11376: --- Priority: Major (was: Minor) > Invalid generated Java code in GenerateColumnAccessor > -

[jira] [Comment Edited] (SPARK-11330) Filter operation on StringType after groupBy PERSISTED brings no results

2015-10-28 Thread Saif Addin Ellafi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14978528#comment-14978528 ] Saif Addin Ellafi edited comment on SPARK-11330 at 10/28/15 2:46 PM: --

[jira] [Assigned] (SPARK-11378) StreamingContext.awaitTerminationOrTimeout does not return

2015-10-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11378: Assignee: Apache Spark > StreamingContext.awaitTerminationOrTimeout does not return >

[jira] [Updated] (SPARK-11376) Invalid generated Java code in GenerateColumnAccessor

2015-10-28 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-11376: --- Description: There are two {{mutableRow}} fields in the generated code within {{GenerateColumnAccess
