[jira] [Updated] (SPARK-18827) Cann't read broadcast if broadcast blocks are stored on-disk

2016-12-14 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-18827: Description: How to reproduce it: {code:java} test("Cache broadcast to disk") { val conf = ne

[jira] [Created] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2016-12-14 Thread vishal agrawal (JIRA)
vishal agrawal created SPARK-18857: -- Summary: SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode Key: SPARK-18857 URL: https://issues.apache.org/jira/browse/SPARK-18857

[jira] [Updated] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2016-12-14 Thread vishal agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vishal agrawal updated SPARK-18857: --- Attachment: GC-spark-2.0.2 GC-spark-1.6.3 GC logs for 2 spark versions while

[jira] [Updated] (SPARK-18857) SparkSQL ThriftServer hangs while extracting huge data volumes in incremental collect mode

2016-12-14 Thread vishal agrawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] vishal agrawal updated SPARK-18857: --- Description: We are trying to run a sql query on our spark cluster and extracting around 200

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-14 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15747863#comment-15747863 ] Vicente Masip commented on SPARK-18823: --- In this issue, there is something missing t

[jira] [Created] (SPARK-18858) reduceByKey not avaiable on Dataset

2016-12-14 Thread Jorge Machado (JIRA)
Jorge Machado created SPARK-18858: - Summary: reduceByKey not avaiable on Dataset Key: SPARK-18858 URL: https://issues.apache.org/jira/browse/SPARK-18858 Project: Spark Issue Type: Bug Aff

[jira] [Updated] (SPARK-18814) CheckAnalysis rejects TPCDS query 32

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18814: -- Assignee: Nattavut Sutyanyong (was: Herman van Hovell) > CheckAnalysis rejects TPCDS q

[jira] [Resolved] (SPARK-18814) CheckAnalysis rejects TPCDS query 32

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18814. --- Resolution: Fixed Fix Version/s: 2.1.0 > CheckAnalysis rejects TPCDS query 32

[jira] [Created] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
Mykhailo Osypov created SPARK-18859: --- Summary: Catalyst codegen does not mark column as nullable when it should. Causes NPE Key: SPARK-18859 URL: https://issues.apache.org/jira/browse/SPARK-18859 Pr

[jira] [Updated] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mykhailo Osypov updated SPARK-18859: Description: When joining two tables via LEFT JOIN, columns in right table may be NULLs, h

[jira] [Updated] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mykhailo Osypov updated SPARK-18859: Description: When joining two tables via LEFT JOIN, columns in right table may be NULLs, h

[jira] [Updated] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mykhailo Osypov updated SPARK-18859: Description: When joining two tables via LEFT JOIN, columns in right table may be NULLs, h

[jira] [Updated] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mykhailo Osypov updated SPARK-18859: Description: When joining two tables via LEFT JOIN, columns in right table may be NULLs, h

[jira] [Commented] (SPARK-18859) Catalyst codegen does not mark column as nullable when it should. Causes NPE

2016-12-14 Thread Mykhailo Osypov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748042#comment-15748042 ] Mykhailo Osypov commented on SPARK-18859: - Current workaround is to create a view

[jira] [Assigned] (SPARK-18779) Messages being received only from one partition when using Spark Streaming integration for Kafka 0.10 with kafka client library at 0.10.1

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18779: Assignee: (was: Apache Spark) > Messages being received only from one partition when u

[jira] [Commented] (SPARK-18779) Messages being received only from one partition when using Spark Streaming integration for Kafka 0.10 with kafka client library at 0.10.1

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748076#comment-15748076 ] Apache Spark commented on SPARK-18779: -- User 'pnakhe' has created a pull request for

[jira] [Assigned] (SPARK-18779) Messages being received only from one partition when using Spark Streaming integration for Kafka 0.10 with kafka client library at 0.10.1

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18779: Assignee: Apache Spark > Messages being received only from one partition when using Spark

[jira] [Commented] (SPARK-18471) In treeAggregate, generate (big) zeros instead of sending them.

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748079#comment-15748079 ] Apache Spark commented on SPARK-18471: -- User 'AnthonyTruchet' has created a pull req

[jira] [Assigned] (SPARK-18856) Newly created catalog table assumed to have 0 rows and 0 bytes

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18856: Assignee: Apache Spark > Newly created catalog table assumed to have 0 rows and 0 bytes >

[jira] [Commented] (SPARK-18856) Newly created catalog table assumed to have 0 rows and 0 bytes

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748253#comment-15748253 ] Apache Spark commented on SPARK-18856: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-18856) Newly created catalog table assumed to have 0 rows and 0 bytes

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18856: Assignee: (was: Apache Spark) > Newly created catalog table assumed to have 0 rows and

[jira] [Commented] (SPARK-17662) Dedup UDAF

2016-12-14 Thread Ohad Raviv (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748264#comment-15748264 ] Ohad Raviv commented on SPARK-17662: When I tried to use your suggestion I have encoun

[jira] [Created] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-18860: - Summary: Update Parquet to 1.9.0 Key: SPARK-18860 URL: https://issues.apache.org/jira/browse/SPARK-18860 Project: Spark Issue Type: Bug Component

[jira] [Assigned] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18860: Assignee: Apache Spark > Update Parquet to 1.9.0 > --- > >

[jira] [Commented] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748281#comment-15748281 ] Apache Spark commented on SPARK-18860: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18860: Assignee: (was: Apache Spark) > Update Parquet to 1.9.0 > --- > >

[jira] [Reopened] (SPARK-18829) Printing to logger

2016-12-14 Thread David Hodeffi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Hodeffi reopened SPARK-18829: --- Why did you close this issue? > Printing to logger > -- > > Key

[jira] [Commented] (SPARK-18829) Printing to logger

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748303#comment-15748303 ] Sean Owen commented on SPARK-18829: --- ... did you read the discussion just above? As sta

[jira] [Commented] (SPARK-18829) Printing to logger

2016-12-14 Thread David Hodeffi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748302#comment-15748302 ] David Hodeffi commented on SPARK-18829: --- My request is to write to logger the expla

[jira] [Commented] (SPARK-17262) Spark SizeEstimator does not ignore transient fields in java classes when calculates class size

2016-12-14 Thread Michel Lemay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748307#comment-15748307 ] Michel Lemay commented on SPARK-17262: -- I agree. As a real-world example of this, co

[jira] [Commented] (SPARK-17262) Spark SizeEstimator does not ignore transient fields in java classes when calculates class size

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748342#comment-15748342 ] Sean Owen commented on SPARK-17262: --- This is still wrong. transient fields most certain

[jira] [Created] (SPARK-18861) Spark-SQL unconsistent behavior with "struct" expressions

2016-12-14 Thread Ohad Raviv (JIRA)
Ohad Raviv created SPARK-18861: -- Summary: Spark-SQL unconsistent behavior with "struct" expressions Key: SPARK-18861 URL: https://issues.apache.org/jira/browse/SPARK-18861 Project: Spark Issue T

[jira] [Commented] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748397#comment-15748397 ] Hyukjin Kwon commented on SPARK-18860: -- Ah, [~dongjoon], it seems it is a duplicate

[jira] [Commented] (SPARK-18569) Support R formula arithmetic

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748440#comment-15748440 ] Yanbo Liang commented on SPARK-18569: - This is generally a good idea, but I think we

[jira] [Comment Edited] (SPARK-18569) Support R formula arithmetic

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748440#comment-15748440 ] Yanbo Liang edited comment on SPARK-18569 at 12/14/16 2:25 PM:

[jira] [Comment Edited] (SPARK-18569) Support R formula arithmetic

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748440#comment-15748440 ] Yanbo Liang edited comment on SPARK-18569 at 12/14/16 2:26 PM:

[jira] [Commented] (SPARK-18861) Spark-SQL unconsistent behavior with "struct" expressions

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748503#comment-15748503 ] Herman van Hovell commented on SPARK-18861: --- Your code fails because the names

[jira] [Comment Edited] (SPARK-18861) Spark-SQL unconsistent behavior with "struct" expressions

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748503#comment-15748503 ] Herman van Hovell edited comment on SPARK-18861 at 12/14/16 2:40 PM: --

[jira] [Comment Edited] (SPARK-18861) Spark-SQL unconsistent behavior with "struct" expressions

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748503#comment-15748503 ] Herman van Hovell edited comment on SPARK-18861 at 12/14/16 2:41 PM: --

[jira] [Updated] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18710: Assignee: Wayne Zhang > Add offset to GeneralizedLinearRegression models >

[jira] [Updated] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18710: Fix Version/s: (was: 2.2.0) > Add offset to GeneralizedLinearRegression models > --

[jira] [Updated] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18710: Target Version/s: (was: 2.0.2) > Add offset to GeneralizedLinearRegression models > -

[jira] [Commented] (SPARK-18710) Add offset to GeneralizedLinearRegression models

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748523#comment-15748523 ] Yanbo Liang commented on SPARK-18710: - [~actuaryzhang] This proposal makes sense, ple

[jira] [Created] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-18862: --- Summary: Split SparkR mllib.R into multiple files Key: SPARK-18862 URL: https://issues.apache.org/jira/browse/SPARK-18862 Project: Spark Issue Type: Improvemen

[jira] [Updated] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18862: Description: SparkR mllib.R is getting bigger as we add more ML wrappers, I'd like to split it int

[jira] [Commented] (SPARK-17262) Spark SizeEstimator does not ignore transient fields in java classes when calculates class size

2016-12-14 Thread Michel Lemay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748638#comment-15748638 ] Michel Lemay commented on SPARK-17262: -- In that case, it makes no sense to keep `val

[jira] [Commented] (SPARK-17262) Spark SizeEstimator does not ignore transient fields in java classes when calculates class size

2016-12-14 Thread George Shuklin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748660#comment-15748660 ] George Shuklin commented on SPARK-17262: My original point was slightly different

[jira] [Created] (SPARK-18863) Output non-aggregate expression without a GROUP BY clause in a subquery does not yield a syntax error

2016-12-14 Thread Nattavut Sutyanyong (JIRA)
Nattavut Sutyanyong created SPARK-18863: --- Summary: Output non-aggregate expression without a GROUP BY clause in a subquery does not yield a syntax error Key: SPARK-18863 URL: https://issues.apache.org/jira/

[jira] [Updated] (SPARK-18863) Output non-aggregate expression without a GROUP BY clause in a subquery does not yield a syntax error

2016-12-14 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nattavut Sutyanyong updated SPARK-18863: Description: [~smilegator] has found that the following query does not raise a synt

[jira] [Updated] (SPARK-18863) Output non-aggregate expressions without GROUP BY in a subquery does not yield an error

2016-12-14 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nattavut Sutyanyong updated SPARK-18863: Summary: Output non-aggregate expressions without GROUP BY in a subquery does not y

[jira] [Commented] (SPARK-18699) Spark CSV parsing types other than String throws exception when malformed

2016-12-14 Thread Rishi Kamaleswaran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748886#comment-15748886 ] Rishi Kamaleswaran commented on SPARK-18699: This issue is also seen in cases

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748961#comment-15748961 ] Michael Allman commented on SPARK-18853: Should we link this to https://issues.ap

[jira] [Updated] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18853: Description: We currently define statistics in UnaryNode: {code} override def statistics: Stati

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748962#comment-15748962 ] Michael Allman commented on SPARK-18853: I'll just add another issue with overest

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748964#comment-15748964 ] Reynold Xin commented on SPARK-18853: - Can you say more? Are you talking about deeply

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748968#comment-15748968 ] Michael Allman commented on SPARK-18853: Yes, nested arrays. > Project (UnaryNod

[jira] [Updated] (SPARK-18855) Add RDD flatten function

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18855: -- Priority: Minor (was: Major) This should really go on Dataset as well, if this is done at all. It does

[jira] [Commented] (SPARK-18863) Output non-aggregate expressions without GROUP BY in a subquery does not yield an error

2016-12-14 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15748975#comment-15748975 ] Xiao Li commented on SPARK-18863: - The JIRA only shows one scenario. The expected error m

[jira] [Updated] (SPARK-18863) Output non-aggregate expressions without GROUP BY in a subquery does not yield an error

2016-12-14 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-18863: Priority: Major (was: Minor) > Output non-aggregate expressions without GROUP BY in a subquery does not >

[jira] [Resolved] (SPARK-18858) reduceByKey not avaiable on Dataset

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18858. --- Resolution: Invalid Questions should really go to u...@spark.apache.org. The Dataset API is differen

[jira] [Commented] (SPARK-18829) Printing to logger

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749010#comment-15749010 ] Herman van Hovell commented on SPARK-18829: --- Both {{show()}} and {{explain()}}

[jira] [Resolved] (SPARK-18846) Fix flakiness in SchedulerIntegrationSuite

2016-12-14 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-18846. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16270 [https://git

[jira] [Commented] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749073#comment-15749073 ] Dongjoon Hyun commented on SPARK-18860: --- Oh, sure! Thank you! > Update Parquet to

[jira] [Closed] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-18860. - Resolution: Fixed > Update Parquet to 1.9.0 > --- > > Key: SP

[jira] [Closed] (SPARK-18140) Parquet NPE / Update to 1.9

2016-12-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-18140. - Resolution: Duplicate > Parquet NPE / Update to 1.9 > --- > >

[jira] [Reopened] (SPARK-18140) Parquet NPE / Update to 1.9

2016-12-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reopened SPARK-18140: --- > Parquet NPE / Update to 1.9 > --- > > Key: SPARK-18140

[jira] [Updated] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18864: -- Description: This JIRA is for tracking changes of behavior within MLlib and SparkR for

[jira] [Created] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2016-12-14 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-18864: - Summary: Changes of MLlib and SparkR behavior for 2.2 Key: SPARK-18864 URL: https://issues.apache.org/jira/browse/SPARK-18864 Project: Spark Issue

[jira] [Commented] (SPARK-18864) Changes of MLlib and SparkR behavior for 2.2

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749113#comment-15749113 ] Joseph K. Bradley commented on SPARK-18864: --- [SPARK-18374]: Change English stop

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749114#comment-15749114 ] Joseph K. Bradley commented on SPARK-18374: --- I noted this change of behavior in

[jira] [Assigned] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13127: Assignee: Apache Spark > Upgrade Parquet to 1.9 (Fixes parquet sorting) >

[jira] [Assigned] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13127: Assignee: (was: Apache Spark) > Upgrade Parquet to 1.9 (Fixes parquet sorting) > -

[jira] [Commented] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749118#comment-15749118 ] Apache Spark commented on SPARK-13127: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Closed] (SPARK-11374) skip.header.line.count is ignored in HiveContext

2016-12-14 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-11374. - Resolution: Won't Fix Please see the discussion on the PR. > skip.header.line.count is ignored in HiveCo

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749138#comment-15749138 ] Sean Owen commented on SPARK-18374: --- Yeah I tagged as 'releasenotes' for that reason --

[jira] [Resolved] (SPARK-18730) Ask the build script to link to Jenkins test report page instead of full console output page when posting to GitHub

2016-12-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18730. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.0 > Ask the build script to

[jira] [Updated] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18842: -- Assignee: Hyukjin Kwon > De-duplicate paths in classpaths in processes for local-cluster mode to work

[jira] [Resolved] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18842. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16266 [https://github.co

[jira] [Resolved] (SPARK-18830) Fix tests in PipedRDDSuite to pass on Winodws

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18830. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16254 [https://github.co

[jira] [Reopened] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-18860: --- > Update Parquet to 1.9.0 > --- > > Key: SPARK-18860 >

[jira] [Resolved] (SPARK-18860) Update Parquet to 1.9.0

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18860. --- Resolution: Duplicate > Update Parquet to 1.9.0 > --- > > Key: SP

[jira] [Updated] (SPARK-18830) Fix tests in PipedRDDSuite to pass on Winodws

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18830: -- Assignee: Hyukjin Kwon > Fix tests in PipedRDDSuite to pass on Winodws > --

[jira] [Resolved] (SPARK-18753) Inconsistent behavior after writing to parquet files

2016-12-14 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-18753. Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 16184 [https://github.

[jira] [Updated] (SPARK-18753) Inconsistent behavior after writing to parquet files

2016-12-14 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-18753: --- Assignee: Hyukjin Kwon > Inconsistent behavior after writing to parquet files > -

[jira] [Updated] (SPARK-18753) Inconsistent behavior after writing to parquet files

2016-12-14 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-18753: --- Fix Version/s: 2.2.0 > Inconsistent behavior after writing to parquet files > ---

[jira] [Resolved] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18853. --- Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 2.1.0 > Projec

[jira] [Updated] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18853: -- Fix Version/s: 2.0.3 > Project (UnaryNode) is way too aggressive in estimating statisti

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749397#comment-15749397 ] Michael Allman commented on SPARK-18853: [~rxin] [~hvanhovell] Should we move the

[jira] [Commented] (SPARK-18853) Project (UnaryNode) is way too aggressive in estimating statistics

2016-12-14 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749405#comment-15749405 ] Reynold Xin commented on SPARK-18853: - Let's do that separately (I thought about doin

[jira] [Resolved] (SPARK-14767) Codegen "no constructor found" errors with Maps inside case classes in Datasets

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14767. --- Resolution: Duplicate Fix Version/s: (was: 2.2.0) > Codegen "no constructor found" errors

[jira] [Reopened] (SPARK-14767) Codegen "no constructor found" errors with Maps inside case classes in Datasets

2016-12-14 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-14767: --- > Codegen "no constructor found" errors with Maps inside case classes in > Datasets > --

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749447#comment-15749447 ] Joseph K. Bradley commented on SPARK-18795: --- [~wangmiao1981] I'm going to take

[jira] [Assigned] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-18795: - Assignee: Joseph K. Bradley (was: Miao Wang) > SparkR vignette update: ksTest >

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749450#comment-15749450 ] Joseph K. Bradley commented on SPARK-18795: --- But feel free to send an update la

[jira] [Commented] (SPARK-18795) SparkR vignette update: ksTest

2016-12-14 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749536#comment-15749536 ] Apache Spark commented on SPARK-18795: -- User 'jkbradley' has created a pull request

[jira] [Commented] (SPARK-18854) getNodeNumbered and generateTreeString are not consistent

2016-12-14 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749537#comment-15749537 ] Xiao Li commented on SPARK-18854: - Sorry, I missed the ping. > getNodeNumbered and gener

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749559#comment-15749559 ] Felix Cheung commented on SPARK-18862: -- AFAIK, R package has a constrain that it has

[jira] [Comment Edited] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749559#comment-15749559 ] Felix Cheung edited comment on SPARK-18862 at 12/14/16 9:37 PM: ---

[jira] [Resolved] (SPARK-18852) StreamingQuery.lastProgress should be null when recentProgress is empty

2016-12-14 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18852. -- Resolution: Fixed Assignee: Shixiong Zhu Fix Version/s: 2.1.0 > StreamingQuery.

[jira] [Comment Edited] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15749559#comment-15749559 ] Felix Cheung edited comment on SPARK-18862 at 12/14/16 9:38 PM: ---
