[jira] [Updated] (SPARK-14992) Flaky test: BucketedReadSuite.only shuffle one side when join bucketed table and non-bucketed table

2016-05-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14992: --- Priority: Critical (was: Major) > Flaky test: BucketedReadSuite.only shuffle one side when join

[jira] [Updated] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-05-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14773: --- Target Version/s: 2.0.0 > Enable the tests in HiveCompatibilitySuite for subquery >

[jira] [Resolved] (SPARK-13745) Support columnar in memory representation on Big Endian platforms

2016-05-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13745. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12397

[jira] [Updated] (SPARK-13745) Support columnar in memory representation on Big Endian platforms

2016-05-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13745: --- Assignee: Pete Robbins > Support columnar in memory representation on Big Endian platforms >

[jira] [Resolved] (SPARK-14781) Support subquery in nested predicates

2016-05-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14781. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12820

[jira] [Assigned] (SPARK-14781) Support subquery in nested predicates

2016-05-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14781: -- Assignee: Davies Liu > Support subquery in nested predicates >

[jira] [Updated] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14757: --- Fix Version/s: 1.6.2 > Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left

[jira] [Updated] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14757: --- Assignee: Reynold Xin > Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left

[jira] [Resolved] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14757. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12799

[jira] [Resolved] (SPARK-14858) Push predicates with subquery

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14858. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12720

[jira] [Commented] (SPARK-14781) Support subquery in nested predicates

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264868#comment-15264868 ] Davies Liu commented on SPARK-14781: PR 12720 is pretty close, we should wait for that. > Support

[jira] [Commented] (SPARK-14781) Support subquery in nested predicates

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264866#comment-15264866 ] Davies Liu commented on SPARK-14781: Distinct is slow, it's better to not use that. The LeftSemiPlus

[jira] [Commented] (SPARK-12981) Dataframe distinct() followed by a filter(udf) in pyspark throws a casting error

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264437#comment-15264437 ] Davies Liu commented on SPARK-12981: This depends on several changes in 2.0, it's not easy to back

[jira] [Commented] (SPARK-14224) Cannot project all columns from a table with ~1,100 columns

2016-04-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264434#comment-15264434 ] Davies Liu commented on SPARK-14224: [~jgagnon] This bug does not exists in 1.6 branch. > Cannot

[jira] [Created] (SPARK-14992) Flaky test: BucketedReadSuite.only shuffle one side when join bucketed table and non-bucketed table

2016-04-28 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14992: -- Summary: Flaky test: BucketedReadSuite.only shuffle one side when join bucketed table and non-bucketed table Key: SPARK-14992 URL: https://issues.apache.org/jira/browse/SPARK-14992

[jira] [Created] (SPARK-14987) Inline Hive thrift-server into Spark

2016-04-28 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14987: -- Summary: Inline Hive thrift-server into Spark Key: SPARK-14987 URL: https://issues.apache.org/jira/browse/SPARK-14987 Project: Spark Issue Type: Task

[jira] [Resolved] (SPARK-14961) Support LongToUnsafeRowMap larger than 1G

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14961. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12740

[jira] [Commented] (SPARK-14781) Support subquery in nested predicates

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261230#comment-15261230 ] Davies Liu commented on SPARK-14781: [~freiss] SemiPlus is not introduced yet. Even the subquery in

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: {code} SELECT a from t where b > (select avg(c) from t2 where t.id =

[jira] [Commented] (SPARK-13323) Type cast support in type inference during merging types.

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261154#comment-15261154 ] Davies Liu commented on SPARK-13323: This API is not designed to use in this way, I'd like to not do

[jira] [Closed] (SPARK-13323) Type cast support in type inference during merging types.

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-13323. -- Resolution: Not A Problem > Type cast support in type inference during merging types. >

[jira] [Commented] (SPARK-7898) pyspark merges stderr into stdout

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261148#comment-15261148 ] Davies Liu commented on SPARK-7898: --- [~sds] So this is not a problem for PySpark, right? > pyspark

[jira] [Resolved] (SPARK-11757) Incorrect join output for joining two dataframes loaded from Parquet format

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11757. Resolution: Fixed Assignee: Dilip Biswal Fix Version/s: 2.0.0 > Incorrect join

[jira] [Closed] (SPARK-13837) SQL Context function to_date() returns wrong date

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-13837. -- Resolution: Cannot Reproduce Assignee: Davies Liu Fix Version/s: 2.0.0 > SQL Context

[jira] [Commented] (SPARK-13837) SQL Context function to_date() returns wrong date

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261084#comment-15261084 ] Davies Liu commented on SPARK-13837: @Amaud Caruso I'm in the same time zone as you , but can't

[jira] [Closed] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-9807. - Resolution: Not A Problem Assignee: Davies Liu > pyspark.sql.createDataFrame does not infer data

[jira] [Commented] (SPARK-9807) pyspark.sql.createDataFrame does not infer data type of parsed TSV

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261062#comment-15261062 ] Davies Liu commented on SPARK-9807: --- The type inferring here is talking about get the type from Python

[jira] [Resolved] (SPARK-11368) Spark shouldn't scan all partitions when using Python UDF and filter over partitioned column is given

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11368. Resolution: Fixed Assignee: Davies Liu Fix Version/s: 2.0.0 This was fixed by

[jira] [Commented] (SPARK-10069) Python's ReduceByKeyAndWindow DStream Keeps Growing

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261011#comment-15261011 ] Davies Liu commented on SPARK-10069: cc [~zsxwing] > Python's ReduceByKeyAndWindow DStream Keeps

[jira] [Resolved] (SPARK-7891) Python class in __main__ may trigger AssertionError

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-7891. --- Resolution: Duplicate Assignee: Shixiong Zhu Fix Version/s: 2.0.0 > Python class in

[jira] [Closed] (SPARK-12683) SQL timestamp is wrong when accessed as Python datetime

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-12683. -- Resolution: Won't Fix Assignee: Davies Liu > SQL timestamp is wrong when accessed as Python

[jira] [Commented] (SPARK-12683) SQL timestamp is wrong when accessed as Python datetime

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15261004#comment-15261004 ] Davies Liu commented on SPARK-12683: Done some debugging on this, it seems that the Java library

[jira] [Created] (SPARK-14961) Support LongToUnsafeRowMap larger than 1G

2016-04-27 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14961: -- Summary: Support LongToUnsafeRowMap larger than 1G Key: SPARK-14961 URL: https://issues.apache.org/jira/browse/SPARK-14961 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-14911) Fix a potential data race in TaskMemoryManager

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14911. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12681

[jira] [Updated] (SPARK-14911) Fix a potential data race in TaskMemoryManager

2016-04-27 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14911: --- Assignee: Liwei Lin > Fix a potential data race in TaskMemoryManager >

[jira] [Resolved] (SPARK-14929) Disable vectorized hashmap for wide schemas and high-precision decimals

2016-04-26 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14929. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12710

[jira] [Resolved] (SPARK-14853) Support LeftSemi/LeftAnti in SortMergeJoin

2016-04-26 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14853. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12668

[jira] [Assigned] (SPARK-14853) Support LeftSemi/LeftAnti in SortMergeJoin

2016-04-25 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14853: -- Assignee: Davies Liu > Support LeftSemi/LeftAnti in SortMergeJoin >

[jira] [Updated] (SPARK-4452) Shuffle data structures can starve others on the same thread for memory

2016-04-25 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-4452: -- Assignee: Lianhui Wang (was: Tianshuo Deng) > Shuffle data structures can starve others on the same

[jira] [Commented] (SPARK-4452) Shuffle data structures can starve others on the same thread for memory

2016-04-25 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15256602#comment-15256602 ] Davies Liu commented on SPARK-4452: --- We only backport critical bug fix into released branch. There is

[jira] [Resolved] (SPARK-14870) NPE in generate aggregate

2016-04-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14870. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12651

[jira] [Resolved] (SPARK-14881) pyspark and sparkR shell default log level should match spark-shell/Scala

2016-04-24 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14881. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12648

[jira] [Updated] (SPARK-14838) Implement statistics in SerializeFromObject to avoid failure when estimating sizeInBytes for ObjectType

2016-04-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14838: --- Assignee: Liang-Chi Hsieh > Implement statistics in SerializeFromObject to avoid failure when

[jira] [Resolved] (SPARK-14838) Implement statistics in SerializeFromObject to avoid failure when estimating sizeInBytes for ObjectType

2016-04-23 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14838. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12599

[jira] [Created] (SPARK-14870) NPE in generate aggregate

2016-04-23 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14870: -- Summary: NPE in generate aggregate Key: SPARK-14870 URL: https://issues.apache.org/jira/browse/SPARK-14870 Project: Spark Issue Type: Bug Components:

[jira] [Resolved] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14856. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12619

[jira] [Commented] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254744#comment-15254744 ] Davies Liu commented on SPARK-14858: I created a few JIRA related to subquery, it will be great if

[jira] [Resolved] (SPARK-12543) Support subquery in select/where/having

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12543. Resolution: Fixed Assignee: Herman van Hovell (was: Davies Liu) Fix Version/s:

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: {code} SELECT a from t where b > (select avg(c) from t2 where t.id =

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: SELECT a from t where b > (select avg(c) from t2 where t.id = t2.id) it

[jira] [Updated] (SPARK-14781) Support subquery in nested predicates

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14781: --- Assignee: (was: Davies Liu) > Support subquery in nested predicates >

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Assignee: (was: Davies Liu) > Support correlated scalar subquery >

[jira] [Created] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14858: -- Summary: Push predicates with subquery Key: SPARK-14858 URL: https://issues.apache.org/jira/browse/SPARK-14858 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14785: -- Assignee: Davies Liu > Support correlated scalar subquery >

[jira] [Updated] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14773: --- Assignee: Herman van Hovell > Enable the tests in HiveCompatibilitySuite for subquery >

[jira] [Commented] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254622#comment-15254622 ] Davies Liu commented on SPARK-14773: [~hvanhovell] Could you take this? > Enable the tests in

[jira] [Resolved] (SPARK-10129) math function: stddev_samp

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10129. Resolution: Not A Problem > math function: stddev_samp > -- > >

[jira] [Resolved] (SPARK-10600) SparkSQL - Support for Not Exists in a Correlated Subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10600. Resolution: Duplicate Assignee: Herman van Hovell Fix Version/s: 2.0.0 > SparkSQL

[jira] [Resolved] (SPARK-13831) TPC-DS Query 35 fails with the following compile error

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13831. Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.0.0 > TPC-DS Query

[jira] [Resolved] (SPARK-12545) Support exists condition

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12545. Resolution: Duplicate Assignee: Herman van Hovell (was: Davies Liu) Fix Version/s:

[jira] [Resolved] (SPARK-13347) Reuse the shuffle for duplicated exchange

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13347. Resolution: Duplicate Assignee: Davies Liu Fix Version/s: 2.0.0 > Reuse the

[jira] [Resolved] (SPARK-13348) Avoid duplicated broadcasts

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13348. Resolution: Fixed Assignee: Davies Liu Fix Version/s: 2.0.0 It's fixed by re-use

[jira] [Closed] (SPARK-13541) Flaky test: ParquetHadoopFsRelationSuite.test all data types - ByteType

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-13541. -- Resolution: Cannot Reproduce > Flaky test: ParquetHadoopFsRelationSuite.test all data types - ByteType

[jira] [Resolved] (SPARK-14669) Some SQL metrics is broken when whole-stage codegen enabled

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14669. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12425

[jira] [Created] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14856: -- Summary: Returning batch unexpected from wide table Key: SPARK-14856 URL: https://issues.apache.org/jira/browse/SPARK-14856 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-14791) TPCDS Q23B generate different result each time

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14791. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12600

[jira] [Created] (SPARK-14853) Support LeftSemi/LeftAnti in SortMergeJoin

2016-04-22 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14853: -- Summary: Support LeftSemi/LeftAnti in SortMergeJoin Key: SPARK-14853 URL: https://issues.apache.org/jira/browse/SPARK-14853 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-14762) Fail to parse TPCDS Q90

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14762. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12537

[jira] [Updated] (SPARK-13266) Python DataFrameReader converts None to "None" instead of null

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13266: --- Assignee: Liang-Chi Hsieh > Python DataFrameReader converts None to "None" instead of null >

[jira] [Resolved] (SPARK-13266) Python DataFrameReader converts None to "None" instead of null

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13266. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12494

[jira] [Updated] (SPARK-10001) Allow Ctrl-C in spark-shell to kill running job

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-10001: --- Assignee: Jakob Odersky > Allow Ctrl-C in spark-shell to kill running job >

[jira] [Resolved] (SPARK-10001) Allow Ctrl-C in spark-shell to kill running job

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10001. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12557

[jira] [Assigned] (SPARK-14791) TPCDS Q23B generate different result each time

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14791: -- Assignee: Davies Liu > TPCDS Q23B generate different result each time >

[jira] [Resolved] (SPARK-14369) Implement preferredLocations() for FileScanRDD

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14369. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12527

[jira] [Updated] (SPARK-14680) Support all datatypes to use VectorizedHashmap in TungstenAggregate

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14680: --- Assignee: Sameer Agarwal > Support all datatypes to use VectorizedHashmap in TungstenAggregate >

[jira] [Resolved] (SPARK-14680) Support all datatypes to use VectorizedHashmap in TungstenAggregate

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14680. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12440

[jira] [Updated] (SPARK-14793) Code generation for large complex type exceeds JVM size limit.

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14793: --- Assignee: Takuya Ueshin > Code generation for large complex type exceeds JVM size limit. >

[jira] [Resolved] (SPARK-14793) Code generation for large complex type exceeds JVM size limit.

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14793. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12559

[jira] [Resolved] (SPARK-14724) Improve performance of sorting by using radix sort when possible

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14724. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12490

[jira] [Updated] (SPARK-14780) Add `setLogLevel` to SparkR

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14780: --- Assignee: Dongjoon Hyun > Add `setLogLevel` to SparkR > --- > >

[jira] [Resolved] (SPARK-14780) Add `setLogLevel` to SparkR

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14780. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12547

[jira] [Resolved] (SPARK-4452) Shuffle data structures can starve others on the same thread for memory

2016-04-21 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-4452. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10024

[jira] [Created] (SPARK-14791) TPCDS Q23B generate different result each time

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14791: -- Summary: TPCDS Q23B generate different result each time Key: SPARK-14791 URL: https://issues.apache.org/jira/browse/SPARK-14791 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: SELECT a from t where b > (select c from t2 where t.id = t2.id) TPCDS

[jira] [Created] (SPARK-14785) Support correlated scalar subquery

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14785: -- Summary: Support correlated scalar subquery Key: SPARK-14785 URL: https://issues.apache.org/jira/browse/SPARK-14785 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-14784) Build SQL for EXISTS/IN subquery

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14784: -- Summary: Build SQL for EXISTS/IN subquery Key: SPARK-14784 URL: https://issues.apache.org/jira/browse/SPARK-14784 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-14781) Support subquery in nested predicates

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14781: -- Summary: Support subquery in nested predicates Key: SPARK-14781 URL: https://issues.apache.org/jira/browse/SPARK-14781 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250862#comment-15250862 ] Davies Liu commented on SPARK-14773: cc [~hvanhovell] > Enable the tests in HiveCompatibilitySuite

[jira] [Created] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14773: -- Summary: Enable the tests in HiveCompatibilitySuite for subquery Key: SPARK-14773 URL: https://issues.apache.org/jira/browse/SPARK-14773 Project: Spark Issue

[jira] [Resolved] (SPARK-13842) Consider __iter__ and __getitem__ methods for pyspark.sql.types.StructType

2016-04-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13842. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12251

[jira] [Assigned] (SPARK-14763) Can't analyze TPCDS Q70

2016-04-20 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14763: -- Assignee: Davies Liu (was: Herman van Hovell) > Can't analyze TPCDS Q70 >

[jira] [Created] (SPARK-14763) Can't analyze TPCDS Q70

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14763: -- Summary: Can't analyze TPCDS Q70 Key: SPARK-14763 URL: https://issues.apache.org/jira/browse/SPARK-14763 Project: Spark Issue Type: Bug Components:

[jira] [Created] (SPARK-14762) Fail to parse TPCDS Q90

2016-04-20 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14762: -- Summary: Fail to parse TPCDS Q90 Key: SPARK-14762 URL: https://issues.apache.org/jira/browse/SPARK-14762 Project: Spark Issue Type: Bug Components:

[jira] [Resolved] (SPARK-14639) Add `bround` function in Python/R.

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14639. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12509

[jira] [Updated] (SPARK-14639) Add `bround` function in Python/R.

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14639: --- Assignee: Dongjoon Hyun > Add `bround` function in Python/R. > -- >

[jira] [Resolved] (SPARK-14600) Push predicates through Expand

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14600. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12496

[jira] [Resolved] (SPARK-13419) SubquerySuite should use checkAnswer rather than ScalaTest's assertResult

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13419. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12269

[jira] [Resolved] (SPARK-14717) Scala, Python APIs for Dataset.unpersist differ in default blocking value

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14717. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12507

[jira] [Resolved] (SPARK-4226) SparkSQL - Add support for subqueries in predicates

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-4226. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12306

[jira] [Updated] (SPARK-14717) Scala, Python APIs for Dataset.unpersist differ in default blocking value

2016-04-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14717: --- Assignee: Felix Cheung > Scala, Python APIs for Dataset.unpersist differ in default blocking value >

<    1   2   3   4   5   6   7   8   9   10   >