git commit: [SPARK-1368][SQL] Optimized HiveTableScan

2014-05-29 Thread marmbrus
ms) def benchmark(f: = Unit) = { val begin = System.currentTimeMillis() f val end = System.currentTimeMillis() end - begin } } ``` @marmbrus Please help review, thanks! Author: Cheng Lian lian.cs@gmail.com Closes #758 from liancheng/fastHiveTableScan and squashes

git commit: Avoid dynamic dispatching when unwrapping Hive data.

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 9d0fae936 - 6db0d5cfe Avoid dynamic dispatching when unwrapping Hive data. This is a follow up of PR #758. The `unwrapHiveData` function is now composed statically before actual rows are scanned according to the field object

git commit: Avoid dynamic dispatching when unwrapping Hive data.

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ec8be274a - 862283e9c Avoid dynamic dispatching when unwrapping Hive data. This is a follow up of PR #758. The `unwrapHiveData` function is now composed statically before actual rows are scanned according to the field object inspector to

git commit: fix java.lang.ClassCastException

2014-06-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 350cfd311 - d96794132 fix java.lang.ClassCastException get Exception when run:bin/run-example org.apache.spark.examples.sql.RDDRelation Exception's detail is: Exception in thread main java.lang.ClassCastException: java.lang.Long

git commit: [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null.

2014-06-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 89cdbb087 - e4c11eef2 [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null. `CaseConversionExpression` should check if the evaluated value is `null`. Author: Takuya UESHIN ues...@happy-camper.st Closes

git commit: [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null.

2014-06-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 8100cbdb7 - 70383b11c [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null. `CaseConversionExpression` should check if the evaluated value is `null`. Author: Takuya UESHIN ues...@happy-camper.st

git commit: [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive.

2014-06-06 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 d3717bea9 - d7467484f [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive. Followup: #989 Author: Michael Armbrust mich...@databricks.com Closes #994 from marmbrus/caseSensitiveFunctions2 and squashes the following

git commit: [SPARK-1495][SQL]add support for left semi join

2014-06-09 Thread marmbrus
Closes #837 from adrian-wang/left-semi-join-support and squashes the following commits: d39cd12 [Daoyuan Wang] Merge pull request #1 from marmbrus/pr/837 6713c09 [Michael Armbrust] Better debugging for failed query tests. 035b73e [Michael Armbrust] Add test for left semi that can't be done

git commit: [SPARK-1495][SQL]add support for left semi join

2014-06-09 Thread marmbrus
...@intel.com Closes #837 from adrian-wang/left-semi-join-support and squashes the following commits: d39cd12 [Daoyuan Wang] Merge pull request #1 from marmbrus/pr/837 6713c09 [Michael Armbrust] Better debugging for failed query tests. 035b73e [Michael Armbrust] Add test for left semi that can't be done

git commit: [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD.

2014-06-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 65fa7bcac - 5a79ba13e [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD. This PR attempts to resolve [SPARK-1704](https://issues.apache.org/jira/browse/SPARK-1704) by introducing a physical plan for EXPLAIN commands,

git commit: [SPARK-1508][SQL] Add SQLConf to SQLContext.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 5a79ba13e - d5da81cdd [SPARK-1508][SQL] Add SQLConf to SQLContext. This PR (1) introduces a new class SQLConf that stores key-value properties for a SQLContext (2) clean up the semantics of various forms of SET commands. The SQLConf

git commit: Moved hiveOperators.scala to the right package folder

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 08ed9ad81 - a9a461c59 Moved hiveOperators.scala to the right package folder The package is `org.apache.spark.sql.hive.execution`, while the file was placed under `sql/hive/src/main/scala/org/apache/spark/sql/hive/`. Author: Cheng Lian

git commit: [SPARK-2076][SQL] Pushdown the join filter predication for outer join

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 884ca718b - db0c038a6 [SPARK-2076][SQL] Pushdown the join filter predication for outer join As the rule described in https://cwiki.apache.org/confluence/display/Hive/OuterJoinBehavior, we can optimize the SQL Join by pushing down the

git commit: HOTFIX: clear() configs in SQLConf-related unit tests.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a2052a44f - 601032f5b HOTFIX: clear() configs in SQLConf-related unit tests. Thanks goes to @liancheng, who pointed out that `sql/test-only *.SQLConfSuite *.SQLQuerySuite` passed but `sql/test-only *.SQLQuerySuite *.SQLConfSuite` failed.

git commit: HOTFIX: clear() configs in SQLConf-related unit tests.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 ac8c27bdf - 2cdce7cf3 HOTFIX: clear() configs in SQLConf-related unit tests. Thanks goes to @liancheng, who pointed out that `sql/test-only *.SQLConfSuite *.SQLQuerySuite` passed but `sql/test-only *.SQLQuerySuite *.SQLConfSuite`

git commit: [SPARK-2093] [SQL] NullPropagation should use exact type value.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 601032f5b - 0402bd77e [SPARK-2093] [SQL] NullPropagation should use exact type value. `NullPropagation` should use exact type value when transform `Count` or `Sum`. Author: Takuya UESHIN ues...@happy-camper.st Closes #1034 from

git commit: [SPARK-2093] [SQL] NullPropagation should use exact type value.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 2cdce7cf3 - 6d15e9f7c [SPARK-2093] [SQL] NullPropagation should use exact type value. `NullPropagation` should use exact type value when transform `Count` or `Sum`. Author: Takuya UESHIN ues...@happy-camper.st Closes #1034 from

git commit: [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 6d15e9f7c - 65ed7793d [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables JIRA issue: [SPARK-1968](https://issues.apache.org/jira/browse/SPARK-1968) This PR added support for SQL/HiveQL command for caching/uncaching

git commit: SPARK-2107: FilterPushdownSuite doesn't need Junit jar.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 0f1dc3a73 - 6e1193031 SPARK-2107: FilterPushdownSuite doesn't need Junit jar. Author: Qiuzhuang.Lian qiuzhuang.l...@gmail.com Closes #1046 from Qiuzhuang/master and squashes the following commits: 0a9921a [Qiuzhuang.Lian] SPARK-2107:

git commit: SPARK-2107: FilterPushdownSuite doesn't need Junit jar.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 65ed7793d - 54ff00547 SPARK-2107: FilterPushdownSuite doesn't need Junit jar. Author: Qiuzhuang.Lian qiuzhuang.l...@gmail.com Closes #1046 from Qiuzhuang/master and squashes the following commits: 0a9921a [Qiuzhuang.Lian] SPARK-2107:

git commit: [SPARK-2042] Prevent unnecessary shuffle triggered by take()

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4d5c12aa1 - 4107cce58 [SPARK-2042] Prevent unnecessary shuffle triggered by take() This PR implements `take()` on a `SchemaRDD` by inserting a logical limit that is followed by a `collect()`. This is also accompanied by adding a catalyst

git commit: [SPARK-2042] Prevent unnecessary shuffle triggered by take()

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 684a93a72 - cc004488d [SPARK-2042] Prevent unnecessary shuffle triggered by take() This PR implements `take()` on a `SchemaRDD` by inserting a logical limit that is followed by a `collect()`. This is also accompanied by adding a

git commit: [SQL] Code Cleanup: Left Semi Hash Join

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4107cce58 - ce6deb1e5 [SQL] Code Cleanup: Left Semi Hash Join Some improvement for PR #837, add another case to white list and use `filter` to build result iterator. Author: Daoyuan daoyuan.w...@intel.com Closes #1049 from

git commit: [SQL] Code Cleanup: Left Semi Hash Join

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 cc004488d - 597c7efdc [SQL] Code Cleanup: Left Semi Hash Join Some improvement for PR #837, add another case to white list and use `filter` to build result iterator. Author: Daoyuan daoyuan.w...@intel.com Closes #1049 from

git commit: [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d45e0c6b9 - 9a2448daf [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's. Add optimization for `CaseConversionExpression`'s. Author: Takuya UESHIN ues...@happy-camper.st Closes #990 from ueshin/issues/SPARK-2052 and

git commit: [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 81049eb54 - e3955643d [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's. Add optimization for `CaseConversionExpression`'s. Author: Takuya UESHIN ues...@happy-camper.st Closes #990 from ueshin/issues/SPARK-2052 and

git commit: [SPARK-2135][SQL] Use planner for in-memory scans

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f95ac686b - 13f8cfdc0 [SPARK-2135][SQL] Use planner for in-memory scans Author: Michael Armbrust mich...@databricks.com Closes #1072 from marmbrus/cachedStars and squashes the following commits: 8757c8e [Michael Armbrust] Use planner

git commit: [SPARK-2135][SQL] Use planner for in-memory scans

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 3962abaf9 - 57526e40a [SPARK-2135][SQL] Use planner for in-memory scans Author: Michael Armbrust mich...@databricks.com Closes #1072 from marmbrus/cachedStars and squashes the following commits: 8757c8e [Michael Armbrust] Use planner

git commit: [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 70c8116c0 - 1c2fd015b [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType Author: Michael Armbrust mich...@databricks.com Closes #1061 from marmbrus/timestamp and squashes the following commits: 79c3903 [Michael

git commit: [SPARK-2094][SQL] Exactly once semantics for DDL and command statements

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1c2fd015b - ac96d9657 [SPARK-2094][SQL] Exactly once semantics for DDL and command statements ## Related JIRA issues - Main issue: - [SPARK-2094](https://issues.apache.org/jira/browse/SPARK-2094): Ensure exactly once semantics for

git commit: [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 bc8d24ac1 - eb2f5c50c [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType Author: Michael Armbrust mich...@databricks.com Closes #1061 from marmbrus/timestamp and squashes the following commits: 79c3903 [Michael

git commit: [SQL] Support transforming TreeNodes with Option children.

2014-06-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 7dd9fc67a - 269fc62b2 [SQL] Support transforming TreeNodes with Option children. Thanks goes to @marmbrus for his implementation. Author: Michael Armbrust mich...@databricks.com Author: Zongheng Yang zonghen...@gmail.com Closes #1074

git commit: [SQL] Break hiveOperators.scala into multiple files.

2014-06-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ca5d8b590 - ec935abce [SQL] Break hiveOperators.scala into multiple files. The single file was getting very long (500+ loc). Author: Reynold Xin r...@apache.org Closes #1166 from rxin/hiveOperators and squashes the following commits:

git commit: [SQL]Add base row updating methods for JoinedRow

2014-06-24 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 05f84e28f - c3ebf8ee6 [SQL]Add base row updating methods for JoinedRow This will be helpful in join operators. Author: Cheng Hao hao.ch...@intel.com Closes #1187 from chenghao-intel/joinedRow and squashes the following commits:

git commit: [SQL] SPARK-1800 Add broadcast hash join operator associated hints.

2014-06-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1132e472e - 9d824fed8 [SQL] SPARK-1800 Add broadcast hash join operator associated hints. This PR is based off Michael's [PR 734](https://github.com/apache/spark/pull/734) and includes a bunch of cleanups. Moreover, this PR also - makes

git commit: [SPARK-2283][SQL] Reset test environment before running PruningSuite

2014-06-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9d824fed8 - 7f196b009 [SPARK-2283][SQL] Reset test environment before running PruningSuite JIRA issue: [SPARK-2283](https://issues.apache.org/jira/browse/SPARK-2283) If `PruningSuite` is run right after `HiveCompatibilitySuite`, the first

git commit: [SPARK-2295] [SQL] Make JavaBeans nullability stricter.

2014-06-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 45bf91025 - a7bebd14d [SPARK-2295] [SQL] Make JavaBeans nullability stricter. Author: Takuya UESHIN ues...@happy-camper.st Closes #1235 from ueshin/issues/SPARK-2295 and squashes the following commits: 201c508 [Takuya UESHIN] Make

git commit: SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 6596392da - 5c6ec94da SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG **Description** This patch enables using the `.select()` function in SchemaRDD with functions such as `Sum`, `Count` and other.

git commit: SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0-jdbc 9d3c58c13 - 8a6ee2f37 SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG **Description** This patch enables using the `.select()` function in SchemaRDD with functions such as `Sum`, `Count` and other.

git commit: SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 a4c754194 - 69112b0bf SPARK-2186: Spark SQL DSL support for simple aggregations such as SUM and AVG **Description** This patch enables using the `.select()` function in SchemaRDD with functions such as `Sum`, `Count` and other.

git commit: [SPARK-2328] [SQL] Add execution of `SHOW TABLES` before `TestHive.reset()`.

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 5c6ec94da - 1e2c26c83 [SPARK-2328] [SQL] Add execution of `SHOW TABLES` before `TestHive.reset()`. `PruningSuite` is executed first of Hive tests unfortunately, `TestHive.reset()` breaks the test environment. To prevent this, we must run

git commit: [SPARK-2328] [SQL] Add execution of `SHOW TABLES` before `TestHive.reset()`.

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0-jdbc 8a6ee2f37 - 56d674fe7 [SPARK-2328] [SQL] Add execution of `SHOW TABLES` before `TestHive.reset()`. `PruningSuite` is executed first of Hive tests unfortunately, `TestHive.reset()` breaks the test environment. To prevent this, we

git commit: [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes.

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 552e28b63 - 87b74a9bf [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes. Author: Takuya UESHIN ues...@happy-camper.st Closes #1226 from ueshin/issues/SPARK-2287 and squashes the following commits: 32ef7c3

git commit: [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes.

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0-jdbc 56d674fe7 - cf0d14b01 [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes. Author: Takuya UESHIN ues...@happy-camper.st Closes #1226 from ueshin/issues/SPARK-2287 and squashes the following commits:

git commit: [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes.

2014-07-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1e2c26c83 - bc7041a42 [SPARK-2287] [SQL] Make ScalaReflection be able to handle Generic case classes. Author: Takuya UESHIN ues...@happy-camper.st Closes #1226 from ueshin/issues/SPARK-2287 and squashes the following commits: 32ef7c3

svn commit: r1607449 - in /spark: images/spark-stack.png site/images/spark-stack.png

2014-07-02 Thread marmbrus
Author: marmbrus Date: Wed Jul 2 19:01:58 2014 New Revision: 1607449 URL: http://svn.apache.org/r1607449 Log: Remove YARN from stack diagram. Modified: spark/images/spark-stack.png spark/site/images/spark-stack.png Modified: spark/images/spark-stack.png URL: http://svn.apache.org

git commit: [SPARK-2342] Evaluation helper's output type doesn't conform to input ty...

2014-07-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 2b36344f5 - a9b52e562 [SPARK-2342] Evaluation helper's output type doesn't conform to input ty... The function cast doesn't conform to the intention of Those expressions are supposed to be in the same data type, and also the return type.

git commit: [SPARK-2342] Evaluation helper's output type doesn't conform to input ty...

2014-07-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 fdee6ee06 - 7766c9d26 [SPARK-2342] Evaluation helper's output type doesn't conform to input ty... The function cast doesn't conform to the intention of Those expressions are supposed to be in the same data type, and also the return

git commit: [HOTFIX] Synchronize on SQLContext.settings in tests.

2014-07-03 Thread marmbrus
/SPARK_HADOOP_VERSION=1.0.4,label=centos/81/). pwendell marmbrus Author: Zongheng Yang zonghen...@gmail.com Closes #1277 from concretevitamin/test-fix and squashes the following commits: 28c88bd [Zongheng Yang] Synchronize on SQLContext.settings in tests. Project: http://git-wip-us.apache.org/repos

git commit: [SPARK-2234][SQL]Spark SQL basicOperators add Except operator

2014-07-04 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master b3e768e15 - 5dadda864 [SPARK-2234][SQL]Spark SQL basicOperators add Except operator Hi all, I want to submit a Except operator in basicOperators.scala In SQL case.SQL support two table do except operator. select * from table1 except

git commit: [SPARK-2366] [SQL] Add column pruning for the right side of LeftSemi join.

2014-07-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 42f3abd52 - 3da8df939 [SPARK-2366] [SQL] Add column pruning for the right side of LeftSemi join. The right side of `LeftSemi` join needs columns only used in join condition. Author: Takuya UESHIN ues...@happy-camper.st Closes #1301 from

git commit: [SPARK-2366] [SQL] Add column pruning for the right side of LeftSemi join.

2014-07-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 b77715a5b - 3aa52be39 [SPARK-2366] [SQL] Add column pruning for the right side of LeftSemi join. The right side of `LeftSemi` join needs columns only used in join condition. Author: Takuya UESHIN ues...@happy-camper.st Closes #1301

git commit: [SPARK-2327] [SQL] Fix nullabilities of Join/Generate/Aggregate.

2014-07-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3da8df939 - 9d5ecf820 [SPARK-2327] [SQL] Fix nullabilities of Join/Generate/Aggregate. Fix nullabilities of `Join`/`Generate`/`Aggregate` because: - Output attributes of opposite side of `OuterJoin` should be nullable. - Output attributes

git commit: [SPARK-2339][SQL] SQL parser in sql-core is case sensitive, but a table alias is converted to lower case when we create Subquery

2014-07-07 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 5044ba60a - e522971e8 [SPARK-2339][SQL] SQL parser in sql-core is case sensitive, but a table alias is converted to lower case when we create Subquery Reported by

git commit: [SPARK-2339][SQL] SQL parser in sql-core is case sensitive, but a table alias is converted to lower case when we create Subquery

2014-07-07 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f7ce1b3b4 - c0b4cf097 [SPARK-2339][SQL] SQL parser in sql-core is case sensitive, but a table alias is converted to lower case when we create Subquery Reported by

git commit: [SPARK-2375][SQL] JSON schema inference may not resolve type conflicts correctly for a field inside an array of structs

2014-07-07 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 691b554f3 - 1032c2875 [SPARK-2375][SQL] JSON schema inference may not resolve type conflicts correctly for a field inside an array of structs For example, for ``` {array: [{field:214748364700}, {field:1}]} ``` the type of field is

git commit: [SPARK-2376][SQL] Selecting list values inside nested JSON objects raises java.lang.IllegalArgumentException

2014-07-07 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f0496ee10 - 4352a2fda [SPARK-2376][SQL] Selecting list values inside nested JSON objects raises java.lang.IllegalArgumentException JIRA: https://issues.apache.org/jira/browse/SPARK-2376 Author: Yin Huai h...@cse.ohio-state.edu Closes

git commit: [SPARK-2376][SQL] Selecting list values inside nested JSON objects raises java.lang.IllegalArgumentException

2014-07-07 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 1032c2875 - 9dce7beff [SPARK-2376][SQL] Selecting list values inside nested JSON objects raises java.lang.IllegalArgumentException JIRA: https://issues.apache.org/jira/browse/SPARK-2376 Author: Yin Huai h...@cse.ohio-state.edu

git commit: [SPARK-2395][SQL] Optimize common LIKE patterns.

2014-07-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 56e009d4f - cc3e0a14d [SPARK-2395][SQL] Optimize common LIKE patterns. Author: Michael Armbrust mich...@databricks.com Closes #1325 from marmbrus/slowLike and squashes the following commits: 023c3eb [Michael Armbrust] add comment

git commit: [SPARK-2395][SQL] Optimize common LIKE patterns.

2014-07-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 3e95225c7 - 4bf8ddaee [SPARK-2395][SQL] Optimize common LIKE patterns. Author: Michael Armbrust mich...@databricks.com Closes #1325 from marmbrus/slowLike and squashes the following commits: 023c3eb [Michael Armbrust] add comment

git commit: [SPARK-2431][SQL] Refine StringComparison and related codes.

2014-07-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ae8ca4dfb - f62c42728 [SPARK-2431][SQL] Refine StringComparison and related codes. Refine `StringComparison` and related codes as follows: - `StringComparison` could be similar to `StringRegexExpression` or `CaseConversionExpression`. -

git commit: [SPARK-2431][SQL] Refine StringComparison and related codes.

2014-07-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 5bfd319d7 - 1fef57bf9 [SPARK-2431][SQL] Refine StringComparison and related codes. Refine `StringComparison` and related codes as follows: - `StringComparison` could be similar to `StringRegexExpression` or `CaseConversionExpression`.

git commit: [SPARK-2415] [SQL] RowWriteSupport should handle empty ArrayType correctly.

2014-07-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f62c42728 - f5abd2712 [SPARK-2415] [SQL] RowWriteSupport should handle empty ArrayType correctly. `RowWriteSupport` doesn't write empty `ArrayType` value, so the read value becomes `null`. It should write empty `ArrayType` value as it is.

git commit: [SPARK-2428][SQL] Add except and intersect methods to SchemaRDD.

2014-07-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f5abd2712 - 10b59ba23 [SPARK-2428][SQL] Add except and intersect methods to SchemaRDD. Author: Takuya UESHIN ues...@happy-camper.st Closes #1355 from ueshin/issues/SPARK-2428 and squashes the following commits: b6fa264 [Takuya UESHIN]

git commit: [SPARK-2443][SQL] Fix slow read from partitioned tables

2014-07-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 38ccd6ebd - d60b09bb6 [SPARK-2443][SQL] Fix slow read from partitioned tables This fix obtains a comparable performance boost as [PR #1390](https://github.com/apache/spark/pull/1390) by moving an array update and deserializer

git commit: [SPARK-2443][SQL] Fix slow read from partitioned tables

2014-07-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 baf92a0f2 - 2ec7d7ab7 [SPARK-2443][SQL] Fix slow read from partitioned tables This fix obtains a comparable performance boost as [PR #1390](https://github.com/apache/spark/pull/1390) by moving an array update and deserializer

git commit: [SPARK-2446][SQL] Add BinaryType support to Parquet I/O.

2014-07-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3dd8af7a6 - 9fe693b5b [SPARK-2446][SQL] Add BinaryType support to Parquet I/O. Note that this commit changes the semantics when loading in data that was created with prior versions of Spark SQL. Before, we were writing out strings as

git commit: [SPARK-2483][SQL] Fix parsing of repeated, nested data access.

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a21f9a754 - 0f98ef1a2 [SPARK-2483][SQL] Fix parsing of repeated, nested data access. Author: Michael Armbrust mich...@databricks.com Closes #1411 from marmbrus/nestedRepeated and squashes the following commits: 044fa09 [Michael Armbrust

git commit: [SPARK-2483][SQL] Fix parsing of repeated, nested data access.

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 53a6399e5 - 3aa120cd4 [SPARK-2483][SQL] Fix parsing of repeated, nested data access. Author: Michael Armbrust mich...@databricks.com Closes #1411 from marmbrus/nestedRepeated and squashes the following commits: 044fa09 [Michael

git commit: [SPARK-2474][SQL] For a registered table in OverrideCatalog, the Analyzer failed to resolve references in the format of tableName.fieldName

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master bcd0c30c7 - 8af46d584 [SPARK-2474][SQL] For a registered table in OverrideCatalog, the Analyzer failed to resolve references in the format of tableName.fieldName Please refer to JIRA (https://issues.apache.org/jira/browse/SPARK-2474) for

git commit: SPARK-2407: Added internal implementation of SQL SUBSTR()

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 8af46d584 - 61de65bc6 SPARK-2407: Added internal implementation of SQL SUBSTR() This replaces the Hive UDF for SUBSTR(ING) with an implementation in Catalyst and adds tests to verify correct operation. Author: William Benton

git commit: SPARK-2407: Added internal implementation of SQL SUBSTR()

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 f2bf6511a - 2db77e931 SPARK-2407: Added internal implementation of SQL SUBSTR() This replaces the Hive UDF for SUBSTR(ING) with an implementation in Catalyst and adds tests to verify correct operation. Author: William Benton

git commit: [SQL] Attribute equality comparisons should be done by exprId.

2014-07-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 2db77e931 - 8da0fd869 [SQL] Attribute equality comparisons should be done by exprId. Author: Michael Armbrust mich...@databricks.com Closes #1414 from marmbrus/exprIdResolution and squashes the following commits: 97b47bc [Michael

git commit: [SPARK-2119][SQL] Improved Parquet performance when reading off S3

2014-07-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 632fb3d9a - efc452a16 [SPARK-2119][SQL] Improved Parquet performance when reading off S3 JIRA issue: [SPARK-2119](https://issues.apache.org/jira/browse/SPARK-2119) Essentially this PR fixed three issues to gain much better performance

git commit: [SPARK-2535][SQL] Add StringComparison case to NullPropagation.

2014-07-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d88f6be44 - 3a1709fa5 [SPARK-2535][SQL] Add StringComparison case to NullPropagation. `StringComparison` expressions including `null` literal cases could be added to `NullPropagation`. Author: Takuya UESHIN ues...@happy-camper.st Closes

git commit: [SPARK-2540] [SQL] Add HiveDecimal HiveVarchar support in unwrapping data

2014-07-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3a1709fa5 - 7f1720813 [SPARK-2540] [SQL] Add HiveDecimal HiveVarchar support in unwrapping data Author: Cheng Hao hao.ch...@intel.com Closes #1436 from chenghao-intel/unwrapdata and squashes the following commits: 34cc21a [Cheng Hao]

git commit: [SPARK-2540] [SQL] Add HiveDecimal HiveVarchar support in unwrapping data

2014-07-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 284bf1054 - 7611840f2 [SPARK-2540] [SQL] Add HiveDecimal HiveVarchar support in unwrapping data Author: Cheng Hao hao.ch...@intel.com Closes #1436 from chenghao-intel/unwrapdata and squashes the following commits: 34cc21a [Cheng

[1/2] [SPARK-2190][SQL] Specialized ColumnType for Timestamp

2014-07-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master db56f2df1 - cd273a238 http://git-wip-us.apache.org/repos/asf/spark/blob/cd273a23/sql/hive/src/test/resources/golden/timestamp_2-52-de3c42ab06c17ae895fd7deaf7bd9571 -- diff

git commit: Fix flakey HiveQuerySuite test

2014-07-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 872538c60 - abeacffb7 Fix flakey HiveQuerySuite test Result may not be returned in the expected order, so relax that constraint. Author: Aaron Davidson aa...@databricks.com Closes #1514 from aarondav/flakey and squashes the following

git commit: [SPARK-2561][SQL] Fix apply schema

2014-07-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a4d60208e - 511a73140 [SPARK-2561][SQL] Fix apply schema We need to use the analyzed attributes otherwise we end up with a tree that will never resolve. Author: Michael Armbrust mich...@databricks.com Closes #1470 from marmbrus

git commit: [SPARK-2561][SQL] Fix apply schema

2014-07-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 cdcd46717 - 84bbfbd84 [SPARK-2561][SQL] Fix apply schema We need to use the analyzed attributes otherwise we end up with a tree that will never resolve. Author: Michael Armbrust mich...@databricks.com Closes #1470 from marmbrus

git commit: [SPARK-2615] [SQL] Add Equal Sign == Support for HiveQl

2014-07-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 84bbfbd84 - c6421b6f6 [SPARK-2615] [SQL] Add Equal Sign == Support for HiveQl Currently, the == in HiveQL expression will cause exception thrown, this patch will fix it. Author: Cheng Hao hao.ch...@intel.com Closes #1522 from

git commit: [SPARK-2615] [SQL] Add Equal Sign == Support for HiveQl

2014-07-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 85d3596e6 - 79fe7634f [SPARK-2615] [SQL] Add Equal Sign == Support for HiveQl Currently, the == in HiveQL expression will cause exception thrown, this patch will fix it. Author: Cheng Hao hao.ch...@intel.com Closes #1522 from

git commit: [SPARK-2588][SQL] Add some more DSLs.

2014-07-23 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f776bc988 - 1b790cf77 [SPARK-2588][SQL] Add some more DSLs. Author: Takuya UESHIN ues...@happy-camper.st Closes #1491 from ueshin/issues/SPARK-2588 and squashes the following commits: 43d0a46 [Takuya UESHIN] Merge branch 'master' into

git commit: SPARK-2226: [SQL] transform HAVING clauses with aggregate expressions that aren't in the aggregation list

2014-07-23 Thread marmbrus
the following commits: 92c9a93 [William Benton] Removed unnecessary import f1d4f34 [William Benton] Cleanups missed in prior commit 0e1624f [William Benton] Incorporated suggestions from @marmbrus; thanks! 541d4ee [William Benton] Cleanups from review 5a12647 [William Benton] Explanatory comments

git commit: [SPARK-2569][SQL] Fix shipping of TEMPORARY hive UDFs.

2014-07-23 Thread marmbrus
pretty large to move out the type conversion code to its own file. Author: Michael Armbrust mich...@databricks.com Closes #1552 from marmbrus/fixTempUdfs and squashes the following commits: b695904 [Michael Armbrust] Make add jar execute with Hive. Ship the whole function class name since

git commit: [SPARK-2102][SQL][CORE] Add option for kryo registration required and use a resource pool in Spark SQL for Kryo instances.

2014-07-23 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1871574a2 - efdaeb111 [SPARK-2102][SQL][CORE] Add option for kryo registration required and use a resource pool in Spark SQL for Kryo instances. Author: Ian O Connell ioconn...@twitter.com Closes #1377 from ianoc/feature/SPARK-2102 and

git commit: [SPARK-2603][SQL] Remove unnecessary toMap and toList in converting Java collections to Scala collections JsonRDD.scala

2014-07-24 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9fd141477 - b352ef175 [SPARK-2603][SQL] Remove unnecessary toMap and toList in converting Java collections to Scala collections JsonRDD.scala In JsonRDD.scalafy, we are using toMap/toList to convert a Java Map/List to a Scala one. These

git commit: [SPARK-2665] [SQL] Add EqualNS Unit Tests

2014-07-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master eb82abd8e - 184aa1c6c [SPARK-2665] [SQL] Add EqualNS Unit Tests Hive Supports the operator =, which returns same result with EQUAL(=) operator for non-null operands, but returns TRUE if both are NULL, FALSE if one of the them is NULL.

[2/2] git commit: [SPARK-2410][SQL] Merging Hive Thrift/JDBC server

2014-07-25 Thread marmbrus
[SPARK-2410][SQL] Merging Hive Thrift/JDBC server JIRA issue: - Main: [SPARK-2410](https://issues.apache.org/jira/browse/SPARK-2410) - Related: [SPARK-2678](https://issues.apache.org/jira/browse/SPARK-2678) Cherry picked the Hive Thrift/JDBC server from

[1/2] [SPARK-2410][SQL] Merging Hive Thrift/JDBC server

2014-07-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 32bcf9af9 - 06dc0d2c6 http://git-wip-us.apache.org/repos/asf/spark/blob/06dc0d2c/sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/server/SparkSQLOperationManager.scala

git commit: [SQL]Update HiveMetastoreCatalog.scala

2014-07-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a19d8c89d - ab3c6a455 [SQL]Update HiveMetastoreCatalog.scala I think it's better to defined hiveQlTable as a val Author: baishuo(白硕) vc_j...@hotmail.com Closes #1569 from baishuo/patch-1 and squashes the following commits: dc2f895

[2/2] git commit: Revert [SPARK-2410][SQL] Merging Hive Thrift/JDBC server

2014-07-25 Thread marmbrus
Revert [SPARK-2410][SQL] Merging Hive Thrift/JDBC server This reverts commit 06dc0d2c6b69c5d59b4d194ced2ac85bfe2e05e2. #1399 is making Jenkins fail. We should investigate and put this back after its passing tests. Author: Michael Armbrust mich...@databricks.com Closes #1594 from marmbrus

[1/2] Revert [SPARK-2410][SQL] Merging Hive Thrift/JDBC server

2014-07-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 37ad3b724 - afd757a24 http://git-wip-us.apache.org/repos/asf/spark/blob/afd757a2/sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/server/SparkSQLOperationManager.scala

git commit: [SPARK-2659][SQL] Fix division semantics for hive

2014-07-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9d8666cac - 890479123 [SPARK-2659][SQL] Fix division semantics for hive Author: Michael Armbrust mich...@databricks.com Closes #1557 from marmbrus/fixDivision and squashes the following commits: b85077f [Michael Armbrust] Fix unit tests

git commit: [SPARK-2523] [SQL] Hadoop table scan bug fixing

2014-07-28 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a7d145e98 - 2b8d89e30 [SPARK-2523] [SQL] Hadoop table scan bug fixing In HiveTableScan.scala, ObjectInspector was created for all of the partition based records, which probably causes ClassCastException if the object inspector is not

[2/2] git commit: [SPARK-2410][SQL] Merging Hive Thrift/JDBC server (with Maven profile fix)

2014-07-28 Thread marmbrus
[SPARK-2410][SQL] Merging Hive Thrift/JDBC server (with Maven profile fix) JIRA issue: [SPARK-2410](https://issues.apache.org/jira/browse/SPARK-2410) Another try for #1399 #1600. Those two PR breaks Jenkins builds because we made a separate profile `hive-thriftserver` in sub-project

git commit: [SPARK-2730][SQL] When retrieving a value from a Map, GetItem evaluates key twice

2014-07-29 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 e0bc72eb7 - 3143e51d7 [SPARK-2730][SQL] When retrieving a value from a Map, GetItem evaluates key twice JIRA: https://issues.apache.org/jira/browse/SPARK-2730 Author: Yin Huai h...@cse.ohio-state.edu Closes #1637 from

git commit: [SPARK-2674] [SQL] [PySpark] support datetime type for SchemaRDD

2014-07-29 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e3643485d - f0d880e28 [SPARK-2674] [SQL] [PySpark] support datetime type for SchemaRDD Datetime and time in Python will be converted into java.util.Calendar after serialization, it will be converted into java.sql.Timestamp during

git commit: [SPARK-2716][SQL] Don't check resolved for having filters.

2014-07-29 Thread marmbrus
clause aggregation rewrite. Author: Michael Armbrust mich...@databricks.com Closes #1640 from marmbrus/havingNoRef and squashes the following commits: 92d3901 [Michael Armbrust] Don't check resolved for having filters. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http

  1   2   3   4   5   6   7   8   9   10   >