spark git commit: [HOTFIX] Use 'new Job' in fsBasedParquet.scala

2015-05-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 32e27df41 -> 728af88cf [HOTFIX] Use 'new Job' in fsBasedParquet.scala Same issue as #6095 cc liancheng Author: zsxwing Closes #6136 from zsxwing/hotfix and squashes the following commits: 4beea54 [zsxwing] Use 'new Job' in fsBasedParqu

spark git commit: [SPARK-7595] [SQL] Window will cause resolve failed with self join

2015-05-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d3db2fd66 -> 13e652b61 [SPARK-7595] [SQL] Window will cause resolve failed with self join for example: table: src(key string, value string) sql: with v1 as(select key, count(value) over (partition by key) cnt_val from src), v2 as(select v1

spark git commit: [SPARK-7595] [SQL] Window will cause resolve failed with self join

2015-05-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 9ab4db29f -> c80e0cff2 [SPARK-7595] [SQL] Window will cause resolve failed with self join for example: table: src(key string, value string) sql: with v1 as(select key, count(value) over (partition by key) cnt_val from src), v2 as(selec

spark git commit: [SPARK-7548] [SQL] Add explode function for DataFrames

2015-05-14 Thread marmbrus
Closes #6107 from marmbrus/explodeFunction and squashes the following commits: 7ee2c87 [Michael Armbrust] whitespace 6f80ba3 [Michael Armbrust] Update dataframe.py c176c89 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into explodeFunction 81b5da3 [Michael Armbrust] styl

spark git commit: [SPARK-7548] [SQL] Add explode function for DataFrames

2015-05-14 Thread marmbrus
Armbrust Closes #6107 from marmbrus/explodeFunction and squashes the following commits: 7ee2c87 [Michael Armbrust] whitespace 6f80ba3 [Michael Armbrust] Update dataframe.py c176c89 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into explodeFunction 81b5da3 [Michael Armbr

spark git commit: [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive

2015-05-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 564562874 -> 2ca60ace8 [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive Author: Michael Armbrust Closes #6167 from marmbrus/configureIsolation and squashes the following commits: 6147cbe [Michael Armbr

spark git commit: [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive

2015-05-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 53d6ab51b -> a8556086d [SPARK-7491] [SQL] Allow configuration of classloader isolation for hive Author: Michael Armbrust Closes #6167 from marmbrus/configureIsolation and squashes the following commits: 6147cbe [Michael Armbr

spark git commit: [SPARK-6888] [SQL] Make the jdbc driver handling user-definable

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 563bfcc1a -> e1ac2a955 [SPARK-6888] [SQL] Make the jdbc driver handling user-definable Replace the DriverQuirks with JdbcDialect(s) (and MySQLDialect/PostgresDialect) and allow developers to change the dialects on the fly (for new JDBCRRDs

spark git commit: [SPARK-6888] [SQL] Make the jdbc driver handling user-definable

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 a0ae8ce01 -> b41301a13 [SPARK-6888] [SQL] Make the jdbc driver handling user-definable Replace the DriverQuirks with JdbcDialect(s) (and MySQLDialect/PostgresDialect) and allow developers to change the dialects on the fly (for new JDBCR

spark git commit: [SPARK-7570] [SQL] Ignores _temporary during partition discovery

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e1ac2a955 -> 010a1c278 [SPARK-7570] [SQL] Ignores _temporary during partition discovery [https://reviewable.io/review_button.png"; height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/6091) Author: Cheng Li

spark git commit: [SPARK-7570] [SQL] Ignores _temporary during partition discovery

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 b41301a13 -> c7623a20a [SPARK-7570] [SQL] Ignores _temporary during partition discovery [https://reviewable.io/review_button.png"; height=40 alt="Review on Reviewable"/>](https://reviewable.io/reviews/apache/spark/6091) Author: Chen

spark git commit: [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 c7623a20a -> 7d44c01c5 [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf Author: Wenchen Fan Closes #6235 from cloud-fan/tmp and squashes the following commits: 8f16367 [Wenchen Fan] use private[this] (cherry picked fr

spark git commit: [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 010a1c278 -> 56ede8848 [SQL] [MINOR] [THIS] use private for internal field in ScalaUdf Author: Wenchen Fan Closes #6235 from cloud-fan/tmp and squashes the following commits: 8f16367 [Wenchen Fan] use private[this] Project: http://git-

spark git commit: [SPARK-2883] [SQL] ORC data source for Spark SQL

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9c7e802a5 -> aa31e431f [SPARK-2883] [SQL] ORC data source for Spark SQL This PR updates PR #6135 authored by zhzhan from Hortonworks. This PR implements a Spark SQL data source for accessing ORC files. > **NOTE** > > Although ORC is

spark git commit: [SPARK-2883] [SQL] ORC data source for Spark SQL

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 cf4e04a0c -> 65d71bd9f [SPARK-2883] [SQL] ORC data source for Spark SQL This PR updates PR #6135 authored by zhzhan from Hortonworks. This PR implements a Spark SQL data source for accessing ORC files. > **NOTE** > > Although OR

spark git commit: [SPARK-7631] [SQL] treenode argString should not print children

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master aa31e431f -> fc2480ed1 [SPARK-7631] [SQL] treenode argString should not print children spark-sql> > explain extended > select * from ( > select key from src union all > select key from src) t; now the spark plan will print children in argS

spark git commit: [SPARK-7631] [SQL] treenode argString should not print children

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 65d71bd9f -> dbd4ec807 [SPARK-7631] [SQL] treenode argString should not print children spark-sql> > explain extended > select * from ( > select key from src union all > select key from src) t; now the spark plan will print children in

spark git commit: [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals)

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master fc2480ed1 -> 103c863c2 [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals) A modified version of https://github.com/apache/spark/pull/6110, use `semanticEquals` to make it more efficient. Author: Wenchen Fan Closes

spark git commit: [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals)

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 dbd4ec807 -> d6f5f3791 [SPARK-7269] [SQL] Incorrect analysis for aggregation(use semanticEquals) A modified version of https://github.com/apache/spark/pull/6110, use `semanticEquals` to make it more efficient. Author: Wenchen Fan Cl

spark git commit: [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 103c863c2 -> 530397ba2 [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis cc liancheng marmbrus Author: Yin Huai Closes #6130 from yhuai/directOutput and squashes the following comm

spark git commit: [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis

2015-05-18 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 d6f5f3791 -> a385f4b8d [SPARK-7567] [SQL] [follow-up] Use a new flag to set output committer based on mapreduce apis cc liancheng marmbrus Author: Yin Huai Closes #6130 from yhuai/directOutput and squashes the following comm

spark git commit: [SPARK-7662] [SQL] Resolve correct names for generator in projection

2015-05-19 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4de74d260 -> bcb1ff814 [SPARK-7662] [SQL] Resolve correct names for generator in projection ``` select explode(map(value, key)) from src; ``` Throws exception ``` org.apache.spark.sql.AnalysisException: The number of aliases supplied in the

spark git commit: [SPARK-7662] [SQL] Resolve correct names for generator in projection

2015-05-19 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 87fa8ccd2 -> 62b4c7392 [SPARK-7662] [SQL] Resolve correct names for generator in projection ``` select explode(map(value, key)) from src; ``` Throws exception ``` org.apache.spark.sql.AnalysisException: The number of aliases supplied in

spark git commit: [SPARK-7656] [SQL] use CatalystConf in FunctionRegistry

2015-05-19 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 386052063 -> 60336e3bc [SPARK-7656] [SQL] use CatalystConf in FunctionRegistry follow up for #5806 Author: scwf Closes #6164 from scwf/FunctionRegistry and squashes the following commits: 15e6697 [scwf] use catalogconf in FunctionRegist

spark git commit: [SPARK-7656] [SQL] use CatalystConf in FunctionRegistry

2015-05-19 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 2ef04a162 -> 86893390c [SPARK-7656] [SQL] use CatalystConf in FunctionRegistry follow up for #5806 Author: scwf Closes #6164 from scwf/FunctionRegistry and squashes the following commits: 15e6697 [scwf] use catalogconf in FunctionRe

spark git commit: [SQL] [TEST] udf_java_method failed due to jdk version

2015-05-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4f572008f -> f6c486aa4 [SQL] [TEST] udf_java_method failed due to jdk version java.lang.Math.exp(1.0) has different result between jdk versions. so do not use createQueryTest, write a separate test for it. ``` jdk version result 1.7.0_

spark git commit: [SQL] [TEST] udf_java_method failed due to jdk version

2015-05-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 f6a29c72c -> e79ecc7dc [SQL] [TEST] udf_java_method failed due to jdk version java.lang.Math.exp(1.0) has different result between jdk versions. so do not use createQueryTest, write a separate test for it. ``` jdk version result 1.

spark git commit: [SPARK-6743] [SQL] Fix empty projections of cached data

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4e5220c31 -> 3b68cb043 [SPARK-6743] [SQL] Fix empty projections of cached data Author: Michael Armbrust Closes #6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking bra

spark git commit: [SPARK-6743] [SQL] Fix empty projections of cached data

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 1a284743e -> 427dc04c1 [SPARK-6743] [SQL] Fix empty projections of cached data Author: Michael Armbrust Closes #6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-track

spark git commit: [SPARK-7758] [SQL] Override more configs to avoid failure when connect to a postgre sql

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 2904d3f8b -> 40989cea0 [SPARK-7758] [SQL] Override more configs to avoid failure when connect to a postgre sql https://issues.apache.org/jira/browse/SPARK-7758 When initializing `executionHive`, we only masks `javax.jdo.option.Connect

spark git commit: [SPARK-7758] [SQL] Override more configs to avoid failure when connect to a postgre sql

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master eac00691d -> 31d5d463e [SPARK-7758] [SQL] Override more configs to avoid failure when connect to a postgre sql https://issues.apache.org/jira/browse/SPARK-7758 When initializing `executionHive`, we only masks `javax.jdo.option.ConnectionU

spark git commit: [SPARK-7724] [SQL] Support Intersect/Except in Catalyst DSL.

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 31d5d463e -> e4aef91fe [SPARK-7724] [SQL] Support Intersect/Except in Catalyst DSL. Author: Santiago M. Mola Closes #6327 from smola/feature/catalyst-dsl-set-ops and squashes the following commits: 11db778 [Santiago M. Mola] [SPARK-7724

spark git commit: [SPARK-7724] [SQL] Support Intersect/Except in Catalyst DSL.

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 40989cea0 -> e18d623d9 [SPARK-7724] [SQL] Support Intersect/Except in Catalyst DSL. Author: Santiago M. Mola Closes #6327 from smola/feature/catalyst-dsl-set-ops and squashes the following commits: 11db778 [Santiago M. Mola] [SPARK-

spark git commit: [SPARK-7270] [SQL] Consider dynamic partition when inserting into hive table

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e4aef91fe -> 126d7235d [SPARK-7270] [SQL] Consider dynamic partition when inserting into hive table JIRA: https://issues.apache.org/jira/browse/SPARK-7270 Author: Liang-Chi Hsieh Closes #5864 from viirya/dyn_partition_insert and squashes

spark git commit: [SPARK-7270] [SQL] Consider dynamic partition when inserting into hive table

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 e18d623d9 -> d6cb04463 [SPARK-7270] [SQL] Consider dynamic partition when inserting into hive table JIRA: https://issues.apache.org/jira/browse/SPARK-7270 Author: Liang-Chi Hsieh Closes #5864 from viirya/dyn_partition_insert and squa

spark git commit: [SPARK-7834] [SQL] Better window error messages

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 821254fb9 -> 3c1305107 [SPARK-7834] [SQL] Better window error messages Author: Michael Armbrust Closes #6363 from marmbrus/windowErrors and squashes the following commits: 516b02d [Michael Armbrust] [SPARK-7834] [SQL] Better win

spark git commit: [SPARK-7834] [SQL] Better window error messages

2015-05-22 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.4 afde4019b -> d7660dc2f [SPARK-7834] [SQL] Better window error messages Author: Michael Armbrust Closes #6363 from marmbrus/windowErrors and squashes the following commits: 516b02d [Michael Armbrust] [SPARK-7834] [SQL] Better win

spark git commit: [SPARK-7637] [SQL] O(N) merge implementation for StructType merge

2015-05-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 0463428b6 -> 03668348e [SPARK-7637] [SQL] O(N) merge implementation for StructType merge Contribution is my original work and I license the work to the project under the projects open source license. Author: rowan Closes #6259 from rowa

git commit: [SPARK-1368][SQL] Optimized HiveTableScan

2014-05-29 Thread marmbrus
Context) import hiveContext._ val scanCsv = hql("select key from scan_csv") val scanRcfile = hql("select key from scan_rcfile") val csvDuration = benchmark(scanCsv.count()) val rcfileDuration = benchmark(scanRcfile.count()) println(s"CSV: $csv

git commit: [SPARK-1368][SQL] Optimized HiveTableScan

2014-05-29 Thread marmbrus
parkContext) import hiveContext._ val scanCsv = hql("select key from scan_csv") val scanRcfile = hql("select key from scan_rcfile") val csvDuration = benchmark(scanCsv.count()) val rcfileDuration = benchmark(scanRcfile.count()) println(s"CSV: $csv

git commit: [SPARK-1947] [SQL] Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.

2014-05-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9ecc40d3a -> 3ce81494c [SPARK-1947] [SQL] Child of SumDistinct or Average should be widened to prevent overflows the same as Sum. Child of `SumDistinct` or `Average` should be widened to prevent overflows the same as `Sum`. Author: Takuy

git commit: [SPARK-1947] [SQL] Child of SumDistinct or Average should be widened to prevent overflows the same as Sum.

2014-05-31 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 91a64eba9 -> f9fdf32c0 [SPARK-1947] [SQL] Child of SumDistinct or Average should be widened to prevent overflows the same as Sum. Child of `SumDistinct` or `Average` should be widened to prevent overflows the same as `Sum`. Author: T

git commit: [SPARK-1995][SQL] system function upper and lower can be supported

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d000ca98a -> ec8be274a [SPARK-1995][SQL] system function upper and lower can be supported I don't know whether it's time to implement system function about string operation in spark sql now. Author: egraldlo Closes #936 from egraldlo/st

git commit: [SPARK-1995][SQL] system function upper and lower can be supported

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 8857489da -> 9d0fae936 [SPARK-1995][SQL] system function upper and lower can be supported I don't know whether it's time to implement system function about string operation in spark sql now. Author: egraldlo Closes #936 from egraldl

git commit: Avoid dynamic dispatching when unwrapping Hive data.

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 9d0fae936 -> 6db0d5cfe Avoid dynamic dispatching when unwrapping Hive data. This is a follow up of PR #758. The `unwrapHiveData` function is now composed statically before actual rows are scanned according to the field object inspecto

git commit: Avoid dynamic dispatching when unwrapping Hive data.

2014-06-02 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ec8be274a -> 862283e9c Avoid dynamic dispatching when unwrapping Hive data. This is a follow up of PR #758. The `unwrapHiveData` function is now composed statically before actual rows are scanned according to the field object inspector to

git commit: fix java.lang.ClassCastException

2014-06-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 350cfd311 -> d96794132 fix java.lang.ClassCastException get Exception when run:bin/run-example org.apache.spark.examples.sql.RDDRelation Exception's detail is: Exception in thread "main" java.lang.ClassCastException: java.lang.Long c

git commit: fix java.lang.ClassCastException

2014-06-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 8edc9d033 -> aa41a522d fix java.lang.ClassCastException get Exception when run:bin/run-example org.apache.spark.examples.sql.RDDRelation Exception's detail is: Exception in thread "main" java.lang.ClassCastException: java.lang.Long canno

git commit: [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null.

2014-06-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 89cdbb087 -> e4c11eef2 [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null. `CaseConversionExpression` should check if the evaluated value is `null`. Author: Takuya UESHIN Closes #982 from ueshin/issu

git commit: [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null.

2014-06-05 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 8100cbdb7 -> 70383b11c [SPARK-2036] [SQL] CaseConversionExpression should check if the evaluated value is null. `CaseConversionExpression` should check if the evaluated value is `null`. Author: Takuya UESHIN Closes #982 from ueshin/

git commit: [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive.

2014-06-06 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 8d85359f8 -> 8d210560b [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive. Followup: #989 Author: Michael Armbrust Closes #994 from marmbrus/caseSensitiveFunctions2 and squashes the following commits: 9d9c8ed [Mich

git commit: [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive.

2014-06-06 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 d3717bea9 -> d7467484f [SPARK-2050 - 2][SQL] DIV and BETWEEN should not be case sensitive. Followup: #989 Author: Michael Armbrust Closes #994 from marmbrus/caseSensitiveFunctions2 and squashes the following commits: 9d9c

git commit: [SPARK-1495][SQL]add support for left semi join

2014-06-09 Thread marmbrus
and squashes the following commits: d39cd12 [Daoyuan Wang] Merge pull request #1 from marmbrus/pr/837 6713c09 [Michael Armbrust] Better debugging for failed query tests. 035b73e [Michael Armbrust] Add test for left semi that can't be done with a hash join. 5ec6fa4 [Michael Armbrust] Add left sem

git commit: [SPARK-1495][SQL]add support for left semi join

2014-06-09 Thread marmbrus
and squashes the following commits: d39cd12 [Daoyuan Wang] Merge pull request #1 from marmbrus/pr/837 6713c09 [Michael Armbrust] Better debugging for failed query tests. 035b73e [Michael Armbrust] Add test for left semi that can't be done with a hash join. 5ec6fa4 [Michael Armbrust] Add left sem

git commit: [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD.

2014-06-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master c6e041d17 -> a9ec033c8 [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD. This PR attempts to resolve [SPARK-1704](https://issues.apache.org/jira/browse/SPARK-1704) by introducing a physical plan for EXPLAIN commands, which ju

git commit: [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD.

2014-06-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 65fa7bcac -> 5a79ba13e [SPARK-1704][SQL] Fully support EXPLAIN commands as SchemaRDD. This PR attempts to resolve [SPARK-1704](https://issues.apache.org/jira/browse/SPARK-1704) by introducing a physical plan for EXPLAIN commands, whic

git commit: [SPARK-1508][SQL] Add SQLConf to SQLContext.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f971d6cb6 -> 08ed9ad81 [SPARK-1508][SQL] Add SQLConf to SQLContext. This PR (1) introduces a new class SQLConf that stores key-value properties for a SQLContext (2) clean up the semantics of various forms of SET commands. The SQLConf clas

git commit: [SPARK-1508][SQL] Add SQLConf to SQLContext.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 5a79ba13e -> d5da81cdd [SPARK-1508][SQL] Add SQLConf to SQLContext. This PR (1) introduces a new class SQLConf that stores key-value properties for a SQLContext (2) clean up the semantics of various forms of SET commands. The SQLConf

git commit: Moved hiveOperators.scala to the right package folder

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 08ed9ad81 -> a9a461c59 Moved hiveOperators.scala to the right package folder The package is `org.apache.spark.sql.hive.execution`, while the file was placed under `sql/hive/src/main/scala/org/apache/spark/sql/hive/`. Author: Cheng Lian

git commit: Moved hiveOperators.scala to the right package folder

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 d5da81cdd -> 89caa40e3 Moved hiveOperators.scala to the right package folder The package is `org.apache.spark.sql.hive.execution`, while the file was placed under `sql/hive/src/main/scala/org/apache/spark/sql/hive/`. Author: Cheng Lia

git commit: [SPARK-2076][SQL] Pushdown the join filter & predication for outer join

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 884ca718b -> db0c038a6 [SPARK-2076][SQL] Pushdown the join filter & predication for outer join As the rule described in https://cwiki.apache.org/confluence/display/Hive/OuterJoinBehavior, we can optimize the SQL Join by pushing down the J

git commit: [SPARK-2076][SQL] Pushdown the join filter & predication for outer join

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 5bc186dc7 -> f397ffbdf [SPARK-2076][SQL] Pushdown the join filter & predication for outer join As the rule described in https://cwiki.apache.org/confluence/display/Hive/OuterJoinBehavior, we can optimize the SQL Join by pushing down t

git commit: [SQL] Add average overflow test case from #978

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 55a0e87ee -> 1abbde0e8 [SQL] Add average overflow test case from #978 By @egraldlo. Author: egraldlo Author: Michael Armbrust Closes #1033 from marmbrus/pr/978 and squashes the following commits: e228c5e [Michael Armbrust] Rem

git commit: [SQL] Add average overflow test case from #978

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 f397ffbdf -> 86c4a79dc [SQL] Add average overflow test case from #978 By @egraldlo. Author: egraldlo Author: Michael Armbrust Closes #1033 from marmbrus/pr/978 and squashes the following commits: e228c5e [Michael Armbrust] Rem

git commit: HOTFIX: clear() configs in SQLConf-related unit tests.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master a2052a44f -> 601032f5b HOTFIX: clear() configs in SQLConf-related unit tests. Thanks goes to @liancheng, who pointed out that `sql/test-only *.SQLConfSuite *.SQLQuerySuite` passed but `sql/test-only *.SQLQuerySuite *.SQLConfSuite` failed.

git commit: HOTFIX: clear() configs in SQLConf-related unit tests.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 ac8c27bdf -> 2cdce7cf3 HOTFIX: clear() configs in SQLConf-related unit tests. Thanks goes to @liancheng, who pointed out that `sql/test-only *.SQLConfSuite *.SQLQuerySuite` passed but `sql/test-only *.SQLQuerySuite *.SQLConfSuite` fai

git commit: [SPARK-2093] [SQL] NullPropagation should use exact type value.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 601032f5b -> 0402bd77e [SPARK-2093] [SQL] NullPropagation should use exact type value. `NullPropagation` should use exact type value when transform `Count` or `Sum`. Author: Takuya UESHIN Closes #1034 from ueshin/issues/SPARK-2093 and sq

git commit: [SPARK-2093] [SQL] NullPropagation should use exact type value.

2014-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 2cdce7cf3 -> 6d15e9f7c [SPARK-2093] [SQL] NullPropagation should use exact type value. `NullPropagation` should use exact type value when transform `Count` or `Sum`. Author: Takuya UESHIN Closes #1034 from ueshin/issues/SPARK-2093 an

git commit: [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 0402bd77e -> 0266a0c8a [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables JIRA issue: [SPARK-1968](https://issues.apache.org/jira/browse/SPARK-1968) This PR added support for SQL/HiveQL command for caching/uncaching tables:

git commit: [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 6d15e9f7c -> 65ed7793d [SPARK-1968][SQL] SQL/HiveQL command for caching/uncaching tables JIRA issue: [SPARK-1968](https://issues.apache.org/jira/browse/SPARK-1968) This PR added support for SQL/HiveQL command for caching/uncaching tabl

git commit: SPARK-2107: FilterPushdownSuite doesn't need Junit jar.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 0f1dc3a73 -> 6e1193031 SPARK-2107: FilterPushdownSuite doesn't need Junit jar. Author: Qiuzhuang.Lian Closes #1046 from Qiuzhuang/master and squashes the following commits: 0a9921a [Qiuzhuang.Lian] SPARK-2107: FilterPushdownSuite doesn't

git commit: SPARK-2107: FilterPushdownSuite doesn't need Junit jar.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 65ed7793d -> 54ff00547 SPARK-2107: FilterPushdownSuite doesn't need Junit jar. Author: Qiuzhuang.Lian Closes #1046 from Qiuzhuang/master and squashes the following commits: 0a9921a [Qiuzhuang.Lian] SPARK-2107: FilterPushdownSuite doe

git commit: [SPARK-2042] Prevent unnecessary shuffle triggered by take()

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4d5c12aa1 -> 4107cce58 [SPARK-2042] Prevent unnecessary shuffle triggered by take() This PR implements `take()` on a `SchemaRDD` by inserting a logical limit that is followed by a `collect()`. This is also accompanied by adding a catalyst

git commit: [SPARK-2042] Prevent unnecessary shuffle triggered by take()

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 684a93a72 -> cc004488d [SPARK-2042] Prevent unnecessary shuffle triggered by take() This PR implements `take()` on a `SchemaRDD` by inserting a logical limit that is followed by a `collect()`. This is also accompanied by adding a catal

git commit: [SQL] Code Cleanup: Left Semi Hash Join

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 4107cce58 -> ce6deb1e5 [SQL] Code Cleanup: Left Semi Hash Join Some improvement for PR #837, add another case to white list and use `filter` to build result iterator. Author: Daoyuan Closes #1049 from adrian-wang/clean-LeftSemiJoinHash

git commit: [SQL] Code Cleanup: Left Semi Hash Join

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 cc004488d -> 597c7efdc [SQL] Code Cleanup: Left Semi Hash Join Some improvement for PR #837, add another case to white list and use `filter` to build result iterator. Author: Daoyuan Closes #1049 from adrian-wang/clean-LeftSemiJoinH

git commit: [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master d45e0c6b9 -> 9a2448daf [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's. Add optimization for `CaseConversionExpression`'s. Author: Takuya UESHIN Closes #990 from ueshin/issues/SPARK-2052 and squashes the following comm

git commit: [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's.

2014-06-11 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 81049eb54 -> e3955643d [SPARK-2052] [SQL] Add optimization for CaseConversionExpression's. Add optimization for `CaseConversionExpression`'s. Author: Takuya UESHIN Closes #990 from ueshin/issues/SPARK-2052 and squashes the following

git commit: [SPARK-2135][SQL] Use planner for in-memory scans

2014-06-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f95ac686b -> 13f8cfdc0 [SPARK-2135][SQL] Use planner for in-memory scans Author: Michael Armbrust Closes #1072 from marmbrus/cachedStars and squashes the following commits: 8757c8e [Michael Armbrust] Use planner for in-memory sc

git commit: [SPARK-2135][SQL] Use planner for in-memory scans

2014-06-12 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 3962abaf9 -> 57526e40a [SPARK-2135][SQL] Use planner for in-memory scans Author: Michael Armbrust Closes #1072 from marmbrus/cachedStars and squashes the following commits: 8757c8e [Michael Armbrust] Use planner for in-memory sc

git commit: [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 70c8116c0 -> 1c2fd015b [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType Author: Michael Armbrust Closes #1061 from marmbrus/timestamp and squashes the following commits: 79c3903 [Michael Armbrust] Add timestamp

git commit: [SPARK-2094][SQL] "Exactly once" semantics for DDL and command statements

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1c2fd015b -> ac96d9657 [SPARK-2094][SQL] "Exactly once" semantics for DDL and command statements ## Related JIRA issues - Main issue: - [SPARK-2094](https://issues.apache.org/jira/browse/SPARK-2094): Ensure exactly once semantics for D

git commit: [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType

2014-06-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 bc8d24ac1 -> eb2f5c50c [SPARK-1964][SQL] Add timestamp to HiveMetastoreTypes.toMetastoreType Author: Michael Armbrust Closes #1061 from marmbrus/timestamp and squashes the following commits: 79c3903 [Michael Armbrust] Add timest

git commit: [SQL] Support transforming TreeNodes with Option children.

2014-06-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 7dd9fc67a -> 269fc62b2 [SQL] Support transforming TreeNodes with Option children. Thanks goes to @marmbrus for his implementation. Author: Michael Armbrust Author: Zongheng Yang Closes #1074 from concretevitamin/option-treenode

git commit: [SQL] Support transforming TreeNodes with Option children.

2014-06-15 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 05d85c86e -> 868cf421e [SQL] Support transforming TreeNodes with Option children. Thanks goes to @marmbrus for his implementation. Author: Michael Armbrust Author: Zongheng Yang Closes #1074 from concretevitamin/option-treenode

git commit: [SQL][SPARK-2094] Follow up of PR #1071 for Java API

2014-06-16 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master cdf2b0457 -> 273afcb25 [SQL][SPARK-2094] Follow up of PR #1071 for Java API Updated `JavaSQLContext` and `JavaHiveContext` similar to what we've done to `SQLContext` and `HiveContext` in PR #1071. Added corresponding test case for Spark S

git commit: [SPARK-2164][SQL] Allow Hive UDF on columns of type struct

2014-06-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 09deb3eee -> f5a4049e5 [SPARK-2164][SQL] Allow Hive UDF on columns of type struct Author: Xi Liu Closes #796 from xiliu82/sqlbug and squashes the following commits: 328dfc4 [Xi Liu] [Spark SQL] remove a temporary function after test 3543

git commit: [SPARK-2164][SQL] Allow Hive UDF on columns of type struct

2014-06-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 3d4fa2dab -> 8e6b77fe0 [SPARK-2164][SQL] Allow Hive UDF on columns of type struct Author: Xi Liu Closes #796 from xiliu82/sqlbug and squashes the following commits: 328dfc4 [Xi Liu] [Spark SQL] remove a temporary function after test

git commit: [SPARK-2053][SQL] Add Catalyst expressions for CASE WHEN.

2014-06-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master f5a4049e5 -> e243c5ffa [SPARK-2053][SQL] Add Catalyst expressions for CASE WHEN. JIRA ticket: https://issues.apache.org/jira/browse/SPARK-2053 This PR adds support for two types of CASE statements present in Hive. The first type is of the

git commit: [SPARK-2053][SQL] Add Catalyst expressions for CASE WHEN.

2014-06-17 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 8e6b77fe0 -> 6ece39bd6 [SPARK-2053][SQL] Add Catalyst expressions for CASE WHEN. JIRA ticket: https://issues.apache.org/jira/browse/SPARK-2053 This PR adds support for two types of CASE statements present in Hive. The first type is of

git commit: [SQL] Break hiveOperators.scala into multiple files.

2014-06-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master ca5d8b590 -> ec935abce [SQL] Break hiveOperators.scala into multiple files. The single file was getting very long (500+ loc). Author: Reynold Xin Closes #1166 from rxin/hiveOperators and squashes the following commits: 5b43068 [Reynold

git commit: [SQL] Break hiveOperators.scala into multiple files.

2014-06-21 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 1829ec411 -> 4881fc62d [SQL] Break hiveOperators.scala into multiple files. The single file was getting very long (500+ loc). Author: Reynold Xin Closes #1166 from rxin/hiveOperators and squashes the following commits: 5b43068 [Reyn

git commit: [SPARK-1669][SQL] Made cacheTable idempotent

2014-06-23 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 853a2b951 -> a4bc442ca [SPARK-1669][SQL] Made cacheTable idempotent JIRA issue: [SPARK-1669](https://issues.apache.org/jira/browse/SPARK-1669) Caching the same table multiple times should end up with only 1 in-memory columnar representat

git commit: [SPARK-1669][SQL] Made cacheTable idempotent

2014-06-23 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 cf2fa4fe3 -> 6d821f0c0 [SPARK-1669][SQL] Made cacheTable idempotent JIRA issue: [SPARK-1669](https://issues.apache.org/jira/browse/SPARK-1669) Caching the same table multiple times should end up with only 1 in-memory columnar represe

git commit: [SQL]Add base row updating methods for JoinedRow

2014-06-24 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 8ca41769f -> 133495d82 [SQL]Add base row updating methods for JoinedRow This will be helpful in join operators. Author: Cheng Hao Closes #1187 from chenghao-intel/joinedRow and squashes the following commits: 87c19e3 [Cheng Hao] Add bas

git commit: [SQL]Add base row updating methods for JoinedRow

2014-06-24 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 05f84e28f -> c3ebf8ee6 [SQL]Add base row updating methods for JoinedRow This will be helpful in join operators. Author: Cheng Hao Closes #1187 from chenghao-intel/joinedRow and squashes the following commits: 87c19e3 [Cheng Hao] Add

git commit: [SQL] SPARK-1800 Add broadcast hash join operator & associated hints.

2014-06-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 1132e472e -> 9d824fed8 [SQL] SPARK-1800 Add broadcast hash join operator & associated hints. This PR is based off Michael's [PR 734](https://github.com/apache/spark/pull/734) and includes a bunch of cleanups. Moreover, this PR also - make

git commit: [SPARK-2283][SQL] Reset test environment before running PruningSuite

2014-06-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 9d824fed8 -> 7f196b009 [SPARK-2283][SQL] Reset test environment before running PruningSuite JIRA issue: [SPARK-2283](https://issues.apache.org/jira/browse/SPARK-2283) If `PruningSuite` is run right after `HiveCompatibilitySuite`, the first

git commit: [SPARK-2283][SQL] Reset test environment before running PruningSuite

2014-06-25 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 abb62f0b9 -> 5869f8bf1 [SPARK-2283][SQL] Reset test environment before running PruningSuite JIRA issue: [SPARK-2283](https://issues.apache.org/jira/browse/SPARK-2283) If `PruningSuite` is run right after `HiveCompatibilitySuite`, the f

git commit: [SPARK-2295] [SQL] Make JavaBeans nullability stricter.

2014-06-26 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-1.0 45bf91025 -> a7bebd14d [SPARK-2295] [SQL] Make JavaBeans nullability stricter. Author: Takuya UESHIN Closes #1235 from ueshin/issues/SPARK-2295 and squashes the following commits: 201c508 [Takuya UESHIN] Make JavaBeans nullability st

<    1   2   3   4   5   6   7   8   9   10   >