[GitHub] spark pull request #14553: [SPARK-16963] [STREAMING] [SQL] Changes to Source...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14553#discussion_r79492259 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala --- @@ -72,13 +74,17 @@ class StreamExecution

[GitHub] spark pull request #14803: [SPARK-17153][SQL] Should read partition data whe...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14803#discussion_r79488089 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -197,10 +197,13 @@ case class DataSource

[GitHub] spark pull request #14803: [SPARK-17153][SQL] Should read partition data whe...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14803#discussion_r79488934 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/FileStreamSourceSuite.scala --- @@ -608,6 +608,34 @@ class FileStreamSourceSuite extends

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088749 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089396 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088295 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088253 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/CachedKafkaConsumer.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089110 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089641 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089541 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088325 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088914 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* --- End diff

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15102 > This already does depend on most of the existing Kafka DStream implementation I pushed for this code to be copied rather than refactored because I think this is the right direct

[GitHub] spark pull request #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Stat...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15054#discussion_r79056087 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala --- @@ -360,6 +360,7 @@ trait CheckAnalysis extends

[GitHub] spark issue #15023: Backport [SPARK-5847] Allow for configuring MetricsSyste...

2016-09-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15023 Thanks for understanding! I do hope you guys upgrade eventually, there's a lot of good stuff and 2.0.1 should be out in the near future. Please do report any issues you see :) --- If your

[GitHub] spark issue #15023: Backport [SPARK-5847] Allow for configuring MetricsSyste...

2016-09-12 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15023 Thanks for spending the time to backport this, but it does seem a little risky to include changes to the configuration system in a maintenance release. As such, I'd probably error on the side

[GitHub] spark pull request #14728: [SPARK-17165][SQL] FileStreamSource should not tr...

2016-08-22 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14728#discussion_r75786696 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -17,21 +17,18 @@ package

[GitHub] spark issue #14124: [SPARK-16472][SQL] Inconsistent nullability in schema af...

2016-07-26 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14124 @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #14356: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14356 /cc @rxin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #14356: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread marmbrus
GitHub user marmbrus opened a pull request: https://github.com/apache/spark/pull/14356 [SPARK-16724] Expose DefinedByConstructorParams We don't generally make things in catalyst/execution private. Instead they are just undocumented due to their lack of stability guarantees. You

[GitHub] spark issue #14252: [SPARK-16615][SQL] Expose sqlContext in SparkSession

2016-07-18 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14252 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #14087: [SPARK-16411][SQL][STREAMING] Add textFile to Structured...

2016-07-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14087 /cc @tdas --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #14087: [SPARK-16411][SQL][STREAMING] Add textFile to Str...

2016-07-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14087#discussion_r71025786 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/FileStreamSourceSuite.scala --- @@ -331,6 +331,24 @@ class FileStreamSourceSuite extends

[GitHub] spark issue #14214: [SPARK-16545][SQL] Eliminate unnecessary rounds of physi...

2016-07-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14214 Thanks for working on this, but I'm tempted to close this as "won't fix". Its likely we are going to have to rewrite the incremental planner completely for 2.1 and this is ju

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 Oh, I see what you are saying, although I'm not sure I agree with the conclusion. Given that tests can run in parallel I don't think you actually want to toggle back and forth between timezones

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 I think thats where we are today. All query tests use LA and the harness configures that. The problem before this PR was this one suite was setting LA (due to its base class), and then UTC (due

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 All the tests in SQL are written to assume `Los_Angeles`, so I think this is actually desired. Otherwise people have to configure their machine specially to run spark tests. --- If your project

spark git commit: [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite

2016-07-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 2e97f3a08 -> 7de183d97 [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite ## What changes were proposed in this pull request? It's unnecessary. `QueryTest` already sets it. Author: Burak Yavuz

spark git commit: [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite

2016-07-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 01f09b161 -> 0744d84c9 [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite ## What changes were proposed in this pull request? It's unnecessary. `QueryTest` already sets it. Author: Burak Yavuz

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 Thanks, merging to master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #14170: [MINOR][SQL][TEST] Remove timezone setting from DataFram...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 We should put this in 2.0 for whoever merges. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #14170: [MINOR][SQL][TEST] Remove timezone setting from DataFram...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 LGTM, can you make a JIRA? Its a little scary to change tests w/o one in case there is flakiness. --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #14139: [SPARK-16313][SQL][BRANCH-1.6] Spark should not silently...

2016-07-12 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14139 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan f...

2016-07-08 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13890#discussion_r70139445 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala --- @@ -74,13 +74,71 @@ object RDDConversions

[GitHub] spark issue #14094: [SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrig...

2016-07-07 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14094 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #14094: [SPARK-16430][SQL][STREAMING] Add option maxFiles...

2016-07-07 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14094#discussion_r69986165 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -45,6 +47,7 @@ class FileStreamSource

[GitHub] spark pull request #14094: [SPARK-16430][SQL][STREAMING] Add option maxFiles...

2016-07-07 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14094#discussion_r69985831 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -26,6 +27,7 @@ import

[GitHub] spark pull request #14030: [SPARK-16350][SQL] Fix support for incremental pl...

2016-07-06 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14030#discussion_r69819131 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/ForeachSink.scala --- @@ -30,7 +32,42 @@ import org.apache.spark.sql.{DataFrame

[GitHub] spark pull request #14030: [SPARK-16350][SQL] Fix support for incremental pl...

2016-07-06 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14030#discussion_r69819064 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -155,7 +155,7 @@ private[sql] object Dataset { class Dataset[T] private[sql

[GitHub] spark issue #13873: [SPARK-16167][SQL] RowEncoder should preserve array/map ...

2016-07-05 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13873 /cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExistingRDD logical plan for inpu...

2016-07-05 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13890 /cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #14002: [SPARK-16335][SQL] Structured streaming should fail if s...

2016-06-30 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14002 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #14000: [SPARK-16331] [SQL] Reduce code generation time

2016-06-30 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14000 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #13901: [SPARK-16199][SQL] Add a method to list the referenced c...

2016-06-28 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13901 No tests? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #13939: [SPARK-16248][SQL] Whitelist the list of Hive fallback f...

2016-06-28 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13939 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #13939: [SPARK-16248][SQL] Whitelist the list of Hive fal...

2016-06-28 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13939#discussion_r68841421 --- Diff: sql/hive/compatibility/src/test/scala/org/apache/spark/sql/hive/execution/HiveWindowFunctionQuerySuite.scala --- @@ -569,6 +572,7 @@ class

[GitHub] spark pull request #13939: [SPARK-16248][SQL] Whitelist the list of Hive fal...

2016-06-28 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13939#discussion_r68840940 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionCatalog.scala --- @@ -196,6 +185,10 @@ private[sql] class HiveSessionCatalog

[GitHub] spark issue #13862: [SPARK-16159][SQL] Move RDD creation logic from FileSour...

2016-06-22 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13862 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #13862: [SPARK-16159][SQL] Break partitioning logic from ...

2016-06-22 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13862#discussion_r68148358 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileScanRDD.scala --- @@ -43,13 +48,16 @@ case class PartitionedFile

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67740050 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamSuite.scala --- @@ -211,6 +217,7 @@ class StreamSuite extends StreamTest

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67736904 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamSuite.scala --- @@ -211,6 +217,7 @@ class StreamSuite extends StreamTest

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67734059 --- Diff: core/src/main/scala/org/apache/spark/util/ManualClock.scala --- @@ -57,9 +59,19 @@ private[spark] class ManualClock(private var time: Long

spark git commit: [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite

2016-06-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 905f774b7 -> 5cfabec87 [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite ## What changes were proposed in this pull request? ConsoleSinkSuite just collects content from stdout and compare them with the expected string.

spark git commit: [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite

2016-06-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 0b0b5fe54 -> 363db9f8b [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite ## What changes were proposed in this pull request? ConsoleSinkSuite just collects content from stdout and compare them with the expected string.

[GitHub] spark issue #13776: [SPARK-16050][Tests]Remove the flaky test: ConsoleSinkSu...

2016-06-20 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13776 LGTM, merging to master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67578285 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark issue #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harmonize t...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13727 A few comments. Overall LGTM. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575918 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575723 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -276,7 +267,45 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575684 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -276,7 +267,45 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark issue #13740: [SPARK-16020][SQL]Fix complete mode aggregation with con...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13740 There are examples in `quietly`. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #13740: [SPARK-16020][SQL]Fix complete mode aggregation with con...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13740 LGTM and we should merge before the RC. How hard to add a test? You could redirect stdout temporarily? --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67573034 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572740 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572585 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572462 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572121 --- Diff: sql/core/src/test/java/test/org/apache/spark/sql/JavaDataFrameReaderWriterSuite.java --- @@ -0,0 +1,158 @@ +/* +* Licensed to the Apache

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572068 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -368,6 +397,63 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark issue #13718: [SPARK-16002][SQL]Sleep when no new data arrives to avoi...

2016-06-16 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13718 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-16 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67435503 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -545,6 +545,13 @@ object SQLConf { .booleanConf

[GitHub] spark issue #13673: [WIP][SPARK-15953][SQL][STREAMING] Renamed ContinuousQue...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13673 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67029686 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reader-write...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13653 Overall looks pretty good. Feel free to merge after addressing comments / passing tests to avoid more conflicts. --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023372 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023216 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023075 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67022595 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamReader.scala --- @@ -0,0 +1,288 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #13638: [SPARK-15915][SQL] Logical plans should use canonicalize...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Hmmm, does not apply cleanly to 1.6. @ueshin if you have time it might be nice to backport. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

spark git commit: [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult.

2016-06-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 d5e60748b -> 83aa17d44 [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult. ## What changes were proposed in this pull request? `DataFrame` with plan overriding `sameResult` but not using

spark git commit: [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult.

2016-06-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master bc02d0112 -> c5b735581 [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult. ## What changes were proposed in this pull request? `DataFrame` with plan overriding `sameResult` but not using canonicalized

[GitHub] spark issue #13638: [SPARK-15915][SQL] Logical plans should use canonicalize...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Yeah, sounds reasonable. Merging to master, 2.0 and 1.6. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66884087 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66882083 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark issue #13638: [SPARK-15915][SQL] CacheManager should use canonicalized...

2016-06-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Seems reasonable. Is this a regression from 1.6? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66876008 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark issue #8416: [SPARK-10185] [SQL] Feat sql comma separated paths

2016-06-11 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/8416 @rxin I believe I fixed that limitation in my recent refactoring. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #13424: [SPARK-15489][SQL] Dataset kryo encoder won't load custo...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13424 Thanks! Merged into master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

spark git commit: [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master aec502d91 -> 127a6678d [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings ## What changes were proposed in this pull request? Serializer instantiation will consider existing SparkConf ## How was this patch tested?

spark git commit: [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 bc53422ad -> e6ebb547b [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings ## What changes were proposed in this pull request? Serializer instantiation will consider existing SparkConf ## How was this patch

[GitHub] spark issue #13147: [SPARK-6320][SQL] Move planLater method into GenericStra...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13147 Thanks, merged to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

spark git commit: [SPARK-6320][SQL] Move planLater method into GenericStrategy.

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master fb219029d -> 667d4ea7b [SPARK-6320][SQL] Move planLater method into GenericStrategy. ## What changes were proposed in this pull request? This PR moves `QueryPlanner.planLater()` method into `GenericStrategy` for extra strategies to be

[GitHub] spark issue #13424: [SPARK-15489][SQL] Dataset kryo encoder won't load custo...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13424 LGTM, can you update the description (it still says WIP). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #13486: [SPARK-15743][SQL] Prevent saving with all-column partit...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13486 Merging to master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #13597: [SPARK-15871][SQL] Add `assertNotPartitioned` check in `...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13597 Seems fine to me. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #13604: [SPARK-15856][SQL] Revert API breaking changes made in D...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13604 I'm not sure I agree with all of the reasoning here. Here are my thoughts: - `SQLContext` should probably not break any APIs (its only there for compatibility anyway). - In `SparkSession

[GitHub] spark issue #13549: [SPARK-15812][SQ][Streaming] Added support for sorting a...

2016-06-08 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13549 This is okay for 2.0, but we'll need to rethink the way we are doing query planning to handle incremental input. --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #13549: [SPARK-15812][SQ][Streaming] Added support for so...

2016-06-08 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13549#discussion_r66361438 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationsSuite.scala --- @@ -189,9 +189,20 @@ class

[GitHub] spark pull request #13549: [SPARK-15812][SQ][Streaming] Added support for so...

2016-06-08 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13549#discussion_r66361373 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingAggregationSuite.scala --- @@ -104,6 +104,31 @@ class StreamingAggregationSuite

[GitHub] spark pull request #13549: [SPARK-15812][SQ][Streaming] Added support for so...

2016-06-08 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13549#discussion_r66361125 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala --- @@ -43,6 +43,41 @@ object

[GitHub] spark issue #13424: [SPARK-15489][SQL] Dataset kryo encoder won't load custo...

2016-06-08 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13424 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

<    2   3   4   5   6   7   8   9   10   11   >