[GitHub] spark issue #16902: [SPARK-19564][SPARK-19559][SS][KAFKA] KafkaOffsetReader'...

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16902 @zsxwing would you take a look --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16686: [SPARK-18682][SS] Batch Source for Kafka

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16686 Hi @kayousterhout, a fix to `KafkaSourceSuite.subscribing topic by pattern with topic deletions` is here: https://github.com/apache/spark/pull/16902 --- If your project is set up for it, you can

[GitHub] spark issue #16902: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers s...

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16902 ok all 100 times of `KafkaSourceSuite` passed. now revert back to one normal pass of testing --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark pull request #16902: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's cons...

2017-02-12 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16902 [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers should not be in the same group ## What changes were proposed in this pull request? In `KafkaOffsetReader`, when error occurs, we

[GitHub] spark issue #16902: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers s...

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16902 Let's start by running only KafkaSourceSuite for 100 times --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #16900: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's cons...

2017-02-12 Thread lw-lin
Github user lw-lin closed the pull request at: https://github.com/apache/spark/pull/16900 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark pull request #16899: [DO_NOT_MERGE][Test] Investigate KafkaSourceSuite...

2017-02-12 Thread lw-lin
Github user lw-lin closed the pull request at: https://github.com/apache/spark/pull/16899 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #16900: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers s...

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16900 ok, all 100 times of `KafkaSourceSuite` passed. let's now do a full pass of testing. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark issue #16900: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers s...

2017-02-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16900 Let's start by running only `KafkaSourceSuite` for 1000 times --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #16900: [SPARK-19564][SS][KAFKA] KafkaOffsetReader's cons...

2017-02-12 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16900 [SPARK-19564][SS][KAFKA] KafkaOffsetReader's consumers should not be in the same group ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix

[GitHub] spark pull request #16899: [DO_NOT_MERGE][Test] Investigate KafkaSourceSuite...

2017-02-12 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16899 [DO_NOT_MERGE][Test] Investigate KafkaSourceSuite flakiness ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch

[GitHub] spark issue #16736: [SPARK-19265][SQL][Follow-up] Configurable `tableRelatio...

2017-02-08 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 @gatorsmile @cloud-fan thank you for the time and efforts you've put in reviewing this! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark pull request #16736: [SPARK-19265][SQL][Follow-up] Configurable `table...

2017-02-08 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r100089378 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/internal/SQLConfEntrySuite.scala --- @@ -164,6 +164,18 @@ class SQLConfEntrySuite extends

[GitHub] spark pull request #16736: [SPARK-19265][SQL][Follow-up] Configurable `table...

2017-02-08 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r100089218 --- Diff: core/src/test/scala/org/apache/spark/internal/config/ConfigEntrySuite.scala --- @@ -128,6 +128,25 @@ class ConfigEntrySuite extends SparkFunSuite

[GitHub] spark pull request #16736: [SPARK-19265][SQL][Follow-up] Configurable `table...

2017-02-07 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r13797 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/internal/SQLConfEntrySuite.scala --- @@ -164,6 +164,18 @@ class SQLConfEntrySuite extends

[GitHub] spark pull request #16736: [SPARK-19265][SQL][Follow-up] Configurable `table...

2017-02-07 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r13250 --- Diff: core/src/test/scala/org/apache/spark/internal/config/ConfigEntrySuite.scala --- @@ -128,6 +128,25 @@ class ConfigEntrySuite extends SparkFunSuite

[GitHub] spark issue #16736: [SPARK-19265][SQL][Follow-up] Configurable `tableRelatio...

2017-02-07 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16736: [SPARK-19265][SQL][Follow-up] Configurable `tableRelatio...

2017-02-07 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16736: [SPARK-19265][SQL][Follow-up] Configurable `tableRelatio...

2017-02-07 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #16833: [SPARK-19491][SQL]add a config for table relation cache ...

2017-02-07 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16833 Ah this has been done in https://github.com/apache/spark/pull/16736, with additional sanity-checks. that pr had received 'lgtm' from @smilegator. Do you want to take a look at that? --- If your

[GitHub] spark pull request #16758: [SPARK-19413][SS] MapGroupsWithState for arbitrar...

2017-02-01 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16758#discussion_r99063259 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/KeyedState.scala --- @@ -0,0 +1,134 @@ +/* + * Licensed to the Apache Software Foundation (ASF

[GitHub] spark pull request #16758: [SPARK-19413][SS] MapGroupsWithState for arbitrar...

2017-02-01 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16758#discussion_r99062793 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/MapGroupsWithStateSuite.scala --- @@ -0,0 +1,240 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #16758: [SPARK-19413][SS] MapGroupsWithState for arbitrar...

2017-02-01 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16758#discussion_r99062438 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/MapGroupsWithStateSuite.scala --- @@ -0,0 +1,240 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #16758: [SPARK-19413][SS] MapGroupsWithState for arbitrar...

2017-02-01 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16758#discussion_r99059679 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/KeyedStateImpl.scala --- @@ -0,0 +1,57 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #16736: [SPARK-19265][SQL][Follow-up] Configurable `tableRelatio...

2017-02-01 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 PR summary & description updated. thanks for comments @gatorsmile ! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your pro

[GitHub] spark issue #16731: [SPARK-19393][SQL] Add `approx_percentile` Dataset/DataF...

2017-01-30 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16731 understood. they really were lots of functions -- let's tell users to use `expr("approx_percentile(...)")`. closing this. thanks for the comments! --- If your project is set up f

[GitHub] spark pull request #16731: [SPARK-19393][SQL] Add `approx_percentile` Datase...

2017-01-30 Thread lw-lin
Github user lw-lin closed the pull request at: https://github.com/apache/spark/pull/16731 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-30 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98592980 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/internal/SQLConfSuite.scala --- @@ -221,17 +221,8 @@ class SQLConfSuite extends QueryTest

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-30 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98589474 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -464,6 +464,16 @@ object SQLConf { .intConf

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-30 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98589278 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/internal/SQLConfSuite.scala --- @@ -221,6 +221,19 @@ class SQLConfSuite extends QueryTest

[GitHub] spark issue #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableRelation...

2017-01-30 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 tests passed for the default `maxSize = 1000` now let's test against `maxSize = 1` --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark issue #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableRelation...

2017-01-30 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16736 @gatorsmile fairly good point and thanks for the clarification! - [Guava checks that maxSize >= 0 must holds](https://github.com/google/guava/blob/v14.0.1/guava/src/com/google/common/ca

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-29 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98375909 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -464,6 +464,12 @@ object SQLConf { .intConf

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-29 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98375736 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/CatalystConf.scala --- @@ -69,6 +71,7 @@ case class SimpleCatalystConf

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-29 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16736#discussion_r98374466 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/CatalystConf.scala --- @@ -69,6 +71,7 @@ case class SimpleCatalystConf

[GitHub] spark pull request #16736: [WIP][SPARK][SQL][Follow-up] Configurable `tableR...

2017-01-29 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16736 [WIP][SPARK][SQL][Follow-up] Configurable `tableRelationCache` maximum size ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How

[GitHub] spark issue #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...(l, ...

2017-01-29 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16719 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16731: [SPARK-19393][SQL] Add `approx_percentile` Dataset/DataF...

2017-01-29 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16731 also @cloud-fan @gatorsmile could you take a look --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-29 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98347229 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Canonicalize.scala --- @@ -78,14 +78,18 @@ object Canonicalize extends

[GitHub] spark issue #16731: [SPARK-19393][SQL] Add `approx_percentile` Dataset/DataF...

2017-01-28 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16731 @rxin @hvanhovell could you take a look, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-28 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98346999 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionSetSuite.scala --- @@ -32,6 +32,38 @@ class ExpressionSetSuite

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-28 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98346981 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionSetSuite.scala --- @@ -32,6 +32,38 @@ class ExpressionSetSuite

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-28 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98346453 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionSetSuite.scala --- @@ -32,6 +32,38 @@ class ExpressionSetSuite

[GitHub] spark pull request #16731: [WIP][SPARK-][SQL] Add `approx_percentile` Datafr...

2017-01-28 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16731 [WIP][SPARK-][SQL] Add `approx_percentile` Dataframe API ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch

[GitHub] spark issue #16725: [SPARK-19377] [WEBUI] [CORE] Killed tasks should have th...

2017-01-28 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16725 Jenkins test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-27 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98322977 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/Canonicalize.scala --- @@ -78,14 +78,18 @@ object Canonicalize extends

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-27 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16719#discussion_r98322886 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionSetSuite.scala --- @@ -75,10 +107,14 @@ class ExpressionSetSuite

[GitHub] spark issue #16719: [SPARK-19385][SQL] During canonicalization, `NOT(l, r)` ...

2017-01-27 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16719 @cloud-fan @gatorsmile @dongjoon-hyun would you take a look, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request #16719: [SPARK-19385][SQL] During canonicalization, `NOT(...

2017-01-27 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16719 [SPARK-19385][SQL] During canonicalization, `NOT(l, r)` should not expect such cases that l.hashcode > r.hashcode ## What changes were proposed in this pull request? Dur

[GitHub] spark issue #16673: [SPARK-19330][DStreams] Also show tooltip for successful...

2017-01-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16673 @zsxwing would you take a look, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16673: [SPARK-19330][DStreams] Also show tooltip for successful...

2017-01-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16673 @uncleGen yea I mean the tooltip won't show for a blue circle (which represents a successful batch) when the cursor hovers over the blue circle in the 'processing time' line. Which version of spark

[GitHub] spark pull request #16673: [SPARK-19330][DStreams] Also show tooltip for suc...

2017-01-22 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16673#discussion_r97225131 --- Diff: streaming/src/main/resources/org/apache/spark/streaming/ui/static/streaming-page.js --- @@ -169,7 +169,7 @@ function drawTimeline(id, data, minX

[GitHub] spark pull request #16673: [SPARK-19330][DStreams] Also show tooltip for suc...

2017-01-22 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16673 [SPARK-19330][DStreams] Also show tooltip for successful batches ## What changes were proposed in this pull request? [Before] ![_streaming_before](https://cloud.githubusercontent.com

[GitHub] spark issue #16628: Update known_translations for contributor names

2017-01-17 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16628 sorry, I made a mistake. It should really be: `Yuhai Cen`, rather than `Yuhai Ceng`. could you fix it? --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #16628: Update known_translations for contributor names

2017-01-17 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16628 this lgtm, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] StateStore sh...

2017-01-17 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16547#discussion_r96557737 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingAggregationSuite.scala --- @@ -335,4 +344,67 @@ class StreamingAggregationSuite

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] StateStore sh...

2017-01-17 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16547#discussion_r96445159 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamingAggregationSuite.scala --- @@ -335,4 +344,67 @@ class StreamingAggregationSuite

[GitHub] spark issue #16578: [SPARK-4502][SQL] Parquet nested column pruning

2017-01-14 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16578 Maybe we also want to get feedback from @liancheng ? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #16547: [SPARK-19168][Structured Streaming] Improvement: filter ...

2017-01-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16547 Thanks for the feedback! Ah, sure, let me update accordingly. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #16547: [SPARK-19168][Structured Streaming] Improvement: filter ...

2017-01-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16547 @zsxwing updated as per your comments; would you take another look? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #16547: [SPARK-19168][Structured Streaming] Improvement: filter ...

2017-01-12 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16547 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16547: [SPARK-19168][Structured Streaming] Improvement: filter ...

2017-01-11 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16547 @zsxwing thanks for the comments. I'll update this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #16547: [SPARK-19168][Structured Streaming] Improvement: filter ...

2017-01-11 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16547 @tdas @zsxwing would you take a look, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] Improvement: ...

2017-01-11 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16547#discussion_r95571209 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/EventTimeWatermarkSuite.scala --- @@ -132,13 +132,13 @@ class EventTimeWatermarkSuite

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] Improvement: ...

2017-01-11 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16547#discussion_r95571124 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/EventTimeWatermarkSuite.scala --- @@ -132,13 +132,13 @@ class EventTimeWatermarkSuite

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] Improvement: ...

2017-01-11 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16547#discussion_r95527829 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/UnsupportedOperationChecker.scala --- @@ -69,8 +69,8 @@ object

[GitHub] spark pull request #16547: [SPARK-19168][Structured Streaming] Improvement: ...

2017-01-10 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16547 [SPARK-19168][Structured Streaming] Improvement: filter late data using watermark for `Append` mode ## What changes were proposed in this pull request? Currently we're filtering late data

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2017-01-10 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2017-01-09 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r95310486 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -117,8 +117,31 @@ object

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2017-01-09 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r95306902 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -117,8 +117,31 @@ object

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2017-01-09 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r95306125 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -117,8 +117,31 @@ object

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2017-01-09 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r95300754 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/OrderingSuite.scala --- @@ -127,4 +127,17 @@ class OrderingSuite extends

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2017-01-09 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r95300735 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -117,8 +117,31 @@ object

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2017-01-09 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 I've cherry-picked the refactoring work of `splitExpression` (from @ueshin -- thank you!) into this. Also test passed. So @cloud-fan would you take a look at this again at your convenience

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2017-01-09 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2017-01-05 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 Hi @mallman, I'll pick up this within this week. Thanks for the feedback! :) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request #16442: [SPARK-19041][SS] Fix code snippet compilation is...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271336 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -537,21 +537,21 @@ Most of the common operations on DataFrame/Dataset are supported

[GitHub] spark pull request #16442: [SPARK-19041][SS] Fix code snippet compilation is...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271332 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -537,21 +537,21 @@ Most of the common operations on DataFrame/Dataset are supported

[GitHub] spark issue #16442: [SPARK-19041][SS] Fix code snippet compilation issues in...

2016-12-31 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16442 @srowen would you take a look, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271296 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1491,14 +1503,17 @@ spark.streams.addListener(new StreamingQueryListener

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271291 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1169,11 +1177,11 @@ query.stop() // stop the query query.awaitTermination

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271257 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -1026,7 +1026,9 @@ noAggDF // Write new data to Parquet files noAggDF

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271243 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -973,7 +973,7 @@ Here is a table of all the sinks, and the corresponding settings

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/16442#discussion_r94271218 --- Diff: docs/structured-streaming-programming-guide.md --- @@ -537,21 +537,21 @@ Most of the common operations on DataFrame/Dataset are supported

[GitHub] spark pull request #16442: [SPARK-19041[SS] Fix code snippet compilation iss...

2016-12-31 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/16442 [SPARK-19041[SS] Fix code snippet compilation issues in Structured Streaming Programming Guide ## What changes were proposed in this pull request? Currently some code snippets

[GitHub] spark issue #16423: Update known_translations for contributor names and also...

2016-12-29 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/16423 hi @yhuai, could you also: add: ``` lw-lin - Liwei Lin ``` update: ``` sharkdtu - Xiaogang Tu cenyuhai - Yuhai Ceng ``` --- If your project is set up

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2016-12-01 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r90454770 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -118,7 +118,42 @@ object

[GitHub] spark pull request #15946: [SPARK-18513][Structured Streaming] Record and re...

2016-11-29 Thread lw-lin
Github user lw-lin closed the pull request at: https://github.com/apache/spark/pull/15946 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #15946: [SPARK-18513][Structured Streaming] Record and recover w...

2016-11-29 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15946 sure! closing this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2016-11-24 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r89516712 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -118,7 +118,42 @@ object

[GitHub] spark pull request #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrderi...

2016-11-24 Thread lw-lin
Github user lw-lin commented on a diff in the pull request: https://github.com/apache/spark/pull/15480#discussion_r89490417 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/GenerateOrdering.scala --- @@ -118,7 +118,42 @@ object

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2016-11-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 Jenkins retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #15480: [SPARK-16845][SQL] `GeneratedClass$SpecificOrdering` gro...

2016-11-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15480 @cloud-fan @hvanhovell would you take a look at this? Seems like it's targeted for 2.1. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #15949: [SPARK-18339] [SPARK-18513] [SQL] Don't push down curren...

2016-11-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15949 Also cc @marmbrus @tdas @zsxwing; please be aware of [SPARK-18552](https://issues.apache.org/jira/browse/SPARK-18552), thanks. --- If your project is set up for it, you can reply to this email

[GitHub] spark issue #15949: [SPARK-18339] [SPARK-18513] [SQL] Don't push down curren...

2016-11-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15949 Hi @tcondie, just want to provide some information regarding watermark recovery: After some investigation, I found that even if we do have correctly recovered the watermark from log

[GitHub] spark issue #15946: [SPARK-18513][Structured Streaming] Record and recover w...

2016-11-21 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15946 Ah, I'll keep an eye on that! @zsxwing thanks for the notification. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #15946: [SPARK-18513][Structured Streaming] Record and recover w...

2016-11-21 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15946 @marmbrus @zsxwing could you take a look? Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #15946: [SPARK-18513][Structured Streaming] Record and re...

2016-11-20 Thread lw-lin
GitHub user lw-lin opened a pull request: https://github.com/apache/spark/pull/15946 [SPARK-18513][Structured Streaming] Record and recover watermark ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How

[GitHub] spark issue #15870: [SPARK-18425][Structured Streaming][Tests] Test `Compact...

2016-11-18 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/15870 @zsxwing thanks. I'm rebasing right now. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15828: [SPARK-18187][SS] CompactibleFileStreamLog should...

2016-11-18 Thread lw-lin
Github user lw-lin closed the pull request at: https://github.com/apache/spark/pull/15828 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

<    1   2   3   4   5   6   >