[spark] branch master updated: [SPARK-31255][SQL] Add SupportsMetadataColumns to DSv2

2020-11-18 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 1df69f7 [SPARK-31255][SQL] Add

[spark] branch branch-3.0 updated: [SPARK-29314][SS] Don't overwrite the metric "updated" of state operator to 0 if empty batch is run

2020-04-08 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/branch-3.0 by this push: new 2221d3e [SPARK-29314][SS] Don't

[spark] branch master updated: [SPARK-29314][SS] Don't overwrite the metric "updated" of state operator to 0 if empty batch is run

2020-04-08 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new ca2ba4f [SPARK-29314][SS] Don't overwrite

[spark] branch branch-3.0 updated: [SPARK-31278][SS] Fix StreamingQuery output rows metric

2020-04-07 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/branch-3.0 by this push: new a856eea [SPARK-31278][SS] Fix

[spark] branch master updated: [SPARK-31278][SS] Fix StreamingQuery output rows metric

2020-04-07 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 8ab2a0c [SPARK-31278][SS] Fix StreamingQuery

[spark] branch branch-3.0 updated: [SPARK-31178][SQL] Prevent V2 exec nodes from executing multiple times

2020-03-18 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch branch-3.0 in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/branch-3.0 by this push: new a97117f [SPARK-31178][SQL] Prevent V2

[spark] branch master updated: [SPARK-31178][SQL] Prevent V2 exec nodes from executing multiple times

2020-03-18 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 4237251 [SPARK-31178][SQL] Prevent V2 exec

[spark] branch master updated: [SPARK-30669][SS] Introduce AdmissionControl APIs for StructuredStreaming

2020-01-30 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 1cd19ad [SPARK-30669][SS] Introduce

[spark] branch master updated: [SPARK-30669][SS] Introduce AdmissionControl APIs for StructuredStreaming

2020-01-30 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 940510c [SPARK-30669][SS] Introduce

[spark] branch master updated: [SPARK-30314] Add identifier and catalog information to DataSourceV2Relation

2020-01-26 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new d0800fc [SPARK-30314] Add identifier

[spark] branch master updated: [SPARK-29219][SQL] Introduce SupportsCatalogOptions for TableProvider

2020-01-09 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new f8d5957 [SPARK-29219][SQL] Introduce

[spark] branch master updated: [SPARK-30143][SS] Add a timeout on stopping a streaming query

2019-12-13 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 4c37a8a [SPARK-30143][SS] Add a timeout

[spark] branch master updated: [SPARK-29568][SS] Stop existing running streams when a new stream is launched

2019-11-13 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 363af16 [SPARK-29568][SS] Stop existing running

[spark] branch master updated: [SPARK-29352][SQL][SS] Track active streaming queries in the SparkSession.sharedState

2019-10-23 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new cbe6ead [SPARK-29352][SQL][SS] Track active

[spark] branch master updated: [SPARK-28612][SQL] Add DataFrameWriterV2 API

2019-09-19 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 2c775f4 [SPARK-28612][SQL] Add

[spark] branch master updated: [SPARK-29030][SQL] Simplify lookupV2Relation

2019-09-18 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new ee94b5d [SPARK-29030][SQL] Simplify

[spark] branch master updated: [SPARK-28628][SQL] Implement SupportsNamespaces in V2SessionCatalog

2019-09-03 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 5ea134c [SPARK-28628][SQL] Implement

[spark] branch master updated: [SPARK-28612][SQL] Add DataFrameWriterV2 API

2019-08-31 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 3821d75 [SPARK-28612][SQL] Add

[spark] branch master updated: [SPARK-28635][SQL][FOLLOWUP] CatalogManager should reflect the changes of default catalog

2019-08-21 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 97b046f [SPARK-28635][SQL][FOLLOWUP

[spark] branch master updated: [SPARK-28565][SQL] DataFrameWriter saveAsTable support for V2 catalogs

2019-08-08 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 5368eaa [SPARK-28565][SQL] DataFrameWriter

[spark] branch master updated: [SPARK-28331][SQL] Catalogs.load() should be able to load built-in catalogs

2019-08-07 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new c88df2c [SPARK-28331][SQL] Catalogs.load

[spark] branch master updated: [SPARK-27661][SQL] Add SupportsNamespaces API

2019-08-04 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 0345f11 [SPARK-27661][SQL] Add

[spark] branch master updated: [SPARK-27661][SQL] Add SupportsNamespaces API

2019-08-04 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 0f89a5d [SPARK-27661][SQL] Add

[spark] branch master updated: [SPARK-27845][SQL] DataSourceV2: InsertTable

2019-07-25 Thread brkyvz
This is an automated email from the ASF dual-hosted git repository. brkyvz pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git The following commit(s) were added to refs/heads/master by this push: new 443904a [SPARK-27845][SQL] DataSourceV2

spark git commit: [SPARK-25472][SS] Don't have legitimate stops of streams cause stream exceptions

2018-09-20 Thread brkyvz
ing other specific SparkExceptions. I've also run the `KafkaSourceStressForDontFailOnDataLossSuite`100 times, and it didn't fail, whereas it used to be flaky. Closes #22478 from brkyvz/SPARK-25472. Authored-by: Burak Yavuz Signed-off-by: Burak Yavuz Project: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-24525][SS] Provide an option to limit number of rows in a MemorySink

2018-06-15 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 90da7dc24 -> e4fee395e [SPARK-24525][SS] Provide an option to limit number of rows in a MemorySink ## What changes were proposed in this pull request? Provide an option to limit number of rows in a MemorySink. Currently, MemorySink and

spark git commit: [SPARK-20168][DSTREAM] Add changes to use kinesis fetches from specific timestamp

2017-12-25 Thread brkyvz
rom the provided timestamp. ## How was this patch tested? Unit Tests cc : budde brkyvz Author: Yash Sharma <ysha...@atlassian.com> Closes #18029 from yssharma/ysharma/kcl_resume. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit

spark git commit: [SPARK-21977] SinglePartition optimizations break certain Streaming Stateful Aggregation requirements

2017-09-20 Thread brkyvz
ten and lost. ## How was this patch tested? Regression tests Author: Burak Yavuz <brk...@gmail.com> Closes #19196 from brkyvz/sa-0. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/280ff523 Tree: http://git-wip-us.apache.

spark git commit: [SPARK-21463] Allow userSpecifiedSchema to override partition inference performed by MetadataLogFileIndex

2017-07-19 Thread brkyvz
nit tests and manual tests Author: Burak Yavuz <brk...@gmail.com> Closes #18676 from brkyvz/stream-partitioning. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2c9d5ef1 Tree: http://git-wip-us.apache.org/repos/asf/s

spark git commit: [DSTREAM][DOC] Add documentation for kinesis retry configurations

2017-05-18 Thread brkyvz
467. The documentation was missed somewhere in the review iterations. Adding the documentation where it belongs. ## How was this patch tested? Docs. Not tested. cc budde , brkyvz Author: Yash Sharma <ysha...@atlassian.com> Closes #18028 from yssharma/ysharma/kinesis_retry_docs. (cherry picked fr

spark git commit: [DSTREAM][DOC] Add documentation for kinesis retry configurations

2017-05-18 Thread brkyvz
467. The documentation was missed somewhere in the review iterations. Adding the documentation where it belongs. ## How was this patch tested? Docs. Not tested. cc budde , brkyvz Author: Yash Sharma <ysha...@atlassian.com> Closes #18028 from yssharma/ysharma/kinesis_retry_docs. Project: http:

spark git commit: [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries

2017-05-16 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.2 75e5ea294 -> 7076ab40f [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries ## What changes were proposed in this pull request? The pull requests proposes to remove the hardcoded values for Amazon Kinesis -

spark git commit: [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries

2017-05-16 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 6f62e9d9b -> 38f4e8692 [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries ## What changes were proposed in this pull request? The pull requests proposes to remove the hardcoded values for Amazon Kinesis -

spark git commit: [SPARK-20441][SPARK-20432][SS] Within the same streaming query, one StreamingRelation should only be transformed to one StreamingExecutionRelation

2017-05-03 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.2 b5947f5c3 -> b1a732fea [SPARK-20441][SPARK-20432][SS] Within the same streaming query, one StreamingRelation should only be transformed to one StreamingExecutionRelation ## What changes were proposed in this pull request? Within the

spark git commit: [SPARK-20441][SPARK-20432][SS] Within the same streaming query, one StreamingRelation should only be transformed to one StreamingExecutionRelation

2017-05-03 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 7f96f2d7f -> 27f543b15 [SPARK-20441][SPARK-20432][SS] Within the same streaming query, one StreamingRelation should only be transformed to one StreamingExecutionRelation ## What changes were proposed in this pull request? Within the same

spark git commit: [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans

2017-04-28 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.2 ea5b11446 -> ec712d751 [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans ## What changes were proposed in this pull request? We didn't enforce analyzed plans in Spark 2.1 when writing out to Kafka. ## How was this patch

spark git commit: [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans

2017-04-28 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.1 6696ad0e8 -> 5131b0a96 [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans ## What changes were proposed in this pull request? We didn't enforce analyzed plans in Spark 2.1 when writing out to Kafka. ## How was this patch

spark git commit: [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans

2017-04-28 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 8c911adac -> 733b81b83 [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans ## What changes were proposed in this pull request? We didn't enforce analyzed plans in Spark 2.1 when writing out to Kafka. ## How was this patch

spark git commit: [SPARK-19911][STREAMING] Add builder interface for Kinesis DStreams

2017-03-24 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 9299d071f -> 707e50183 [SPARK-19911][STREAMING] Add builder interface for Kinesis DStreams ## What changes were proposed in this pull request? - Add new KinesisDStream.scala containing KinesisDStream.Builder class - Add

spark git commit: Fix compilation of the Scala 2.10 master branch

2017-03-23 Thread brkyvz
How was this patch tested? Compiled with `build/sbt -Dscala2.10 sql/compile` locally Author: Burak Yavuz <brk...@gmail.com> Closes #17403 from brkyvz/onceTrigger2.10. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9358

spark git commit: [SPARK-19813] maxFilesPerTrigger combo latestFirst may miss old files in combination with maxFileAge in FileStreamSource

2017-03-08 Thread brkyvz
are set. ## How was this patch tested? Regression test in `FileStreamSourceSuite` Author: Burak Yavuz <brk...@gmail.com> Closes #17153 from brkyvz/maxFileAge. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a3648b5

spark git commit: [SPARK-19813] maxFilesPerTrigger combo latestFirst may miss old files in combination with maxFileAge in FileStreamSource

2017-03-08 Thread brkyvz
are set. ## How was this patch tested? Regression test in `FileStreamSourceSuite` Author: Burak Yavuz <brk...@gmail.com> Closes #17153 from brkyvz/maxFileAge. (cherry picked from commit a3648b5d4f99ff9461d02f53e9ec71787a3abf51) Signed-off-by: Burak Yavuz <brk...@gmail.com> Project:

spark git commit: [SPARK-19304][STREAMING][KINESIS] fix kinesis slow checkpoint recovery

2017-03-06 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 339b53a13 -> 46a64d1e0 [SPARK-19304][STREAMING][KINESIS] fix kinesis slow checkpoint recovery ## What changes were proposed in this pull request? added a limit to getRecords api call call in KinesisBackedBlockRdd. This helps reduce the

spark git commit: [SPARK-19595][SQL] Support json array in from_json

2017-03-05 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 80d5338b3 -> 369a148e5 [SPARK-19595][SQL] Support json array in from_json ## What changes were proposed in this pull request? This PR proposes to both, **Do not allow json arrays with multiple elements and return null in `from_json`

spark git commit: [SPARK-19542][SS] Delete the temp checkpoint if a query is stopped without errors

2017-02-13 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.1 ef4fb7ebc -> c5a7cb022 [SPARK-19542][SS] Delete the temp checkpoint if a query is stopped without errors ## What changes were proposed in this pull request? When a query uses a temp checkpoint dir, it's better to delete it if it's

spark git commit: [SPARK-19542][SS] Delete the temp checkpoint if a query is stopped without errors

2017-02-13 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 0417ce878 -> 3dbff9be0 [SPARK-19542][SS] Delete the temp checkpoint if a query is stopped without errors ## What changes were proposed in this pull request? When a query uses a temp checkpoint dir, it's better to delete it if it's

spark git commit: [SPARK-18218][ML][MLLIB] Reduce shuffled data size of BlockMatrix multiplication and solve potential OOM and low parallelism usage problem By split middle dimension in matrix multipl

2017-01-26 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 9f523d319 -> 1191fe267 [SPARK-18218][ML][MLLIB] Reduce shuffled data size of BlockMatrix multiplication and solve potential OOM and low parallelism usage problem By split middle dimension in matrix multiplication ## What changes were

spark git commit: [SPARK-18020][STREAMING][KINESIS] Checkpoint SHARD_END to finish reading closed shards

2017-01-25 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 233845126 -> 256a3a801 [SPARK-18020][STREAMING][KINESIS] Checkpoint SHARD_END to finish reading closed shards ## What changes were proposed in this pull request? This pr is to fix an issue occurred when resharding Kinesis streams; the