GitHub user gaborgsomogyi opened a pull request: https://github.com/apache/spark/pull/20703
    [SPARK-19185][SS] Make Kafka consumer cache configurable

    ## What changes were proposed in this pull request?

    Use the property `spark.streaming.kafka.consumer.cache.enabled`, introduced [here](https://github.com/apache/spark/pull/18234), in Structured Streaming as well. It allows users to enable or disable the cache for Kafka consumers. This property is especially handy when issues like [SPARK-19185](https://issues.apache.org/jira/browse/SPARK-19185) are hit, for which no fix has been committed yet. The cache remains on by default, so this change does not alter any out-of-the-box behavior.

    The Structured Streaming problem was reported in [SPARK-23526](https://issues.apache.org/jira/browse/SPARK-23526).

    ## How was this patch tested?

    Automated: Passed Jenkins.

    Manually:
    ```
    cd docs
    jekyll build
    SKIP_API=1 jekyll build
    ```
    open _site/structured-streaming-kafka-integration.html

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gaborgsomogyi/spark SPARK-19185

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20703.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20703

----
commit aa8eed44ad1b5e4bdee9c8c32a44fb07b1fa9f2e
Author: Gabor Somogyi <gabor.g.somogyi@...>
Date:   2018-03-01T13:47:49Z

    [SPARK-19185][SS] Make Kafka consumer cache configurable

----
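
For reference, a minimal sketch of switching the consumer cache off at submit time, assuming this patch is applied so that Structured Streaming honors the property. The jar name and main class are hypothetical placeholders, and the command is echoed rather than executed so that the snippet runs without a Spark installation.

```shell
# Hypothetical application jar and main class; replace with your own.
APP_JAR="my-streaming-app.jar"
MAIN_CLASS="com.example.StreamingApp"

# Disable the Kafka consumer cache (per the PR description it is on by default).
CACHE_CONF="spark.streaming.kafka.consumer.cache.enabled=false"

# Dry run: print the spark-submit command instead of executing it.
echo spark-submit --class "$MAIN_CLASS" --conf "$CACHE_CONF" "$APP_JAR"
```

Dropping the leading `echo` submits the application for real. Leaving out the `--conf` argument keeps the default behavior, with the cache enabled.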