GitHub user gaborgsomogyi opened a pull request:

    https://github.com/apache/spark/pull/20703

    [SPARK-19185][SS] Make Kafka consumer cache configurable

    ## What changes were proposed in this pull request?
    
    Use property `spark.streaming.kafka.consumer.cache.enabled` in structured 
streaming introduced [here](https://github.com/apache/spark/pull/18234) that 
allows users to enable or disable the cache for Kafka consumers. This property 
can be especially handy in cases where issues like 
[SPARK-19185](https://issues.apache.org/jira/browse/SPARK-19185) get hit, for 
which there isn't a solution committed yet. By default, the cache is still on, 
so this change doesn't change any out-of-box behavior. The structured streaming 
problem reported in 
[SPARK-23526](https://issues.apache.org/jira/browse/SPARK-23526).
    
    ## How was this patch tested?
    
    Automated: Passed Jenkins.
    Manually:
    ```
    cd docs
    jekyll build
    SKIP_API=1 jekyll build
    ```
    open _site/structured-streaming-kafka-integration.html


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gaborgsomogyi/spark SPARK-19185

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20703.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20703
    
----
commit aa8eed44ad1b5e4bdee9c8c32a44fb07b1fa9f2e
Author: Gabor Somogyi <gabor.g.somogyi@...>
Date:   2018-03-01T13:47:49Z

    [SPARK-19185][SS] Make Kafka consumer cache configurable

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to