GitHub user jiangxb1987 opened a pull request:

    https://github.com/apache/spark/pull/20361

    [SPARK-23188] [SQL] Make vectorized columar reader batch size configurable

    ## What changes were proposed in this pull request?
    
    This PR include the following changes:
    - Make the capacity of `VectorizedParquetRecordReader` configurable;
    - Make the capacity of `OrcColumnarBatchReader` configurable;
    - Update the error message when required capacity in writable columnar 
vector cannot be fulfilled.
    
    ## How was this patch tested?
    
    N/A

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jiangxb1987/spark vectorCapacity

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20361.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20361
    
----
commit 927c6b4d16b5a4c6457a190f3c1b2b8a5e439f2a
Author: Xingbo Jiang <xingbo.jiang@...>
Date:   2018-01-23T08:14:33Z

    make vector batch size configurable.

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to