[ 
https://issues.apache.org/jira/browse/DRILL-5758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178075#comment-16178075
 ] 

ASF GitHub Bot commented on DRILL-5758:
---------------------------------------

Github user paul-rogers commented on the issue:

    https://github.com/apache/drill/pull/932
  
    Added a fix for repeated columns that have a low cardinality. If, say, one 
row in ten has an array entry, then average cardinality (values per row) is 
0.1. This was represented by an int, rounded to 0 and caused a zero-length 
vector to be allocated. Drill then tried to double the length, which resulted 
in 0, which was doubled again, and so on forever.
    
    The fix has three parts:
    
    * The "record batch sizer" uses floats to allow fractional cardinality.
    * The vector initializer now works with fractional cardinality.
    * If all else fails, if a fixed-width vector is asked to double from zero, 
it sizes the vector to 256 bytes.


> Rollup of external sort fixes to issues found by QA
> ---------------------------------------------------
>
>                 Key: DRILL-5758
>                 URL: https://issues.apache.org/jira/browse/DRILL-5758
>             Project: Apache Drill
>          Issue Type: Task
>    Affects Versions: 1.12.0
>            Reporter: Paul Rogers
>            Assignee: Paul Rogers
>             Fix For: 1.12.0
>
>
> Tracking JIRA to used for the PR that combines fixes for various JIRA 
> entries. Bugs fixed in this task are given by the linked issues.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to