[ 
https://issues.apache.org/jira/browse/IMPALA-6311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16485741#comment-16485741
 ] 

Mostafa Mokhtar commented on IMPALA-6311:
-----------------------------------------

Actually I was thinking of reducing the default maximum size.

The main bottleneck here is that the memory used for handling the runtime 
filter RPCs and aggregating them is untracked.
Increasing the default maximum filter size or the number of filters increases 
that untracked memory and makes things worse on the coordinator.

> Evaluate smaller FPP for Bloom filters
> --------------------------------------
>
>                 Key: IMPALA-6311
>                 URL: https://issues.apache.org/jira/browse/IMPALA-6311
>             Project: IMPALA
>          Issue Type: Task
>          Components: Perf Investigation
>            Reporter: Jim Apple
>            Priority: Major
>
> The Bloom filters are created by estimating the NDV and then using the FPP of 
> 75% to get the right size for the filter. This may be too high to be very 
> useful - if our filters are currently filtering more than 75% out, then it is 
> only because we are overestimating NDV.
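
To make the size trade-off concrete, here is a minimal sketch of how filter size grows as the target FPP shrinks. It assumes the textbook formula for a classic Bloom filter with an optimal number of hash functions, m = -n * ln(p) / (ln 2)^2. Impala's own filter layout and sizing code may differ, and the function name bloom_bits is hypothetical, so treat the numbers as rough estimates only.

```python
import math


def bloom_bits(ndv: int, fpp: float) -> int:
    """Estimate the bits needed by a classic Bloom filter.

    Assumption: textbook sizing with an optimal hash count,
    m = -n * ln(p) / (ln 2)^2. This is not Impala's actual sizing code.
    """
    if ndv <= 0:
        raise ValueError("ndv must be positive")
    if not 0.0 < fpp < 1.0:
        raise ValueError("fpp must be strictly between 0 and 1")
    return math.ceil(-ndv * math.log(fpp) / (math.log(2) ** 2))


if __name__ == "__main__":
    ndv = 1_000_000  # illustrative NDV estimate, not a measured value
    for fpp in (0.75, 0.5, 0.1, 0.01):
        bits = bloom_bits(ndv, fpp)
        print(f"fpp={fpp:<5} bits/value={bits / ndv:6.2f} "
              f"size={bits / 8 / 1024:8.1f} KiB")
```

Under this model a 75% FPP costs about 0.6 bits per distinct value and a 1% FPP about 9.6, roughly a 16x increase. That is the tension with the comment above: a smaller FPP means larger filters, and so more memory for the coordinator to receive and aggregate.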



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
