[ https://issues.apache.org/jira/browse/PHOENIX-3000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15333239#comment-15333239 ]
Lars Hofhansl commented on PHOENIX-3000: ---------------------------------------- We should probably also keep track of the size of the map during aggregate and fail if we're over some threshold to keep the region server safe. > Reduce memory consumption during DISTINCT aggregation > ----------------------------------------------------- > > Key: PHOENIX-3000 > URL: https://issues.apache.org/jira/browse/PHOENIX-3000 > Project: Phoenix > Issue Type: Bug > Reporter: Lars Hofhansl > Attachments: 3000.txt > > > In {{DistinctValueWithCountServerAggregator.aggregate}} we hold on the ptr > handed to us from HBase. > Note that this pointer points into an HFile Block, and hence we hold onto the > entire block for the duration of the aggregation. > If the column has high cardinality we might attempt holding the entire table > in memory in the extreme case. -- This message was sent by Atlassian JIRA (v6.3.4#6332)