[ 
https://issues.apache.org/jira/browse/RANGER-4761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fateh Singh updated RANGER-4761:
--------------------------------
    Description: 
* Map<String, Set> getColumnFamilies(Map<byte[], ? extends Collection<?>> 
families) becomes a bottleneck in multiget and multiput workloads, where 
hundreds or thousands of columns may be accessed in a single request. The 
function is computationally heavy: it converts bytes to strings and casts each 
Collection to a set of strings.

The implementation needs to be revisited to reduce its memory footprint.

Allocate the family map (of type Map<String, Set>) lazily instead of ahead of 
time. With this approach, the memory footprint is reduced even further when the 
column auth optimization supported by RANGER-4670 is enabled.
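
The lazy allocation described above could look roughly like the sketch below. 
It is an illustration only, not the Ranger implementation: the class 
LazyFamilyMap and its methods are hypothetical names, column qualifiers are 
assumed to arrive as byte[] elements, and UTF-8 decoding stands in for whatever 
byte-to-string conversion the plugin uses. Family names are decoded up front 
(there are usually few of them), while the per-family column sets are built 
only when a caller asks for them.

```java
import java.nio.charset.StandardCharsets;
import java.util.Collection;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

/** Hypothetical lazy view over a raw family map; not the Ranger implementation. */
final class LazyFamilyMap {
    // Decoded family name -> raw, unconverted column collection.
    private final Map<String, Collection<?>> rawByFamily = new LinkedHashMap<>();
    // Column sets that have already been converted, filled on demand.
    private final Map<String, Set<String>> converted = new HashMap<>();
    // Number of column elements converted so far (kept only to show laziness).
    private int conversions = 0;

    LazyFamilyMap(Map<byte[], ? extends Collection<?>> families) {
        // Only family names are decoded here; no column set is allocated.
        for (Map.Entry<byte[], ? extends Collection<?>> e : families.entrySet()) {
            rawByFamily.put(new String(e.getKey(), StandardCharsets.UTF_8), e.getValue());
        }
    }

    Set<String> familyNames() {
        return Collections.unmodifiableSet(rawByFamily.keySet());
    }

    /** Converts the columns of one family on first access and caches the result. */
    Set<String> columns(String family) {
        Set<String> cached = converted.get(family);
        if (cached != null) {
            return cached;
        }
        Collection<?> raw = rawByFamily.get(family);
        if (raw == null) {
            // Unknown family, or a family with no explicit column list.
            return Collections.emptySet();
        }
        Set<String> result = new HashSet<>(Math.max(16, raw.size() * 2));
        for (Object o : raw) {
            // Assumption: qualifiers are byte[]; anything else falls back to String.valueOf.
            result.add(o instanceof byte[]
                    ? new String((byte[]) o, StandardCharsets.UTF_8)
                    : String.valueOf(o));
            conversions++;
        }
        converted.put(family, result);
        return result;
    }

    int conversions() {
        return conversions;
    }
}
```

With this shape, a request that touches thousands of columns but is authorized 
at the family level never pays for the column conversion, because columns() is 
never called for that family.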

  was:
* Map<String, Set> getColumnFamilies(Map<byte[], ? extends Collection<?>> 
families) becomes a bottleneck in multiget and multiput workloads wherein 
hundred/thousands of columns may be accessed together in a single request as it 
is a computationally heavy function converting bytes to string and type-casting 
Collection to set of strings.

The implementation needs to be revisited to reduce memory footprint

make lazy memory allocation for family map instead of ahead of time memory 
allocation for family map of type Map<String, Set>.


> Reduce memory footprint of hbase plugin
> ---------------------------------------
>
>                 Key: RANGER-4761
>                 URL: https://issues.apache.org/jira/browse/RANGER-4761
>             Project: Ranger
>          Issue Type: Improvement
>          Components: Ranger
>            Reporter: Fateh Singh
>            Assignee: Fateh Singh
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
