[ 
https://issues.apache.org/jira/browse/ASTERIXDB-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chen Luo updated ASTERIXDB-1933:
--------------------------------
    Description: 
Currently, the only effective disk component merge policy in AsterixDB is 
prefix merge policy (and its variant correlated prefix merge policy). This 
policy works by inspecting the disk components from the oldest to newest to 
find a sequence of components to merge.

However, one problem with this policy is that it takes O(n^2) complexity to 
merge n flushed disk component into a final disk component. Instead, the 
level-based merge policy can make the complexity as O(n logn).

The basic idea of this level-based merge policy is in 
https://docs.google.com/a/uci.edu/presentation/d/1PWw9fbf9J71YRipDPUmK9U3nx5wP7wDGNGPmazdgNfU/edit?usp=sharing.

  was:
Currently, the only effective disk component merge policy in AsterixDB is 
prefix merge policy (and its invariant correlated prefix merge policy). This 
policy works by inspecting the disk components from the oldest to newest to 
find a sequence of components to merge.

However, one problem with this policy is that it takes O(n^2) complexity to 
merge n flushed disk component into a final disk component. Instead, the 
level-based merge policy can make the complexity as O(n logn).

The basic idea of this level-based merge policy is in 
https://docs.google.com/a/uci.edu/presentation/d/1PWw9fbf9J71YRipDPUmK9U3nx5wP7wDGNGPmazdgNfU/edit?usp=sharing.


> Level-Based Merge Policy
> ------------------------
>
>                 Key: ASTERIXDB-1933
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1933
>             Project: Apache AsterixDB
>          Issue Type: Improvement
>          Components: Storage
>            Reporter: Chen Luo
>            Assignee: Chen Luo
>
> Currently, the only effective disk component merge policy in AsterixDB is 
> prefix merge policy (and its variant correlated prefix merge policy). This 
> policy works by inspecting the disk components from the oldest to newest to 
> find a sequence of components to merge.
> However, one problem with this policy is that it takes O(n^2) complexity to 
> merge n flushed disk component into a final disk component. Instead, the 
> level-based merge policy can make the complexity as O(n logn).
> The basic idea of this level-based merge policy is in 
> https://docs.google.com/a/uci.edu/presentation/d/1PWw9fbf9J71YRipDPUmK9U3nx5wP7wDGNGPmazdgNfU/edit?usp=sharing.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to