Viraj Jasani created PHOENIX-7500: ------------------------------------- Summary: Remove PARTITION_ID from SYSTEM.CDC_STREAM table's composite pk Key: PHOENIX-7500 URL: https://issues.apache.org/jira/browse/PHOENIX-7500 Project: Phoenix Issue Type: Sub-task Reporter: Viraj Jasani
There are two ways to capture CDC Stream's parent/child relationship among data table's merged regions: # In the current schema of SYSTEM.CDC_STREAM table, provide comma separated parent regions in the PARENT_PARTITION_ID column. # Use one row for each merged parent region. Each row represents one child to parent relationship. Any CDC Consumer can continue consuming partition records from parent to child partitions. Since any num of regions can be merged simultaneously, it can be expensive to write a query that uses IN clause to check whether one of the merged parent region is same as current region (partition) being consumed by the client. Using one parent partition id for each row is efficient solution. In order to achieve this, we need to remove PARTITION_ID from SYSTEM.CDC_STREAM table's composite pk. This is needed because the child partition id remains same for different merged parent regions. -- This message was sent by Atlassian Jira (v8.20.10#820010)