[jira] [Commented] (KUDU-3367) Delta file with full of delete op can not be schedule to compact

2022-05-30 Thread YifanZhang (Jira)


[ 
https://issues.apache.org/jira/browse/KUDU-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17544095#comment-17544095
 ] 

YifanZhang commented on KUDU-3367:
--

Maybe related to KUDU-1625.

> Delta file with full of delete op can not be schedule to compact
> 
>
> Key: KUDU-3367
> URL: https://issues.apache.org/jira/browse/KUDU-3367
> Project: Kudu
>  Issue Type: New Feature
>  Components: compaction
>Reporter: dengke
>Assignee: dengke
>Priority: Major
> Attachments: image-2022-05-09-14-13-16-525.png, 
> image-2022-05-09-14-16-31-828.png, image-2022-05-09-14-18-05-647.png, 
> image-2022-05-09-14-19-56-933.png, image-2022-05-09-14-21-47-374.png, 
> image-2022-05-09-14-23-43-973.png, image-2022-05-09-14-26-45-313.png, 
> image-2022-05-09-14-32-51-573.png
>
>
> If a REDO delta file is full of delete ops, meaning it contains no update 
> ops, the current compaction algorithm will not schedule the file for 
> compaction. If such files exist and accumulate over time, they 
> will greatly reduce our scan speed. However, processing such files on every 
> compaction reduces compaction performance.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Commented] (KUDU-3367) Delta file with full of delete op can not be schedule to compact

2022-05-30 Thread YifanZhang (Jira)


[ 
https://issues.apache.org/jira/browse/KUDU-3367?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17543808#comment-17543808
 ] 

YifanZhang commented on KUDU-3367:
--

I'm curious whether setting `tablet_history_max_age_sec` to a small value 
would help in your case. If so, would DeletedRowsetGCOp be scheduled and the 
empty RowSets be deleted in time?



