vtlim commented on code in PR #12569:
URL: https://github.com/apache/druid/pull/12569#discussion_r890399497


##########
docs/configuration/index.md:
##########
@@ -987,8 +987,11 @@ Automatic compaction config example:
 }
 ```
 
-Compaction tasks fail when higher priority tasks cause Druid to revoke their 
locks. By default, realtime tasks like ingestion have a higher priority than 
compaction tasks. Therefore frequent conflicts between compaction tasks and 
realtime tasks can cause the coordinator's automatic compaction to get stuck.
-You may see this issue with streaming ingestion from Kafka and Kinesis, which 
ingest late-arriving data. To mitigate this problem, set `skipOffsetFromLatest` 
to a value large enough so that arriving data tends to fall outside the offset 
value from the current time. This way you can avoid conflicts between 
compaction tasks and realtime ingestion tasks.
+Compaction tasks fail when higher priority tasks cause Druid to revoke their 
locks. By default, realtime tasks like ingestion have a higher priority than 
compaction tasks. Therefore frequent conflicts between compaction tasks and 
realtime tasks can cause the Coordinator's automatic compaction to get stuck.
+You may see this issue with streaming ingestion from Kafka and Kinesis, which 
ingest late-arriving data.
+
+To mitigate this problem, set `skipOffsetFromLatest` to a value large enough 
so that arriving data tends to fall outside the offset value from the current 
time. This way you can avoid conflicts between compaction tasks and realtime 
ingestion tasks.

Review Comment:
   ```suggestion
   To mitigate this problem, set `skipOffsetFromLatest` to a value large enough 
so that arriving data tends to fall outside the offset value from the current 
time. This way you can avoid conflicts between compaction tasks and realtime 
ingestion tasks.
   For example, if you want to skip over segments from thirty days prior to the 
end time of the most recent segment, assign `"skipOffsetFromLatest": "P30D"`.
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to