[
https://issues.apache.org/jira/browse/OAK-11444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Daniel Iancu updated OAK-11444:
-------------------------------
Description:
Store document ID and empty properties names into a dedicated *_bin* collection
before physically deletion from Mongo nodes collection during full gc.
Motivation behind this change is that in case of accidentally deleting data
that should have not been deleted (not garbage) this `log` of removed documents
and properties will help the complete restoration from backup.
A separate collection was preferred instead of logging to files because is more
reliable. Logs usually needs to be exported to platform like Splunk and the
process does not guarantee that all logs are saved.
The data saved in *_bin* collection is temporary, the cleaning can be done via
setting document TTL or by using an external job to remove it.
was:
Store document ID and empty properties names into a dedicated _bin collection
before physically deletion from Mongo nodes collection during full gc.
Motivation behind this change is that in case of accidentally deleting data
that should have not been deleted (not garbage) this `log` of removed documents
and properties will help the complete restoration from backup.
A separate collection was preferred instead of logging to files because is more
reliable. Logs usually needs to be exported to platform like Splunk and the
process does not guarantee that all logs are saved.
> [full-gc] Save document id and empty properties names before deletion
> ----------------------------------------------------------------------
>
> Key: OAK-11444
> URL: https://issues.apache.org/jira/browse/OAK-11444
> Project: Jackrabbit Oak
> Issue Type: Story
> Components: mongomk
> Reporter: Daniel Iancu
> Priority: Major
>
> Store document ID and empty properties names into a dedicated *_bin*
> collection
> before physically deletion from Mongo nodes collection during full gc.
> Motivation behind this change is that in case of accidentally deleting data
> that should have not been deleted (not garbage) this `log` of removed
> documents and properties will help the complete restoration from backup.
> A separate collection was preferred instead of logging to files because is
> more reliable. Logs usually needs to be exported to platform like Splunk and
> the process does not guarantee that all logs are saved.
> The data saved in *_bin* collection is temporary, the cleaning can be done
> via setting document TTL or by using an external job to remove it.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)