subject:"RocksDB segfault on state restore"

Re: RocksDB segfault on state restore

2023-06-02 Thread Alexis Sarda-Espinosa

Hello, A couple of potentially relevant pieces of information: 1. https://issues.apache.org/jira/browse/FLINK-16686 2. https://stackoverflow.com/a/64721838/5793905 (question was about schema evolution, but the answer is more generally applicable) Regards, Alexis. Am Fr., 2. Juni 2023 um 07:18

Re: RocksDB segfault on state restore

2023-06-01 Thread Gyula Fóra

Hi! In our case, no schema evolution was triggered , only the TTL was set from the beginning as far as I remember. I will double check Gyula On Fri, 2 Jun 2023 at 06:12, Hangxiang Yu wrote: > Hi, Gyula. > It seems related to https://issues.apache.org/jira/browse/FLINK-23346. > We also saw

Re: RocksDB segfault on state restore

2023-06-01 Thread Hangxiang Yu

Hi, Gyula. It seems related to https://issues.apache.org/jira/browse/FLINK-23346. We also saw core dump while using list state after triggering state migration and ttl compaction filter. Have you triggered the schema evolution ? It seems a bug of the rocksdb list state together with ttl compaction

RocksDB segfault on state restore

2023-05-17 Thread Gyula Fóra

Hi All! We are encountering an error on a larger stateful job (around 1 TB + state) on restore from a rocksdb checkpoint. The taskmanagers keep crashing with a segfault coming from the rocksdb native logic and seem to be related to the FlinkCompactionFilter mechanism. The gist with the full