Looks like it's doing a lot of reads immediately on startup (AbstractQueryPager) which is potentially causing a lot of GC (guessing that's what caused the StatusLogger).
DEBUG [SharedPool-Worker-113] 2021-06-30 13:39:04,766 AbstractQueryPager.java:133 - Remaining rows to page: 2147483646 is quite suspicious. You'll want to find out what query is causing a massive scan at startup, you probably need to have a look through the start of the logs to get a better idea at what's happening at startup. On Thu, Jul 1, 2021 at 5:14 AM Fred Habash <fmhab...@gmail.com> wrote: > I have node in cluster when I start c, the cpu reaches 100% with java > process on top. Within a few minutes, jvm crashes (jvm instability) > messages in system.log and c* crashes. > > Once c* is up, cluster average read latency reaches multi-seconds and > client apps are unhappy. For now, the only way out is to drain the node and > let the cluster latency settle. > > None of these measures helped ... > 1. Rebooting the ec2 > 2. Replacing the ec2 altogether (new ec2/ new c* install/ etc). > 3. Stopping compactions (as a diagnostic measure) > Trying to understand why the java process is chewing much cpu i.e. what is > actually happening ... > > I see these error messages in the debug.log. What functional task do these > messages relate to e.g. compactions? > > > DEBUG [SharedPool-Worker-113] 2021-06-30 13:39:04,766 > AbstractQueryPager.java:95 - Fetched 1 live rows > DEBUG [SharedPool-Worker-113] 2021-06-30 13:39:04,766 > AbstractQueryPager.java:112 - Got result (1) smaller than page size (5000), > considering pager exhausted > INFO [Service Thread] 2021-06-30 13:39:04,766 StatusLogger.java:56 - > MemtablePostFlush 0 0 29 0 > 0 > > DEBUG [SharedPool-Worker-113] 2021-06-30 13:39:04,766 > AbstractQueryPager.java:133 - Remaining rows to page: 2147483646 > DEBUG [SharedPool-Worker-113] 2021-06-30 13:39:04,766 > SliceQueryPager.java:92 - Querying next page of slice query; new filter: > SliceQueryFilter [reversed=false, slices=[[, ]], count=5000, toGroup = 0] > INFO [Service Thread] 2021-06-30 13:39:04,766 StatusLogger.java:56 - > ValidationExecutor 0 0 0 0 > 0 > INFO [Service Thread] 2021-06-30 13:39:04,766 StatusLogger.java:56 - > Sampler 0 0 0 0 > 0 > INFO [Service Thread] 2021-06-30 13:39:04,767 StatusLogger.java:56 - > MemtableFlushWriter 0 0 6 0 > 0 > INFO [Service Thread] 2021-06-30 13:39:04,767 StatusLogger.java:56 - > InternalResponseStage 0 0 4 0 > 0 > DEBUG [SharedPool-Worker-131] 2021-06-30 13:39:05,078 > StorageProxy.java:1467 - Read timeout; received 1 of 2 responses (only > digests) > DEBUG [SharedPool-Worker-131] 2021-06-30 13:39:05,079 > SliceQueryPager.java:92 - Querying next page of slice query; new filter: > SliceQueryFilter [reversed=false, slices=[[, ]], count=5000, toGroup = 0] > DEBUG [SharedPool-Worker-158] 2021-06-30 13:39:05,079 > StorageProxy.java:1467 - Read timeout; received 1 of 2 responses (only > digests) > DEBUG [SharedPool-Worker-158] 2021-06-30 13:39:05,079 > SliceQueryPager.java:92 - Querying next page of slice query; new filter: > SliceQueryFilter [reversed=false, slices=[[, ]], count=5000, toGroup = 0] > DEBUG [SharedPool-Worker-90] 2021-06-30 13:39:05,080 StorageProxy.jav > .... > EBUG [SharedPool-Worker-26] 2021-06-30 13:39:01,842 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-5069-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,847 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-5592-big-Data.db > DEBUG [SharedPool-Worker-5] 2021-06-30 13:39:01,849 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-3993-big-Data.db > DEBUG [SharedPool-Worker-5] 2021-06-30 13:39:01,849 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-5927-big-Data.db > DEBUG [SharedPool-Worker-5] 2021-06-30 13:39:01,849 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-1276-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-5949-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-865-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-5741-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-4098-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-1662-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-1339-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,854 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-4598-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,855 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-3676-big-Data.db > DEBUG [SharedPool-Worker-12] 2021-06-30 13:39:01,855 > FileCacheService.java:102 - Evicting cold readers for > /data/cassandra/mykeyspace/mytable-cf0c43b028e811e68f2b1b695a8d5b2c/lb-2814-big-Data.db > DEBUG [SharedPool-Worker-12] 2021- > > We are using c* 2.2.8 > > > ---------------------------------------- > Thank you > > > -- raft.so - Cassandra consulting, support, and managed services