Did you try sstablescrub? If that doesn't work, you can delete all files belonging to this SSTable (every component file with the md-816-big- prefix, not just Index.db) and then run nodetool repair -pr on this node.
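In case it helps, here is a rough Python sketch for finding every file that belongs to the same SSTable as the one named in the exception. It assumes the 3.x naming scheme seen in your trace (version-generation-format-Component, e.g. md-816-big-Index.db); the function name sstable_components is just something I made up for illustration. It only lists files and does not delete anything.

```python
import re
from pathlib import Path

# Assumed 3.x component naming: <version>-<generation>-<format>-<Component>,
# for example md-816-big-Index.db or md-816-big-Data.db.
_COMPONENT = re.compile(
    r"^(?P<version>[a-z]{2})-(?P<generation>\d+)-(?P<fmt>\w+?)-(?P<component>.+)$"
)


def sstable_components(corrupted_file):
    """Return all sibling files that share the SSTable prefix of corrupted_file.

    Hypothetical helper: it only inspects file names in the same directory.
    """
    path = Path(corrupted_file)
    match = _COMPONENT.match(path.name)
    if match is None:
        raise ValueError(f"not a recognised SSTable component name: {path.name}")
    # The trailing '-' keeps generation 816 from also matching 8160.
    prefix = f"{match['version']}-{match['generation']}-{match['fmt']}-"
    return sorted(
        p for p in path.parent.iterdir()
        if p.is_file() and p.name.startswith(prefix)
    )
```

You would call it with the path from the CorruptSSTableException message and review the printed list by hand. I'd stop the node before removing anything, and run the repair once it is back up.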
On Mon, May 6, 2019 at 9:20 AM Roy Burstein <burstein....@gmail.com> wrote:

> Hi,
> We are having issues with Cassandra 3.11.4, after adding node to the
> cluster we get many corrupted files across the cluster (almost all nodes),
> this is reproducible in our env.
> We have 69 nodes in the cluster, disk_access_mode: standard.
>
> The stack trace:
>
> WARN [ReadStage-4] 2019-05-06 06:44:19,843 AbstractLocalAwareExecutorService.java:167 - Uncaught exception on thread Thread[ReadStage-4,5,main]: {}
> java.lang.RuntimeException: org.apache.cassandra.io.sstable.CorruptSSTableException: Corrupted: /var/lib/cassandra/data/disk1/sessions_rawdata/sessions_v2_2019_05_06-9cae0c20585411e99aa867a11519e31c/md-816-big-Index.db
>     at org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2588) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0-zing_19.03.0.0]
>     at org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:162) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:134) [apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:114) [apache-cassandra-3.11.4.jar:3.11.4]
>     at java.lang.Thread.run(Thread.java:748) [na:1.8.0-zing_19.03.0.0]
> Caused by: org.apache.cassandra.io.sstable.CorruptSSTableException: Corrupted: /var/lib/cassandra/data/disk1/sessions_rawdata/sessions_v2_2019_05_06-9cae0c20585411e99aa867a11519e31c/md-816-big-Index.db
>     at org.apache.cassandra.io.sstable.format.big.BigTableReader.getPosition(BigTableReader.java:275) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.io.sstable.format.SSTableReader.getPosition(SSTableReader.java:1586) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.io.sstable.format.big.BigTableReader.iterator(BigTableReader.java:64) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIteratorWithLowerBound.initializeIterator(UnfilteredRowIteratorWithLowerBound.java:108) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.LazilyInitializedUnfilteredRowIterator.maybeInit(LazilyInitializedUnfilteredRowIterator.java:48) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.LazilyInitializedUnfilteredRowIterator.computeNext(LazilyInitializedUnfilteredRowIterator.java:99) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIteratorWithLowerBound.computeNext(UnfilteredRowIteratorWithLowerBound.java:119) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIteratorWithLowerBound.computeNext(UnfilteredRowIteratorWithLowerBound.java:48) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.AbstractIterator.hasNext(AbstractIterator.java:47) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.MergeIterator$Candidate.advance(MergeIterator.java:374) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.MergeIterator$ManyToOne.advance(MergeIterator.java:186) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.MergeIterator$ManyToOne.computeNext(MergeIterator.java:155) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.AbstractIterator.hasNext(AbstractIterator.java:47) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIterators$UnfilteredRowMergeIterator.computeNext(UnfilteredRowIterators.java:525) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIterators$UnfilteredRowMergeIterator.computeNext(UnfilteredRowIterators.java:385) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.utils.AbstractIterator.hasNext(AbstractIterator.java:47) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.rows.UnfilteredRowIterator.isEmpty(UnfilteredRowIterator.java:67) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.SinglePartitionReadCommand.withSSTablesIterated(SinglePartitionReadCommand.java:853) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.SinglePartitionReadCommand.queryMemtableAndDiskInternal(SinglePartitionReadCommand.java:797) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.SinglePartitionReadCommand.queryMemtableAndDisk(SinglePartitionReadCommand.java:670) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.SinglePartitionReadCommand.queryStorage(SinglePartitionReadCommand.java:504) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.ReadCommand.executeLocally(ReadCommand.java:423) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.service.StorageProxy$LocalReadRunnable.runMayThrow(StorageProxy.java:1874) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2584) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     ... 5 common frames omitted
>
> Caused by: java.io.EOFException: EOF after 508 bytes out of 1154
>     at org.apache.cassandra.io.util.DataInputPlus.skipBytesFully(DataInputPlus.java:58) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.RowIndexEntry$Serializer.skipPromotedIndex(RowIndexEntry.java:385) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.db.RowIndexEntry$Serializer.skip(RowIndexEntry.java:376) ~[apache-cassandra-3.11.4.jar:3.11.4]
>     at org.apache.cassandra.io.sstable.format.big.BigTableReader.getPosition(BigTableReader.java:269) ~[apache-cassandra-3.11.4.jar:3.11.4]
>
> Thanks,
> Roy