[jira] [Updated] (CASSANDRA-19569) sstableupgrade is very slow
[ https://issues.apache.org/jira/browse/CASSANDRA-19569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Norbert Schultz updated CASSANDRA-19569: Platform: Java11,Linux (was: All) Description: We are in the process of migrating cassandra from 3.11.x to 4.1.4 and upgrading the sstables using sstableupgrade from Cassandra V4.1.4, from `me-` to `nb-` Format Unfortunately, the process is very very slow (less than 0.5 MB/s). Some observations: - The process is only slow on (fast) SSDs, but not on ram disks. - The sstables consist of many partitions (this may be unrelated) - The upgrade process is fast, if we use `automatic_sstable_upgrade` instead of the sstableupgradetool. - We give enough RAM (export MAX_HEAP_SIZE=8g) On profiling, we found out, that sstableupgrade is burning most CPU time on {{posix_fadvise}} (see flamegraph_sstableupgrade.png ). My naive interpretation of the whole {{maybeReopenEarly}} to {{posix_fadvise}} chain is, that the process just informs the linux kernel, that the written data should not be cached. If we comment out the call to {{NativeLibrary.trySkipCache}}, the conversion is running at expected 10MB/s (see flamegraph_ok.png ) was: We are in the process of migrating cassandra from 3.11.x to 4.1.4 and upgrading the sstables using sstableupgrade from Cassandra V4.1.4, from `me-` to `nb-` Format Unfortunately, the process is very very slow (less than 0.5 MB/s). Some observations: - The process is only slow on (fast) SSDs, but not on ram disks. - The sstables consist of many partitions (this may be unrelated) - The upgrade process is fast, if we use `automatic_sstable_upgrade` instead of the sstableupgradetool. - We give enough RAM (export MAX_HEAP_SIZE=8g) On profiling, we found out, that sstableupgrade is burning most CPU time on {{posix_fadvise}} (see attached !flamegraph_sstableupgrade.png! ). My naive interpretation of the whole {{maybeReopenEarly}} to {{posix_fadvise}} chain is, that the process just informs the linux kernel, that the written data should not be cached. If we comment out the call to {{NativeLibrary.trySkipCache}}, the conversion is running at expected 10MB/s (see !flamegraph_ok.png! ) > sstableupgrade is very slow > --- > > Key: CASSANDRA-19569 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19569 > Project: Cassandra > Issue Type: Bug >Reporter: Norbert Schultz >Priority: Normal > Attachments: flamegraph_ok.png, flamegraph_sstableupgrade.png > > > We are in the process of migrating cassandra from 3.11.x to 4.1.4 and > upgrading the sstables using sstableupgrade from Cassandra V4.1.4, from `me-` > to `nb-` Format > Unfortunately, the process is very very slow (less than 0.5 MB/s). > Some observations: > - The process is only slow on (fast) SSDs, but not on ram disks. > - The sstables consist of many partitions (this may be unrelated) > - The upgrade process is fast, if we use `automatic_sstable_upgrade` instead > of the sstableupgradetool. > - We give enough RAM (export MAX_HEAP_SIZE=8g) > On profiling, we found out, that sstableupgrade is burning most CPU time on > {{posix_fadvise}} (see flamegraph_sstableupgrade.png ). > My naive interpretation of the whole {{maybeReopenEarly}} to > {{posix_fadvise}} chain is, that the process just informs the linux kernel, > that the written data should not be cached. If we comment out the call to > {{NativeLibrary.trySkipCache}}, the conversion is running at expected 10MB/s > (see flamegraph_ok.png ) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Created] (CASSANDRA-19569) sstableupgrade is very slow
Norbert Schultz created CASSANDRA-19569: --- Summary: sstableupgrade is very slow Key: CASSANDRA-19569 URL: https://issues.apache.org/jira/browse/CASSANDRA-19569 Project: Cassandra Issue Type: Bug Reporter: Norbert Schultz Attachments: flamegraph_ok.png, flamegraph_sstableupgrade.png We are in the process of migrating cassandra from 3.11.x to 4.1.4 and upgrading the sstables using sstableupgrade from Cassandra V4.1.4, from `me-` to `nb-` Format Unfortunately, the process is very very slow (less than 0.5 MB/s). Some observations: - The process is only slow on (fast) SSDs, but not on ram disks. - The sstables consist of many partitions (this may be unrelated) - The upgrade process is fast, if we use `automatic_sstable_upgrade` instead of the sstableupgradetool. - We give enough RAM (export MAX_HEAP_SIZE=8g) On profiling, we found out, that sstableupgrade is burning most CPU time on {{posix_fadvise}} (see attached !flamegraph_sstableupgrade.png! ). My naive interpretation of the whole {{maybeReopenEarly}} to {{posix_fadvise}} chain is, that the process just informs the linux kernel, that the written data should not be cached. If we comment out the call to {{NativeLibrary.trySkipCache}}, the conversion is running at expected 10MB/s (see !flamegraph_ok.png! ) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Berenguer Blasi updated CASSANDRA-19565: Reviewers: Berenguer Blasi > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug > Components: Packaging >Reporter: Thomas De Keulenaer >Assignee: Brandon Williams >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838434#comment-17838434 ] Jon Haddad commented on CASSANDRA-19564: {quote}Is the Flush that is blocked the one that the postFlush is waiting on? You can check this from a heap dump. {quote} Looking at the incoming references in the heap dump I can see the reference to the MemtableFlushWriter on thread 1231, which is one of the memtable threads that's in WAITING. !image-2024-04-17-18-46-29-474.png! Should any of the write barrier be marked isBlocking=true after this runs? {{writeBarrier.markBlocking();}} {{writeBarrier.await();}} !image-2024-04-17-19-14-34-344.png! Does getting past the write barrier await require other mutations to complete? If we're unable to get memory from the memtable pool, would that block the write barrier? Fwiw, I am not 100% confident of my timeline as it's based on output from the status logger and it's entirely possible that > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png, image-2024-04-17-18-46-29-474.png, > image-2024-04-17-19-13-06-769.png, image-2024-04-17-19-14-34-344.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
Re: [PR] CASSANDRA-19563: Support bulk write via S3 [cassandra-analytics]
yifan-c commented on code in PR #53: URL: https://github.com/apache/cassandra-analytics/pull/53#discussion_r1569813654 ## scripts/build-sidecar.sh: ## @@ -24,7 +24,7 @@ else SCRIPT_DIR=$( dirname -- "$( readlink -f -- "$0"; )"; ) SIDECAR_REPO="${SIDECAR_REPO:-https://github.com/apache/cassandra-sidecar.git}"; SIDECAR_BRANCH="${SIDECAR_BRANCH:-trunk}" - SIDECAR_COMMIT="${SIDECAR_COMMIT:-20795db4d708b9287e0a2281695923bfb6fa9138}" + SIDECAR_COMMIT="${SIDECAR_COMMIT:-686d9e8699bd0a56857e24523d883120023b9841}" Review Comment: will do -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jon Haddad updated CASSANDRA-19564: --- Attachment: image-2024-04-17-19-14-34-344.png > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png, image-2024-04-17-18-46-29-474.png, > image-2024-04-17-19-13-06-769.png, image-2024-04-17-19-14-34-344.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jon Haddad updated CASSANDRA-19564: --- Attachment: image-2024-04-17-19-13-06-769.png > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png, image-2024-04-17-18-46-29-474.png, > image-2024-04-17-19-13-06-769.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jon Haddad updated CASSANDRA-19564: --- Attachment: image-2024-04-17-18-46-29-474.png > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png, image-2024-04-17-18-46-29-474.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRASC-94) Reduce filesystem calls while streaming SSTables
[ https://issues.apache.org/jira/browse/CASSANDRASC-94?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francisco Guerrero updated CASSANDRASC-94: -- Description: When streaming snapshotted SSTables from Cassandra Sidecar, Sidecar will perform multiple filesystem calls: - Traverse the data directories to determine the keyspace / table path - Once found determine if the SSTable file exists under the snapshots directory - Read the filesystem to obtain the file type and file size - Read the requested range of the file and stream it The amount of filesystem calls is manageable for streaming a single SSTable, but when a client(s) read multiple SSTables, for example in the case of Cassandra Analytics bulk reads, hundred to thousand of requests are performed requiring every request to perform the above system calls. In this improvement, it is proposed introducing several two to reduce the amount of system calls while streaming SSTables: 1. *Cache all data file locations*: This is cached once and it will not change during the lifecycle of the application. The values come from the Storage Service MBean {{getAllDataFileLocations}} method. 2. *snapshot list cache*: to maintain a cache of recently listed snapshot files under a snapshot directory. This cache avoids having to access the filesystem every time a bulk read client list the snapshot directory. This is a short lived cache and can be disabled if the snapshot list is expected to be large. was: When streaming snapshotted SSTables from Cassandra Sidecar, Sidecar will perform multiple filesystem calls: - Traverse the data directories to determine the keyspace / table path - Once found determine if the SSTable file exists under the snapshots directory - Read the filesystem to obtain the file type and file size - Read the requested range of the file and stream it The amount of filesystem calls is manageable for streaming a single SSTable, but when a client(s) read multiple SSTables, for example in the case of Cassandra Analytics bulk reads, hundred to thousand of requests are performed requiring every request to perform the above system calls. In this improvement, it is proposed introducing several caches to reduce the amount of system calls while streaming SSTables. - *snapshot list cache*: to maintain a cache of recently listed snapshot files under a snapshot directory. This cache avoids having to access the filesystem every time a bulk read client list the snapshot directory. - *table dir cache*: to maintain a cache of recently streamed table directory paths. This cache helps avoiding having to traverse the filesystem searching for the table directory while running bulk reads for example. Since bulk reads can stream tens to hundreds of SSTable components from a snapshot directory, this cache helps avoid having to resolve the table directory each time. - *snapshot path cache*: to maintain a cache of recently streamed snapshot SSTable components. This cache avoids having to resolve the snapshot SSTable component path during bulk reads. Since bulk reads streams sub-ranges of an SSTable component, the resolution can happen multiple times during bulk reads for a single SSTable component. - *file props cache*: to maintain a cache of FileProps of recently streamed files. This cache avoids having to validate file properties during bulk reads for example where sub-ranges of an SSTable component are streamed, therefore reading the file properties can occur multiple times during bulk reads of the same file. > Reduce filesystem calls while streaming SSTables > > > Key: CASSANDRASC-94 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-94 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Configuration >Reporter: Francisco Guerrero >Assignee: Francisco Guerrero >Priority: Normal > Labels: pull-request-available > > When streaming snapshotted SSTables from Cassandra Sidecar, Sidecar will > perform multiple filesystem calls: > - Traverse the data directories to determine the keyspace / table path > - Once found determine if the SSTable file exists under the snapshots > directory > - Read the filesystem to obtain the file type and file size > - Read the requested range of the file and stream it > The amount of filesystem calls is manageable for streaming a single SSTable, > but when a client(s) read multiple SSTables, for example in the case of > Cassandra Analytics bulk reads, hundred to thousand of requests are performed > requiring every request to perform the above system calls. > In this improvement, it is proposed introducing several two to reduce the > amount of system calls while streaming SSTables: > 1. *Cache all data file locations*: This is cached once and
Re: [PR] CASSANDRA-19563: Support bulk write via S3 [cassandra-analytics]
frankgh commented on code in PR #53: URL: https://github.com/apache/cassandra-analytics/pull/53#discussion_r1569627320 ## cassandra-analytics-core/src/main/java/org/apache/cassandra/spark/bulkwriter/token/ConsistencyLevel.java: ## @@ -19,16 +19,42 @@ package org.apache.cassandra.spark.bulkwriter.token; +import java.util.Collection; +import java.util.Objects; import java.util.Set; -import org.slf4j.Logger; -import org.slf4j.LoggerFactory; +import com.google.common.base.Preconditions; + +import org.apache.cassandra.spark.common.model.CassandraInstance; +import org.apache.cassandra.spark.data.ReplicationFactor; public interface ConsistencyLevel { +/** + * Whether the consistency level only considers replicas in the local data center. + * + * @return true if only considering the local replicas; otherwise, return false + */ boolean isLocal(); -Logger LOGGER = LoggerFactory.getLogger(ConsistencyLevel.class); +/** + * Check consistency level with the collection of the succeeded instances + * + * @param succeededInstances the succeeded instances in the replica set + * @param replicationFactor replication factor to check with + * @param localDC the local data center name if required for the check + * @return true means the consistency level is _definitively_ satisfied. + * Meanwhile, returning false means no conclusion can be drawn + */ +boolean canBeSatisfied(Collection succeededInstances, Review Comment: it feels like this class should have unit tests. Maybe a follow up? ## cassandra-analytics-core/src/test/java/org/apache/cassandra/spark/utils/XXHash32DigestAlgorithmTest.java: ## @@ -49,7 +49,7 @@ class XXHash32DigestAlgorithmTest // $ xxh32sum file2.txt # -> ef976cbe // $ xxh32sum file3.txt # -> 08321e1e -@ParameterizedTest(name = "{index} fileName={0} expectedMd5={1}") +@ParameterizedTest(name = "{index} fileName={0} expectedDigest={1}") Review Comment: 👍 ## gradle.properties: ## @@ -16,7 +16,7 @@ # under the License. group=org.apache.cassandra.spark -version=1.0.0 +version=1.0.0.36 Review Comment: this shouldn't change ```suggestion version=1.0.0 ``` ## scripts/build-sidecar.sh: ## @@ -47,22 +47,27 @@ else cd "${SIDECAR_BUILD_DIR}" echo "branch ${SIDECAR_BRANCH} sha ${SIDECAR_COMMIT}" # check out the correct cassandra version: + # if the SIDECAR_BRANCH directory does not exist; we initialize from the scratch Review Comment: great comments! ## cassandra-analytics-core/src/main/java/org/apache/cassandra/spark/bulkwriter/CancelJobEvent.java: ## @@ -0,0 +1,38 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more contributor license agreements. See the NOTICE file + * distributed with this work for additional information + * regarding copyright ownership. The ASF licenses this file + * to you under the Apache License, Version 2.0 (the + * "License"); you may not use this file except in compliance + * with the License. You may obtain a copy of the License at + * + * http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, + * software distributed under the License is distributed on an + * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY + * KIND, either express or implied. See the License for the + * specific language governing permissions and limitations + * under the License. + */ + +package org.apache.cassandra.spark.bulkwriter; + +public class CancelJobEvent Review Comment: can we add some javadocs? ## gradle.properties: ## @@ -31,5 +31,6 @@ spark=3 # force version 4.5.1 of vertx to prevent issues initializing io.vertx.core.json.jackson.JacksonCodec, # which requires a newer version of jackson, which is not available in spark 2 vertxVersion=4.5.1 +aswSdkVersion=2.20.43 Review Comment: this is trailing very behind in the current dependency. Should we bump this to the latest or a more recent version? ## scripts/build-sidecar.sh: ## @@ -24,7 +24,7 @@ else SCRIPT_DIR=$( dirname -- "$( readlink -f -- "$0"; )"; ) SIDECAR_REPO="${SIDECAR_REPO:-https://github.com/apache/cassandra-sidecar.git}"; SIDECAR_BRANCH="${SIDECAR_BRANCH:-trunk}" - SIDECAR_COMMIT="${SIDECAR_COMMIT:-20795db4d708b9287e0a2281695923bfb6fa9138}" + SIDECAR_COMMIT="${SIDECAR_COMMIT:-686d9e8699bd0a56857e24523d883120023b9841}" Review Comment: It looks like there's a new SHA, should we bump it ? ## cassandra-analytics-core/src/main/java/org/apache/cassandra/spark/bulkwriter/CassandraTopologyMonitor.java: ## @@ -0,0 +1,107 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more contributor license agreements. See the NOTICE file + * distributed with this work for additional information + * regarding copy
[PR] CASSANDRA-19568: Jennkins pipeline's default Java version for Jabba has changed [cassandra-java-driver]
SiyaoIsHiding opened a new pull request, #1929: URL: https://github.com/apache/cassandra-java-driver/pull/1929 For DataStax internal jenkins. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Created] (CASSANDRA-19568) Jennkins pipeline's default Java version for Jabba has changed
Jane He created CASSANDRA-19568: --- Summary: Jennkins pipeline's default Java version for Jabba has changed Key: CASSANDRA-19568 URL: https://issues.apache.org/jira/browse/CASSANDRA-19568 Project: Cassandra Issue Type: Bug Components: Client/java-driver Reporter: Jane He Assignee: Henry Hughes We need Java 8 to build the driver. In the past, `jabba use default` will use Java 8. After a jenkins runner update, it uses java 11 now. Therefore, we have to specify `jabba use 1.8` instead before we build the driver. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838391#comment-17838391 ] Brandon Williams commented on CASSANDRA-19565: -- I have a patch [here|https://github.com/driftx/cassandra/tree/CASSANDRA-19565-4.1] that will set the ownership as follows: {noformat} drwxr-xr-x 6 cassandra cassandra 68 Apr 17 22:03 . drwxr-xr-x 1 root root 131 Apr 17 22:03 .. drwxr-xr-x 2 cassandra cassandra 6 Apr 17 21:48 commitlog drwxr-xr-x 2 cassandra cassandra 6 Apr 17 21:48 data drwxr-xr-x 2 cassandra cassandra 6 Apr 17 21:48 hints drwxr-xr-x 2 cassandra cassandra 6 Apr 17 21:48 saved_caches {noformat} Debian packaging already creates /var/lib/cassandra with the correct ownership. Now I just need to figure out CI for packaging on 4.1. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug > Components: Packaging >Reporter: Thomas De Keulenaer >Assignee: Brandon Williams >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francisco Guerrero updated CASSANDRASC-117: --- Fix Version/s: 1.0 Source Control Link: https://github.com/apache/cassandra-sidecar/commit/fd6f7ac5f9f19dbbeeb9e7f80ca1fcbf60d5a4c6 Resolution: Fixed Status: Resolved (was: Ready to Commit) > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > Fix For: 1.0 > > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838383#comment-17838383 ] ASF subversion and git services commented on CASSANDRASC-117: - Commit fd6f7ac5f9f19dbbeeb9e7f80ca1fcbf60d5a4c6 in cassandra-sidecar's branch refs/heads/trunk from Saranya Krishnakumar [ https://gitbox.apache.org/repos/asf?p=cassandra-sidecar.git;h=fd6f7ac ] CASSANDRASC-117 Record existing and additional metrics with dropwizard (#111) Patch by Saranya Krishnakumar; Reviewed by Yifan Cai, Francisco Guerrero for CASSANDRASC-117 > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838376#comment-17838376 ] Benedict Elliott Smith commented on CASSANDRA-19564: Honestly a jstack output during the issue would probably be enough to spot a candidate issue. If you have one feel free to back channel it to me for a quick peek, in case I can easily spot something to dig into. > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19567) Minimize the heap consumption when registering metrics
[ https://issues.apache.org/jira/browse/CASSANDRA-19567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Muzafarov updated CASSANDRA-19567: Discovered By: DTest (was: User Report) > Minimize the heap consumption when registering metrics > -- > > Key: CASSANDRA-19567 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19567 > Project: Cassandra > Issue Type: Bug > Components: Observability/Metrics >Reporter: Maxim Muzafarov >Assignee: Maxim Muzafarov >Priority: Normal > Fix For: 5.x > > > The problem is only reproducible on the x86 machine, the problem is not > reproducible on the arm64. A quick analysis showed a lot of MetricName > objects stored in the heap, although the real cause could be related to > something else, the MetricName object requires extra attention. > To reproduce run the command run locally: > {code} > ant test-jvm-dtest-some > -Dtest.name=org.apache.cassandra.distributed.test.ReadRepairTest > {code} > The error: > {code:java} > [junit-timeout] Exception in thread "main" java.lang.OutOfMemoryError: Java > heap space > [junit-timeout] at > java.base/java.lang.StringLatin1.newString(StringLatin1.java:769) > [junit-timeout] at > java.base/java.lang.StringBuffer.toString(StringBuffer.java:716) > [junit-timeout] at > org.apache.cassandra.CassandraBriefJUnitResultFormatter.endTestSuite(CassandraBriefJUnitResultFormatter.java:191) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.fireEndTestSuite(JUnitTestRunner.java:854) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.run(JUnitTestRunner.java:578) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.launch(JUnitTestRunner.java:1197) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.main(JUnitTestRunner.java:1042) > [junit-timeout] Testsuite: > org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED > [junit-timeout] Testsuite: > org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED > Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0 sec > [junit-timeout] > [junit-timeout] Testcase: > org.apache.cassandra.distributed.test.ReadRepairTest:readRepairRTRangeMovementTest-cassandra.testtag_IS_UNDEFINED: > Caused an ERROR > [junit-timeout] Forked Java VM exited abnormally. Please note the time in the > report does not reflect the time until the VM exit. > [junit-timeout] junit.framework.AssertionFailedError: Forked Java VM exited > abnormally. Please note the time in the report does not reflect the time > until the VM exit. > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] > [junit-timeout] > [junit-timeout] Test org.apache.cassandra.distributed.test.ReadRepairTest > FAILED (crashed)BUILD FAILED > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19567) Minimize the heap consumption when registering metrics
[ https://issues.apache.org/jira/browse/CASSANDRA-19567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Muzafarov updated CASSANDRA-19567: Bug Category: Parent values: Degradation(12984)Level 1 values: Resource Management(12995) Complexity: Normal Discovered By: User Report Fix Version/s: 5.x Platform: x86 (was: All) Severity: Normal Status: Open (was: Triage Needed) > Minimize the heap consumption when registering metrics > -- > > Key: CASSANDRA-19567 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19567 > Project: Cassandra > Issue Type: Bug > Components: Observability/Metrics >Reporter: Maxim Muzafarov >Assignee: Maxim Muzafarov >Priority: Normal > Fix For: 5.x > > > The problem is only reproducible on the x86 machine, the problem is not > reproducible on the arm64. A quick analysis showed a lot of MetricName > objects stored in the heap, although the real cause could be related to > something else, the MetricName object requires extra attention. > To reproduce run the command run locally: > {code} > ant test-jvm-dtest-some > -Dtest.name=org.apache.cassandra.distributed.test.ReadRepairTest > {code} > The error: > {code:java} > [junit-timeout] Exception in thread "main" java.lang.OutOfMemoryError: Java > heap space > [junit-timeout] at > java.base/java.lang.StringLatin1.newString(StringLatin1.java:769) > [junit-timeout] at > java.base/java.lang.StringBuffer.toString(StringBuffer.java:716) > [junit-timeout] at > org.apache.cassandra.CassandraBriefJUnitResultFormatter.endTestSuite(CassandraBriefJUnitResultFormatter.java:191) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.fireEndTestSuite(JUnitTestRunner.java:854) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.run(JUnitTestRunner.java:578) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.launch(JUnitTestRunner.java:1197) > [junit-timeout] at > org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.main(JUnitTestRunner.java:1042) > [junit-timeout] Testsuite: > org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED > [junit-timeout] Testsuite: > org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED > Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0 sec > [junit-timeout] > [junit-timeout] Testcase: > org.apache.cassandra.distributed.test.ReadRepairTest:readRepairRTRangeMovementTest-cassandra.testtag_IS_UNDEFINED: > Caused an ERROR > [junit-timeout] Forked Java VM exited abnormally. Please note the time in the > report does not reflect the time until the VM exit. > [junit-timeout] junit.framework.AssertionFailedError: Forked Java VM exited > abnormally. Please note the time in the report does not reflect the time > until the VM exit. > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] at > jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) > [junit-timeout] at > java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > [junit-timeout] > [junit-timeout] > [junit-timeout] Test org.apache.cassandra.distributed.test.ReadRepairTest > FAILED (crashed)BUILD FAILED > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Created] (CASSANDRA-19567) Minimize the heap consumption when registering metrics
Maxim Muzafarov created CASSANDRA-19567: --- Summary: Minimize the heap consumption when registering metrics Key: CASSANDRA-19567 URL: https://issues.apache.org/jira/browse/CASSANDRA-19567 Project: Cassandra Issue Type: Bug Components: Observability/Metrics Reporter: Maxim Muzafarov Assignee: Maxim Muzafarov The problem is only reproducible on the x86 machine, the problem is not reproducible on the arm64. A quick analysis showed a lot of MetricName objects stored in the heap, although the real cause could be related to something else, the MetricName object requires extra attention. To reproduce run the command run locally: {code} ant test-jvm-dtest-some -Dtest.name=org.apache.cassandra.distributed.test.ReadRepairTest {code} The error: {code:java} [junit-timeout] Exception in thread "main" java.lang.OutOfMemoryError: Java heap space [junit-timeout] at java.base/java.lang.StringLatin1.newString(StringLatin1.java:769) [junit-timeout] at java.base/java.lang.StringBuffer.toString(StringBuffer.java:716) [junit-timeout] at org.apache.cassandra.CassandraBriefJUnitResultFormatter.endTestSuite(CassandraBriefJUnitResultFormatter.java:191) [junit-timeout] at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.fireEndTestSuite(JUnitTestRunner.java:854) [junit-timeout] at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.run(JUnitTestRunner.java:578) [junit-timeout] at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.launch(JUnitTestRunner.java:1197) [junit-timeout] at org.apache.tools.ant.taskdefs.optional.junit.JUnitTestRunner.main(JUnitTestRunner.java:1042) [junit-timeout] Testsuite: org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED [junit-timeout] Testsuite: org.apache.cassandra.distributed.test.ReadRepairTest-cassandra.testtag_IS_UNDEFINED Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 0 sec [junit-timeout] [junit-timeout] Testcase: org.apache.cassandra.distributed.test.ReadRepairTest:readRepairRTRangeMovementTest-cassandra.testtag_IS_UNDEFINED: Caused an ERROR [junit-timeout] Forked Java VM exited abnormally. Please note the time in the report does not reflect the time until the VM exit. [junit-timeout] junit.framework.AssertionFailedError: Forked Java VM exited abnormally. Please note the time in the report does not reflect the time until the VM exit. [junit-timeout] at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) [junit-timeout] at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) [junit-timeout] at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) [junit-timeout] at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [junit-timeout] at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) [junit-timeout] at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [junit-timeout] at java.base/java.util.Vector.forEach(Vector.java:1365) [junit-timeout] at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) [junit-timeout] at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [junit-timeout] at jdk.internal.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) [junit-timeout] at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) [junit-timeout] [junit-timeout] [junit-timeout] Test org.apache.cassandra.distributed.test.ReadRepairTest FAILED (crashed)BUILD FAILED {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19493) Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size
[ https://issues.apache.org/jira/browse/CASSANDRA-19493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Semb Wever updated CASSANDRA-19493: --- Fix Version/s: 5.0-beta2 > Optionally fail writes when SAI refuses to index a term value exceeding a > configured maximum size > - > > Key: CASSANDRA-19493 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19493 > Project: Cassandra > Issue Type: Bug > Components: Feature/SAI >Reporter: Caleb Rackliffe >Assignee: Caleb Rackliffe >Priority: Normal > Fix For: 5.0-beta2, 5.0, 5.1 > > Attachments: ci_summary-1.html, ci_summary.html > > Time Spent: 6h 50m > Remaining Estimate: 0h > > SAI currently emits a client warning when we try to index a text value larger > than {{{}cassandra.sai.max_string_term_size{}}}. It might be nice to have a > hard limit as well, above which we can reject the mutation entirely. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19534) unbounded queues in native transport requests lead to node instability
[ https://issues.apache.org/jira/browse/CASSANDRA-19534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838360#comment-17838360 ] Alex Petrov commented on CASSANDRA-19534: - Do you have observability data from the cluster per chance? Would you be able to maybe check out the Native, Read, and Write stages pending request counts? > unbounded queues in native transport requests lead to node instability > -- > > Key: CASSANDRA-19534 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19534 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Local Write-Read Paths >Reporter: Jon Haddad >Assignee: Alex Petrov >Priority: Normal > Fix For: 5.0-rc, 5.x > > > When a node is under pressure, hundreds of thousands of requests can show up > in the native transport queue, and it looks like it can take way longer to > timeout than is configured. We should be shedding load much more > aggressively and use a bounded queue for incoming work. This is extremely > evident when we combine a resource consuming workload with a smaller one: > Running 5.0 HEAD on a single node as of today: > {noformat} > # populate only > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --maxrlat 100 --populate > 10m --rate 50k -n 1 > # workload 1 - larger reads > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --rate 200 -d 1d > # second workload - small reads > easy-cass-stress run KeyValue -p 1m --rate 20k -r .5 -d 24h{noformat} > It appears our results don't time out at the requested server time either: > > {noformat} > Writes Reads > Deletes Errors > Count Latency (p99) 1min (req/s) | Count Latency (p99) 1min (req/s) | > Count Latency (p99) 1min (req/s) | Count 1min (errors/s) > 950286 70403.93 634.77 | 789524 70442.07 426.02 | > 0 0 0 | 9580484 18980.45 > 952304 70567.62 640.1 | 791072 70634.34 428.36 | > 0 0 0 | 9636658 18969.54 > 953146 70767.34 640.1 | 791400 70767.76 428.36 | > 0 0 0 | 9695272 18969.54 > 956833 71171.28 623.14 | 794009 71175.6 412.79 | > 0 0 0 | 9749377 19002.44 > 959627 71312.58 656.93 | 795703 71349.87 435.56 | > 0 0 0 | 9804907 18943.11{noformat} > > After stopping the load test altogether, it took nearly a minute before the > requests were no longer queued. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19534) unbounded queues in native transport requests lead to node instability
[ https://issues.apache.org/jira/browse/CASSANDRA-19534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838359#comment-17838359 ] Brandon Williams commented on CASSANDRA-19534: -- The p99 from easy-cass-stress does creep up on 4.1 as well but at a much slower rate so it's not as easily observable as 5.0. > unbounded queues in native transport requests lead to node instability > -- > > Key: CASSANDRA-19534 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19534 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Local Write-Read Paths >Reporter: Jon Haddad >Assignee: Alex Petrov >Priority: Normal > Fix For: 5.0-rc, 5.x > > > When a node is under pressure, hundreds of thousands of requests can show up > in the native transport queue, and it looks like it can take way longer to > timeout than is configured. We should be shedding load much more > aggressively and use a bounded queue for incoming work. This is extremely > evident when we combine a resource consuming workload with a smaller one: > Running 5.0 HEAD on a single node as of today: > {noformat} > # populate only > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --maxrlat 100 --populate > 10m --rate 50k -n 1 > # workload 1 - larger reads > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --rate 200 -d 1d > # second workload - small reads > easy-cass-stress run KeyValue -p 1m --rate 20k -r .5 -d 24h{noformat} > It appears our results don't time out at the requested server time either: > > {noformat} > Writes Reads > Deletes Errors > Count Latency (p99) 1min (req/s) | Count Latency (p99) 1min (req/s) | > Count Latency (p99) 1min (req/s) | Count 1min (errors/s) > 950286 70403.93 634.77 | 789524 70442.07 426.02 | > 0 0 0 | 9580484 18980.45 > 952304 70567.62 640.1 | 791072 70634.34 428.36 | > 0 0 0 | 9636658 18969.54 > 953146 70767.34 640.1 | 791400 70767.76 428.36 | > 0 0 0 | 9695272 18969.54 > 956833 71171.28 623.14 | 794009 71175.6 412.79 | > 0 0 0 | 9749377 19002.44 > 959627 71312.58 656.93 | 795703 71349.87 435.56 | > 0 0 0 | 9804907 18943.11{noformat} > > After stopping the load test altogether, it took nearly a minute before the > requests were no longer queued. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19534) unbounded queues in native transport requests lead to node instability
[ https://issues.apache.org/jira/browse/CASSANDRA-19534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838355#comment-17838355 ] Alex Petrov commented on CASSANDRA-19534: - I am a bit surprised to see that on 4.1 we seem to stabilize when errors begin. In essence, the problem is that request lifetime is unbounded. There are several contributing factors, such as lifetimes of local runnables, hints being re-submitted on the local mutation queue, and mutations on the replica side not respecting message expiration deadlines. I think most of these should have been present in 4.1, too. Unless, of course, there is more than one problem. I have initially discovered it pre-5.0 though. > unbounded queues in native transport requests lead to node instability > -- > > Key: CASSANDRA-19534 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19534 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Local Write-Read Paths >Reporter: Jon Haddad >Assignee: Alex Petrov >Priority: Normal > Fix For: 5.0-rc, 5.x > > > When a node is under pressure, hundreds of thousands of requests can show up > in the native transport queue, and it looks like it can take way longer to > timeout than is configured. We should be shedding load much more > aggressively and use a bounded queue for incoming work. This is extremely > evident when we combine a resource consuming workload with a smaller one: > Running 5.0 HEAD on a single node as of today: > {noformat} > # populate only > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --maxrlat 100 --populate > 10m --rate 50k -n 1 > # workload 1 - larger reads > easy-cass-stress run RandomPartitionAccess -p 100 -r 1 > --workload.rows=10 --workload.select=partition --rate 200 -d 1d > # second workload - small reads > easy-cass-stress run KeyValue -p 1m --rate 20k -r .5 -d 24h{noformat} > It appears our results don't time out at the requested server time either: > > {noformat} > Writes Reads > Deletes Errors > Count Latency (p99) 1min (req/s) | Count Latency (p99) 1min (req/s) | > Count Latency (p99) 1min (req/s) | Count 1min (errors/s) > 950286 70403.93 634.77 | 789524 70442.07 426.02 | > 0 0 0 | 9580484 18980.45 > 952304 70567.62 640.1 | 791072 70634.34 428.36 | > 0 0 0 | 9636658 18969.54 > 953146 70767.34 640.1 | 791400 70767.76 428.36 | > 0 0 0 | 9695272 18969.54 > 956833 71171.28 623.14 | 794009 71175.6 412.79 | > 0 0 0 | 9749377 19002.44 > 959627 71312.58 656.93 | 795703 71349.87 435.56 | > 0 0 0 | 9804907 18943.11{noformat} > > After stopping the load test altogether, it took nearly a minute before the > requests were no longer queued. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francisco Guerrero updated CASSANDRASC-117: --- Reviewers: Francisco Guerrero, Yifan Cai (was: Francisco Guerrero, Yifan Cai) Status: Review In Progress (was: Patch Available) > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francisco Guerrero updated CASSANDRASC-117: --- Status: Ready to Commit (was: Review In Progress) > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838351#comment-17838351 ] Francisco Guerrero commented on CASSANDRASC-117: +1 Thanks for the contribution! I will merge on green CI. > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brandon Williams updated CASSANDRA-19565: - Bug Category: Parent values: Correctness(12982) Complexity: Normal Component/s: Packaging Discovered By: User Report Fix Version/s: 5.0.x 5.x Severity: Normal Assignee: Brandon Williams Status: Open (was: Triage Needed) > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug > Components: Packaging >Reporter: Thomas De Keulenaer >Assignee: Brandon Williams >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838303#comment-17838303 ] Ekaterina Dimitrova commented on CASSANDRA-19565: - {quote} perhaps should continue that here {quote} That is what I would expect too > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838298#comment-17838298 ] Brandon Williams edited comment on CASSANDRA-19565 at 4/17/24 5:01 PM: --- If we change the '-M' flag to useradd to '-m' to create the home directory, that leaves us with: {noformat} drwx-- 6 cassandra cassandra 124 Apr 17 16:58 . drwxr-xr-x 1 root root 131 Apr 17 16:58 .. -rw-r--r-- 1 cassandra cassandra 18 Aug 3 2022 .bash_logout -rw-r--r-- 1 cassandra cassandra 141 Aug 3 2022 .bash_profile -rw-r--r-- 1 cassandra cassandra 376 Aug 3 2022 .bashrc drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 commitlog drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 data drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 hints drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 saved_caches {noformat} Even though nothing else _should_ need to read or list this directory, we had more permissive perms before and perhaps should continue that here. WDYT? was (Author: brandon.williams): If we change the '-M' flag to useradd to '-m' to create the home directory, that leaves us with: {noformat} drwx-- 6 cassandra cassandra 124 Apr 17 16:58 . drwxr-xr-x 1 root root 131 Apr 17 16:58 .. -rw-r--r-- 1 cassandra cassandra 18 Aug 3 2022 .bash_logout -rw-r--r-- 1 cassandra cassandra 141 Aug 3 2022 .bash_profile -rw-r--r-- 1 cassandra cassandra 376 Aug 3 2022 .bashrc drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 commitlog drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 data drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 hints drwxr-xr-x 2 cassandra cassandra 6 Apr 17 16:55 saved_caches {noformat} Even though nothing else _should_ need to read or list this directory, we had more perms before and perhaps should continue that here. WDYT? > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838287#comment-17838287 ] Ekaterina Dimitrova commented on CASSANDRA-19565: - {quote}I think updating something as far reaching as JNA in a minor release is a tough sell, especially if we are only gaining cosmetics which we can obviate the need for by changing the ownership in the packaging. {quote} +1 on what [~brandon.williams] said. If we can limit the amount of change to only change of ownership with no side effects, I think the risk updating by exception JNA in a patch release is not justified. It would require also @dev discussion > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19493) Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size
[ https://issues.apache.org/jira/browse/CASSANDRA-19493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838285#comment-17838285 ] Caleb Rackliffe commented on CASSANDRA-19493: - Committed. Thanks for the review! > Optionally fail writes when SAI refuses to index a term value exceeding a > configured maximum size > - > > Key: CASSANDRA-19493 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19493 > Project: Cassandra > Issue Type: Bug > Components: Feature/SAI >Reporter: Caleb Rackliffe >Assignee: Caleb Rackliffe >Priority: Normal > Fix For: 5.0, 5.1 > > Attachments: ci_summary-1.html, ci_summary.html > > Time Spent: 6h 50m > Remaining Estimate: 0h > > SAI currently emits a client warning when we try to index a text value larger > than {{{}cassandra.sai.max_string_term_size{}}}. It might be nice to have a > hard limit as well, above which we can reject the mutation entirely. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19493) Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size
[ https://issues.apache.org/jira/browse/CASSANDRA-19493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Caleb Rackliffe updated CASSANDRA-19493: Fix Version/s: 5.0 5.1 (was: 5.x) (was: 5.0-rc) Since Version: 5.0-alpha1 Source Control Link: https://github.com/apache/cassandra/commit/9bfaee91c4fd7a269e3ff924e8a504bad5d6514a Resolution: Fixed Status: Resolved (was: Ready to Commit) > Optionally fail writes when SAI refuses to index a term value exceeding a > configured maximum size > - > > Key: CASSANDRA-19493 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19493 > Project: Cassandra > Issue Type: Bug > Components: Feature/SAI >Reporter: Caleb Rackliffe >Assignee: Caleb Rackliffe >Priority: Normal > Fix For: 5.0, 5.1 > > Attachments: ci_summary-1.html, ci_summary.html > > Time Spent: 6.5h > Remaining Estimate: 0h > > SAI currently emits a client warning when we try to index a text value larger > than {{{}cassandra.sai.max_string_term_size{}}}. It might be nice to have a > hard limit as well, above which we can reject the mutation entirely. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
(cassandra) branch cassandra-5.0 updated: Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size
This is an automated email from the ASF dual-hosted git repository. maedhroz pushed a commit to branch cassandra-5.0 in repository https://gitbox.apache.org/repos/asf/cassandra.git The following commit(s) were added to refs/heads/cassandra-5.0 by this push: new 9bfaee91c4 Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size 9bfaee91c4 is described below commit 9bfaee91c4fd7a269e3ff924e8a504bad5d6514a Author: Caleb Rackliffe AuthorDate: Tue Apr 9 17:32:56 2024 -0500 Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size patch by Caleb Rackliffe; reviewed by Berenguer Blasi and Stefan Miklosovic for CASSANDRA-19493 --- CHANGES.txt| 1 + NEWS.txt | 1 + conf/cassandra.yaml| 12 + .../config/CassandraRelevantProperties.java| 3 - src/java/org/apache/cassandra/config/Config.java | 11 +- .../apache/cassandra/config/GuardrailsOptions.java | 84 +++ .../cassandra/cql3/statements/BatchStatement.java | 2 +- .../cql3/statements/BatchUpdatesCollector.java | 23 +- .../cassandra/cql3/statements/CQL3CasRequest.java | 2 +- .../cql3/statements/ModificationStatement.java | 2 +- .../statements/SingleTableUpdatesCollector.java| 6 +- .../cql3/statements/UpdatesCollector.java | 3 +- src/java/org/apache/cassandra/db/IMutation.java| 27 +- .../apache/cassandra/db/guardrails/Guardrails.java | 98 +++- .../cassandra/db/guardrails/GuardrailsConfig.java | 54 .../cassandra/db/guardrails/GuardrailsMBean.java | 72 ++ .../cassandra/db/partitions/PartitionUpdate.java | 5 +- .../cassandra/db/virtual/VirtualMutation.java | 3 +- src/java/org/apache/cassandra/index/Index.java | 5 +- .../org/apache/cassandra/index/IndexRegistry.java | 41 ++-- .../cassandra/index/SecondaryIndexManager.java | 7 +- .../cassandra/index/internal/CassandraIndex.java | 4 +- .../cassandra/index/sai/StorageAttachedIndex.java | 77 +++--- .../index/sai/disk/v1/SSTableIndexWriter.java | 2 +- .../index/sai/memory/TrieMemoryIndex.java | 2 +- .../index/sai/memory/VectorMemoryIndex.java| 2 +- .../org/apache/cassandra/index/sasi/SASIIndex.java | 4 +- .../paxos/uncommitted/PaxosUncommittedIndex.java | 4 +- .../cassandra/anttasks/TestNameCheckTask.java | 26 +- .../guardrails/GuardrailColumnValueSizeTest.java | 237 ++ .../guardrails/GuardrailSaiFrozenTermSizeTest.java | 139 +++ .../guardrails/GuardrailSaiStringTermSizeTest.java | 215 .../guardrails/GuardrailSaiVectorTermSizeTest.java | 133 ++ .../db/guardrails/ValueThresholdTester.java| 273 + .../unit/org/apache/cassandra/index/StubIndex.java | 4 +- .../index/internal/CustomCassandraIndex.java | 4 +- .../index/sai/cql/AllTypesSimpleEqTest.java| 9 +- .../index/sai/cql/StorageAttachedIndexDDLTest.java | 47 38 files changed, 1267 insertions(+), 377 deletions(-) diff --git a/CHANGES.txt b/CHANGES.txt index 74d142089c..09c5468db4 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -1,4 +1,5 @@ 5.0-beta2 + * Optionally fail writes when SAI refuses to index a term value exceeding configured term max size (CASSANDRA-19493) * Vector search can restrict on clustering keys when filtering isn't required (CASSANDRA-19544) * Fix FBUtilities' parsing of gcp cos_containerd kernel versions (CASSANDRA-18594) * Clean up KeyRangeIterator classes (CASSANDRA-19428) diff --git a/NEWS.txt b/NEWS.txt index c074867069..1ba8f6639e 100644 --- a/NEWS.txt +++ b/NEWS.txt @@ -126,6 +126,7 @@ New features - Vector dimensions - Whether it is possible to execute secondary index queries without restricting on partition key - Warning and failure thresholds for maximum referenced SAI indexes on a replica when executing a SELECT query + - Warning and failure thresholds for the size of terms written to an SAI index - It is possible to list ephemeral snapshots by nodetool listsnaphots command when flag "-e" is specified. - Added a new flag to `nodetool profileload` and JMX endpoint to set up recurring profile load generation on specified intervals (see CASSANDRA-17821) diff --git a/conf/cassandra.yaml b/conf/cassandra.yaml index 961c607f93..99dd449c84 100644 --- a/conf/cassandra.yaml +++ b/conf/cassandra.yaml @@ -2152,6 +2152,18 @@ drop_compact_storage_enabled: false # before emitting a failure (defaults to -1 to disable) #sai_sstable_indexes_per_query_fail_threshold: -1 +# Guardrail specifying warn/fail thresholds for the size of string terms written to an SAI index +# sai_string_term_size_warn_threshold: 1KiB +# sai_string_term_size_fail_threshold: 8KiB + +# Guardrail specifying warn/fail thresholds for
(cassandra) branch trunk updated (f345370f35 -> c33c8ebab4)
This is an automated email from the ASF dual-hosted git repository. maedhroz pushed a change to branch trunk in repository https://gitbox.apache.org/repos/asf/cassandra.git from f345370f35 Merge branch 'cassandra-5.0' into trunk new 9bfaee91c4 Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size new c33c8ebab4 Merge branch 'cassandra-5.0' into trunk The 2 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference. Summary of changes: CHANGES.txt| 1 + NEWS.txt | 1 + conf/cassandra.yaml| 12 + .../config/CassandraRelevantProperties.java| 3 - src/java/org/apache/cassandra/config/Config.java | 11 +- .../apache/cassandra/config/GuardrailsOptions.java | 84 +++ .../cassandra/cql3/statements/BatchStatement.java | 2 +- .../cql3/statements/BatchUpdatesCollector.java | 23 +- .../cassandra/cql3/statements/CQL3CasRequest.java | 2 +- .../cql3/statements/ModificationStatement.java | 2 +- .../statements/SingleTableUpdatesCollector.java| 6 +- .../cql3/statements/UpdatesCollector.java | 3 +- src/java/org/apache/cassandra/db/IMutation.java| 27 +- .../apache/cassandra/db/guardrails/Guardrails.java | 98 +++- .../cassandra/db/guardrails/GuardrailsConfig.java | 54 .../cassandra/db/guardrails/GuardrailsMBean.java | 72 ++ .../cassandra/db/partitions/PartitionUpdate.java | 5 +- .../cassandra/db/virtual/VirtualMutation.java | 3 +- src/java/org/apache/cassandra/index/Index.java | 5 +- .../org/apache/cassandra/index/IndexRegistry.java | 41 ++-- .../cassandra/index/SecondaryIndexManager.java | 7 +- .../cassandra/index/internal/CassandraIndex.java | 10 +- .../cassandra/index/sai/StorageAttachedIndex.java | 77 +++--- .../index/sai/disk/v1/SSTableIndexWriter.java | 2 +- .../index/sai/memory/TrieMemoryIndex.java | 2 +- .../index/sai/memory/VectorMemoryIndex.java| 2 +- .../org/apache/cassandra/index/sasi/SASIIndex.java | 4 +- .../paxos/uncommitted/PaxosUncommittedIndex.java | 4 +- .../cassandra/anttasks/TestNameCheckTask.java | 26 +- .../guardrails/GuardrailColumnValueSizeTest.java | 237 ++ .../guardrails/GuardrailSaiFrozenTermSizeTest.java | 139 +++ .../guardrails/GuardrailSaiStringTermSizeTest.java | 215 .../guardrails/GuardrailSaiVectorTermSizeTest.java | 133 ++ .../db/guardrails/ValueThresholdTester.java| 273 + .../unit/org/apache/cassandra/index/StubIndex.java | 4 +- .../index/internal/CustomCassandraIndex.java | 4 +- .../index/sai/cql/AllTypesSimpleEqTest.java| 9 +- .../index/sai/cql/StorageAttachedIndexDDLTest.java | 47 38 files changed, 1270 insertions(+), 380 deletions(-) create mode 100644 test/unit/org/apache/cassandra/db/guardrails/GuardrailSaiFrozenTermSizeTest.java create mode 100644 test/unit/org/apache/cassandra/db/guardrails/GuardrailSaiStringTermSizeTest.java create mode 100644 test/unit/org/apache/cassandra/db/guardrails/GuardrailSaiVectorTermSizeTest.java create mode 100644 test/unit/org/apache/cassandra/db/guardrails/ValueThresholdTester.java - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
(cassandra) 01/01: Merge branch 'cassandra-5.0' into trunk
This is an automated email from the ASF dual-hosted git repository. maedhroz pushed a commit to branch trunk in repository https://gitbox.apache.org/repos/asf/cassandra.git commit c33c8ebab444209a9675f273448110afd0787faa Merge: f345370f35 9bfaee91c4 Author: Caleb Rackliffe AuthorDate: Wed Apr 17 11:24:24 2024 -0500 Merge branch 'cassandra-5.0' into trunk * cassandra-5.0: Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size CHANGES.txt| 1 + NEWS.txt | 1 + conf/cassandra.yaml| 12 + .../config/CassandraRelevantProperties.java| 3 - src/java/org/apache/cassandra/config/Config.java | 11 +- .../apache/cassandra/config/GuardrailsOptions.java | 84 +++ .../cassandra/cql3/statements/BatchStatement.java | 2 +- .../cql3/statements/BatchUpdatesCollector.java | 23 +- .../cassandra/cql3/statements/CQL3CasRequest.java | 2 +- .../cql3/statements/ModificationStatement.java | 2 +- .../statements/SingleTableUpdatesCollector.java| 6 +- .../cql3/statements/UpdatesCollector.java | 3 +- src/java/org/apache/cassandra/db/IMutation.java| 27 +- .../apache/cassandra/db/guardrails/Guardrails.java | 98 +++- .../cassandra/db/guardrails/GuardrailsConfig.java | 54 .../cassandra/db/guardrails/GuardrailsMBean.java | 72 ++ .../cassandra/db/partitions/PartitionUpdate.java | 5 +- .../cassandra/db/virtual/VirtualMutation.java | 3 +- src/java/org/apache/cassandra/index/Index.java | 5 +- .../org/apache/cassandra/index/IndexRegistry.java | 41 ++-- .../cassandra/index/SecondaryIndexManager.java | 7 +- .../cassandra/index/internal/CassandraIndex.java | 10 +- .../cassandra/index/sai/StorageAttachedIndex.java | 77 +++--- .../index/sai/disk/v1/SSTableIndexWriter.java | 2 +- .../index/sai/memory/TrieMemoryIndex.java | 2 +- .../index/sai/memory/VectorMemoryIndex.java| 2 +- .../org/apache/cassandra/index/sasi/SASIIndex.java | 4 +- .../paxos/uncommitted/PaxosUncommittedIndex.java | 4 +- .../cassandra/anttasks/TestNameCheckTask.java | 26 +- .../guardrails/GuardrailColumnValueSizeTest.java | 237 ++ .../guardrails/GuardrailSaiFrozenTermSizeTest.java | 139 +++ .../guardrails/GuardrailSaiStringTermSizeTest.java | 215 .../guardrails/GuardrailSaiVectorTermSizeTest.java | 133 ++ .../db/guardrails/ValueThresholdTester.java| 273 + .../unit/org/apache/cassandra/index/StubIndex.java | 4 +- .../index/internal/CustomCassandraIndex.java | 4 +- .../index/sai/cql/AllTypesSimpleEqTest.java| 9 +- .../index/sai/cql/StorageAttachedIndexDDLTest.java | 47 38 files changed, 1270 insertions(+), 380 deletions(-) diff --cc CHANGES.txt index ab257c3baf,09c5468db4..6d9856ae5c --- a/CHANGES.txt +++ b/CHANGES.txt @@@ -1,37 -1,5 +1,38 @@@ -5.0-beta2 +5.1 + * Add new TriggersPolicy configuration to allow operators to disable triggers (CASSANDRA-19532) + * Use Transformation.Kind.id in local and distributed log tables (CASSANDRA-19516) + * Remove period field from ClusterMetadata and metadata log tables (CASSANDRA-19482) + * Enrich system_views.pending_hints vtable with hints sizes (CASSANDRA-19486) + * Expose all dropwizard metrics in virtual tables (CASSANDRA-14572) + * Ensured that PropertyFileSnitchTest do not overwrite cassandra-toploogy.properties (CASSANDRA-19502) + * Add option for MutualTlsAuthenticator to restrict the certificate validity period (CASSANDRA-18951) + * Fix StorageService::constructRangeToEndpointMap for non-distributed keyspaces (CASSANDRA-19255) + * Group nodetool cms commands into single command group (CASSANDRA-19393) + * Register the measurements of the bootstrap process as Dropwizard metrics (CASSANDRA-19447) + * Add LIST SUPERUSERS CQL statement (CASSANDRA-19417) + * Modernize CQLSH datetime conversions (CASSANDRA-18879) + * Harry model and in-JVM tests for partition-restricted 2i queries (CASSANDRA-18275) + * Refactor cqlshmain global constants (CASSANDRA-19201) + * Remove native_transport_port_ssl (CASSANDRA-19397) + * Make nodetool reconfigurecms sync by default and add --cancel to be able to cancel ongoing reconfigurations (CASSANDRA-19216) + * Expose auth mode in system_views.clients, nodetool clientstats, metrics (CASSANDRA-19366) + * Remove sealed_periods and last_sealed_period tables (CASSANDRA-19189) + * Improve setup and initialisation of LocalLog/LogSpec (CASSANDRA-19271) + * Refactor structure of caching metrics and expose auth cache metrics via JMX (CASSANDRA-17062) + * Allow CQL client certificate authentication to work without sending an AUTHENTICATE request (CASSANDRA-18857) + * Extend nodetool tpstats and system_views.thread_pools with de
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838273#comment-17838273 ] Brandon Williams commented on CASSANDRA-19565: -- I think updating something as far reaching as JNA in a minor release is a tough sell, especially if we are only gaining cosmetics which we can obviate the need for by changing the ownership in the packaging. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRASC-117) Record additional metrics for improved observability
[ https://issues.apache.org/jira/browse/CASSANDRASC-117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838267#comment-17838267 ] Saranya Krishnakumar commented on CASSANDRASC-117: -- Latest Green CI [https://app.circleci.com/pipelines/github/sarankk/cassandra-sidecar/277] > Record additional metrics for improved observability > > > Key: CASSANDRASC-117 > URL: https://issues.apache.org/jira/browse/CASSANDRASC-117 > Project: Sidecar for Apache Cassandra > Issue Type: Improvement > Components: Rest API >Reporter: Saranya Krishnakumar >Assignee: Saranya Krishnakumar >Priority: Normal > Labels: pull-request-available > > With this patch increasing the number of metrics captured on sidecar side for > improved observability. This includes, capturing resource level metrics, > Cassandra instance health metrics, SSTable import metrics etc. Also as part > of this patch, migrating existing metrics to Dropwizard registry. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Miklosovic updated CASSANDRA-19401: -- Fix Version/s: 4.0.x > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.0.x, 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 20m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838252#comment-17838252 ] Jon Haddad commented on CASSANDRA-19564: Sadly I don't have one from the previous test, but this issue has come up several times previously so replicating it shouldn't be a problem. I'll take a look when I get more data. Thanks [~benedict]! > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19564) MemtablePostFlush deadlock leads to stuck nodes and crashes
[ https://issues.apache.org/jira/browse/CASSANDRA-19564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838249#comment-17838249 ] Benedict Elliott Smith commented on CASSANDRA-19564: Is the Flush that is blocked the one that the postFlush is waiting on? You can check this from a heap dump. If it is, the question is why the writeBarrier it has issued doesn't complete - any write that is behind such an issued barrier should be clear to complete without blocking. In which case we have perhaps introduced some new blocking mechanism that sits behind the completion of the barrier that depends on the barrier itself finishing. This should also be apparent from a heap dump, from which you can find the OpOrder that haven't completed, and which threads are holding a reference to it and what they are blocking on. > MemtablePostFlush deadlock leads to stuck nodes and crashes > --- > > Key: CASSANDRA-19564 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19564 > Project: Cassandra > Issue Type: Bug > Components: Local/Compaction, Local/Memtable >Reporter: Jon Haddad >Priority: Urgent > Fix For: 4.1.x > > Attachments: image-2024-04-16-11-55-54-750.png, > image-2024-04-16-12-29-15-386.png, image-2024-04-16-13-43-11-064.png, > image-2024-04-16-13-53-24-455.png > > > I've run into an issue on a 4.1.4 cluster where an entire node has locked up > due to what I believe is a deadlock in memtable flushing. Here's what I know > so far. I've stitched together what happened based on conversations, logs, > and some flame graphs. > *Log reports memtable flushing* > The last successful flush happens at 12:19. > {noformat} > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 > AbstractAllocatorMemtable.java:286 - Flushing largest CFS(Keyspace='ks', > ColumnFamily='version') to free up room. Used total: 0.24/0.33, live: > 0.16/0.20, flushing: 0.09/0.13, this: 0.13/0.15 > INFO [NativePoolCleaner] 2024-04-16 12:19:53,634 ColumnFamilyStore.java:1012 > - Enqueuing flush of ks.version, Reason: MEMTABLE_LIMIT, Usage: 660.521MiB > (13%) on-heap, 790.606MiB (15%) off-heap > {noformat} > *MemtablePostFlush appears to be blocked* > At this point, MemtablePostFlush completed tasks stops incrementing, active > stays at 1 and pending starts to rise. > {noformat} > MemtablePostFlush 1 1 3446 0 0 > {noformat} > > The flame graph reveals that PostFlush.call is stuck. I don't have the line > number, but I know we're stuck in > {{org.apache.cassandra.db.ColumnFamilyStore.PostFlush#call}} given the visual > below: > *!image-2024-04-16-13-43-11-064.png!* > *Memtable flushing is now blocked.* > All MemtableFlushWriter threads are Parked waiting on > {{{}OpOrder.Barrier.await{}}}. A wall clock profile of 30s reveals all time > is spent here. Presumably we're waiting on the single threaded Post Flush. > !image-2024-04-16-12-29-15-386.png! > *Memtable allocations start to block* > Eventually it looks like the NativeAllocator stops successfully allocating > memory. I assume it's waiting on memory to be freed, but since memtable > flushes are blocked, we wait indefinitely. > Looking at a wall clock flame graph, all writer threads have reached the > allocation failure path of {{MemtableAllocator.allocate()}}. I believe we're > waiting on {{signal.awaitThrowUncheckedOnInterrupt()}} > {noformat} > MutationStage 48 828425 980253369 0 0{noformat} > !image-2024-04-16-11-55-54-750.png! > > *Compaction Stops* > Since we write to the compaction history table, and that requires memtables, > compactions are now blocked as well. > > !image-2024-04-16-13-53-24-455.png! > > The node is now doing basically nothing and must be restarted. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838244#comment-17838244 ] Brandon Williams commented on CASSANDRA-19401: -- bq. ImportTest.testImportCorruptWithoutValidationWithCopying is just flaky and we discovered it here. If you are saying that failure is unrelated to this ticket, I don't think that's accurate: https://app.circleci.com/pipelines/github/driftx/cassandra/1584/workflows/7797df46-e331-46de-ba5e-885814d8f9a8 > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 10m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19493) Optionally fail writes when SAI refuses to index a term value exceeding a configured maximum size
[ https://issues.apache.org/jira/browse/CASSANDRA-19493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Caleb Rackliffe updated CASSANDRA-19493: Status: Ready to Commit (was: Review In Progress) Alright, I should have this committed today. > Optionally fail writes when SAI refuses to index a term value exceeding a > configured maximum size > - > > Key: CASSANDRA-19493 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19493 > Project: Cassandra > Issue Type: Bug > Components: Feature/SAI >Reporter: Caleb Rackliffe >Assignee: Caleb Rackliffe >Priority: Normal > Fix For: 5.0-rc, 5.x > > Attachments: ci_summary-1.html, ci_summary.html > > Time Spent: 6h 10m > Remaining Estimate: 0h > > SAI currently emits a client warning when we try to index a text value larger > than {{{}cassandra.sai.max_string_term_size{}}}. It might be nice to have a > hard limit as well, above which we can reject the mutation entirely. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-14572) Expose all table metrics in virtual table
[ https://issues.apache.org/jira/browse/CASSANDRA-14572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838242#comment-17838242 ] Brandon Williams commented on CASSANDRA-14572: -- I too see the OOM if I loop 'ant test-jvm-dtest-some -Dtest.name=org.apache.cassandra.distributed.test.ReadRepairTest' a few times. > Expose all table metrics in virtual table > - > > Key: CASSANDRA-14572 > URL: https://issues.apache.org/jira/browse/CASSANDRA-14572 > Project: Cassandra > Issue Type: New Feature > Components: Legacy/Observability, Observability/Metrics >Reporter: Chris Lohfink >Assignee: Maxim Muzafarov >Priority: Low > Labels: virtual-tables > Fix For: 5.1 > > Attachments: flight_recording_1270017199_13.jfr, keyspayces_group > responses times.png, keyspayces_group summary.png, select keyspaces_group by > string prefix.png, select keyspaces_group compare with wo.png, select > keyspaces_group without value.png, systemv_views.metrics_dropped_message.png, > thread_pools benchmark.png > > Time Spent: 7h 50m > Remaining Estimate: 0h > > While we want a number of virtual tables to display data in a way thats great > and intuitive like in nodetool. There is also much for being able to expose > the metrics we have for tooling via CQL instead of JMX. This is more for the > tooling and adhoc advanced users who know exactly what they are looking for. > *Schema:* > Initial idea is to expose data via {{((keyspace, table), metric)}} with a > column for each metric value. Could also use a Map or UDT instead of the > column based that can be a bit more specific to each metric type. To that end > there can be a {{metric_type}} column and then a UDT for each metric type > filled in, or a single value with more of a Map style. I am > purposing the column type though as with {{ALLOW FILTERING}} it does allow > more extensive query capabilities. > *Implementations:* > * Use reflection to grab all the metrics from TableMetrics (see: > CASSANDRA-7622 impl). This is easiest and least abrasive towards new metric > implementors... but its reflection and a kinda a bad idea. > * Add a hook in TableMetrics to register with this virtual table when > registering > * Pull from the CassandraMetrics registery (either reporter or iterate > through metrics query on read of virtual table) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19498) Error reading data from credential file
[ https://issues.apache.org/jira/browse/CASSANDRA-19498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17837480#comment-17837480 ] Brad Schoening edited comment on CASSANDRA-19498 at 4/17/24 3:05 PM: - [~slavavrn] [~smiklosovic] I'm having a problem running the new pytest introduced here: {code:java} $ pytest test/test_legacy_auth.py > from bin.cqlsh import read_options as cqlsh_read_options E ModuleNotFoundError: No module named 'bin' {code} I was able to build a 4.1 branch, but encountered this issue. The 'bin' directory is not a Python package (_{_}init{_}_.py) and none of the other pytests import from bin. Short of moving read_options() into pylib, it may not be wise to introduce this dependency. was (Author: bschoeni): [~slavavrn] [~smiklosovic] I'm having a problem running the new pytest introduced here: {code:java} $ pytest test/test_legacy_auth.py > from bin.cqlsh import read_options as cqlsh_read_options E ModuleNotFoundError: No module named 'bin' {code} I was able to build a 4.1 branch, but encountered this issue. The 'bin' directory is not a Python package (__inii__.py) and none of the other pytests import from bin. Short of moving read_options() into pylib, it may not be wise to introduce this dependency. > Error reading data from credential file > --- > > Key: CASSANDRA-19498 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19498 > Project: Cassandra > Issue Type: Bug > Components: Documentation, Tool/cqlsh >Reporter: Slava >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > The pylib/cqlshlib/cqlshmain.py code reads data from the credentials file, > however, it is immediately ignored. > https://github.com/apache/cassandra/blob/c9625e0102dab66f41d3ef2338c54d499e73a8c5/pylib/cqlshlib/cqlshmain.py#L2070 > {code:java} > if not options.username: > credentials = configparser.ConfigParser() > if options.credentials is not None: > credentials.read(options.credentials) # use the username > from credentials file but fallback to cqlshrc if username is absent from the > command line parameters > options.username = username_from_cqlshrc if not options.password: > rawcredentials = configparser.RawConfigParser() > if options.credentials is not None: > rawcredentials.read(options.credentials) # handling > password in the same way as username, priority cli > credentials > cqlshrc > options.password = option_with_default(rawcredentials.get, > 'plain_text_auth', 'password', password_from_cqlshrc) > options.password = password_from_cqlshrc{code} > These corrections have been made in accordance with > https://issues.apache.org/jira/browse/CASSANDRA-16983 and > https://issues.apache.org/jira/browse/CASSANDRA-16456. > The documentation does not indicate that AuthProviders can be used in the > cqlshrc and credentials files. > I propose to return the ability to use the legacy option of specifying the > user and password in the credentials file in the [plain_text_auth] section. > It is also required to describe the rules for using the credentials file in > the documentation. > I can make a corresponding pull request. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brandon Williams updated CASSANDRA-19401: - Reviewers: Brandon Williams > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 10m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838237#comment-17838237 ] Stefan Miklosovic commented on CASSANDRA-19401: --- for reviewers: I did it in such a way that only nodetool import will behave like it does not skip when the dir structure does not match. The listing is done elsewhere too and I have not touched that. It behaves as it was. > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 10m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838227#comment-17838227 ] Bowen Song commented on CASSANDRA-19566: That's great! Thank you, Stefan. > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Core, Legacy/CQL >Reporter: Bowen Song >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838225#comment-17838225 ] Stefan Miklosovic commented on CASSANDRA-19401: --- 5.0 looks fine too. cqlsh_dtests failed on some circle / github networking issue. ImportTest.testImportCorruptWithoutValidationWithCopying is just flaky and we discovered it here. [CASSANDRA-19401-5.0|https://github.com/instaclustr/cassandra/tree/CASSANDRA-19401-5.0] {noformat} java17_pre-commit_tests ✓ j17_build3m 55s ✓ j17_cqlsh_dtests_py311 6m 18s ✓ j17_cqlsh_dtests_py311_vnode 6m 6s ✓ j17_cqlsh_dtests_py38 6m 9s ✓ j17_cqlshlib_cython_tests7m 47s ✓ j17_cqlshlib_tests 6m 34s ✓ j17_dtests 33m 27s ✓ j17_dtests_latest 32m 34s ✓ j17_dtests_vnode33m 32s ✓ j17_jvm_dtests 17m 57s ✓ j17_jvm_dtests_latest_vnode 16m 3s ✓ j17_unit_tests 16m 12s ✓ j17_utests_latest 14m 15s ✓ j17_utests_latest_repeat 6m 0s ✓ j17_utests_oa 14m 36s ✓ j17_utests_oa_repeat 6m 3s ✕ j17_cqlsh_dtests_py38_vnode 6m 31s ✕ j17_unit_tests_repeat 12m 43s org.apache.cassandra.db.ImportTest testImportCorruptWithoutValidationWithCopying org.apache.cassandra.db.ImportTest testImportCorruptWithoutValidationWithCopying {noformat} [java17_pre-commit_tests|https://app.circleci.com/pipelines/github/instaclustr/cassandra/4189/workflows/06a9fd1f-a7c6-4be4-bb45-ad09f1bc6e79] > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 10m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) ---
[jira] [Comment Edited] (CASSANDRA-19498) Error reading data from credential file
[ https://issues.apache.org/jira/browse/CASSANDRA-19498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17837480#comment-17837480 ] Brad Schoening edited comment on CASSANDRA-19498 at 4/17/24 2:37 PM: - [~slavavrn] [~smiklosovic] I'm having a problem running the new pytest introduced here: {code:java} $ pytest test/test_legacy_auth.py > from bin.cqlsh import read_options as cqlsh_read_options E ModuleNotFoundError: No module named 'bin' {code} I was able to build a 4.1 branch, but encountered this issue. The 'bin' directory is not a Python package (__inii__.py) and none of the other pytests import from bin. Short of moving read_options() into pylib, it may not be wise to introduce this dependency. was (Author: bschoeni): [~slavavrn] I'm having a problem running the pytest: {code:java} $ pytest test/test_legacy_auth.py > from bin.cqlsh import read_options as cqlsh_read_options E ModuleNotFoundError: No module named 'bin' {code} > Error reading data from credential file > --- > > Key: CASSANDRA-19498 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19498 > Project: Cassandra > Issue Type: Bug > Components: Documentation, Tool/cqlsh >Reporter: Slava >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > The pylib/cqlshlib/cqlshmain.py code reads data from the credentials file, > however, it is immediately ignored. > https://github.com/apache/cassandra/blob/c9625e0102dab66f41d3ef2338c54d499e73a8c5/pylib/cqlshlib/cqlshmain.py#L2070 > {code:java} > if not options.username: > credentials = configparser.ConfigParser() > if options.credentials is not None: > credentials.read(options.credentials) # use the username > from credentials file but fallback to cqlshrc if username is absent from the > command line parameters > options.username = username_from_cqlshrc if not options.password: > rawcredentials = configparser.RawConfigParser() > if options.credentials is not None: > rawcredentials.read(options.credentials) # handling > password in the same way as username, priority cli > credentials > cqlshrc > options.password = option_with_default(rawcredentials.get, > 'plain_text_auth', 'password', password_from_cqlshrc) > options.password = password_from_cqlshrc{code} > These corrections have been made in accordance with > https://issues.apache.org/jira/browse/CASSANDRA-16983 and > https://issues.apache.org/jira/browse/CASSANDRA-16456. > The documentation does not indicate that AuthProviders can be used in the > cqlshrc and credentials files. > I propose to return the ability to use the legacy option of specifying the > user and password in the credentials file in the [plain_text_auth] section. > It is also required to describe the rules for using the credentials file in > the documentation. > I can make a corresponding pull request. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19221) CMS: Nodes can restart with new ipaddress already defined in the cluster
[ https://issues.apache.org/jira/browse/CASSANDRA-19221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Petrov updated CASSANDRA-19221: Test and Documentation Plan: Includes a test Status: Patch Available (was: Open) > CMS: Nodes can restart with new ipaddress already defined in the cluster > > > Key: CASSANDRA-19221 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19221 > Project: Cassandra > Issue Type: Bug > Components: Transactional Cluster Metadata >Reporter: Paul Chandler >Assignee: Alex Petrov >Priority: Normal > Fix For: 5.1-alpha1 > > Attachments: ci_summary.html > > > I am simulating running a cluster in Kubernetes and testing what happens when > several pods go down and ip addresses are swapped between nodes. In 4.0 this > is blocked and the node cannot be restarted. > To simulate this I create a 3 node cluster on a local machine using 3 > loopback addresses > {code} > 127.0.0.1 > 127.0.0.2 > 127.0.0.3 > {code} > The nodes are created correctly and the first node is assigned as a CMS node > as shown: > {code} > bin/nodetool -p 7199 describecms > {code} > Cluster Metadata Service: > {code} > Members: /127.0.0.1:7000 > Is Member: true > Service State: LOCAL > {code} > At this point I bring down the nodes 127.0.0.2 and 127.0.0.3 and swap the ip > addresses for the rpc_address and listen_address > > The nodes come back as normal, but the nodeid has now been swapped against > the ip address: > Before: > {code} > Datacenter: datacenter1 > === > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 127.0.0.3 75.2 KiB 16 76.0% > 6d194555-f6eb-41d0-c000-0003 rack1 > UN 127.0.0.2 86.77 KiB 16 59.3% > 6d194555-f6eb-41d0-c000-0002 rack1 > UN 127.0.0.1 80.88 KiB 16 64.7% > 6d194555-f6eb-41d0-c000-0001 rack1 > {code} > After: > {code} > Datacenter: datacenter1 > === > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 127.0.0.3 149.62 KiB 16 76.0% > 6d194555-f6eb-41d0-c000-0003 rack1 > UN 127.0.0.2 155.48 KiB 16 59.3% > 6d194555-f6eb-41d0-c000-0002 rack1 > UN 127.0.0.1 75.74 KiB 16 64.7% > 6d194555-f6eb-41d0-c000-0001 rack1 > {code} > On previous tests of this I have created a table with a replication factor of > 1, inserted some data before the swap. After the swap the data on nodes 2 > and 3 is now missing. > One theory I have is that I am using different port numbers for the different > nodes, and I am only swapping the ip addresses and not the port numbers, so > the ip:port still looks unique > i.e. 127.0.0.2:9043 becomes 127.0.0.2:9044 > and 127.0.0.3:9044 becomes 127.0.0.3:9043 > -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19221) CMS: Nodes can restart with new ipaddress already defined in the cluster
[ https://issues.apache.org/jira/browse/CASSANDRA-19221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Petrov updated CASSANDRA-19221: Attachment: ci_summary.html > CMS: Nodes can restart with new ipaddress already defined in the cluster > > > Key: CASSANDRA-19221 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19221 > Project: Cassandra > Issue Type: Bug > Components: Transactional Cluster Metadata >Reporter: Paul Chandler >Assignee: Alex Petrov >Priority: Normal > Fix For: 5.1-alpha1 > > Attachments: ci_summary.html > > > I am simulating running a cluster in Kubernetes and testing what happens when > several pods go down and ip addresses are swapped between nodes. In 4.0 this > is blocked and the node cannot be restarted. > To simulate this I create a 3 node cluster on a local machine using 3 > loopback addresses > {code} > 127.0.0.1 > 127.0.0.2 > 127.0.0.3 > {code} > The nodes are created correctly and the first node is assigned as a CMS node > as shown: > {code} > bin/nodetool -p 7199 describecms > {code} > Cluster Metadata Service: > {code} > Members: /127.0.0.1:7000 > Is Member: true > Service State: LOCAL > {code} > At this point I bring down the nodes 127.0.0.2 and 127.0.0.3 and swap the ip > addresses for the rpc_address and listen_address > > The nodes come back as normal, but the nodeid has now been swapped against > the ip address: > Before: > {code} > Datacenter: datacenter1 > === > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 127.0.0.3 75.2 KiB 16 76.0% > 6d194555-f6eb-41d0-c000-0003 rack1 > UN 127.0.0.2 86.77 KiB 16 59.3% > 6d194555-f6eb-41d0-c000-0002 rack1 > UN 127.0.0.1 80.88 KiB 16 64.7% > 6d194555-f6eb-41d0-c000-0001 rack1 > {code} > After: > {code} > Datacenter: datacenter1 > === > Status=Up/Down > |/ State=Normal/Leaving/Joining/Moving > -- Address Load Tokens Owns (effective) Host ID > Rack > UN 127.0.0.3 149.62 KiB 16 76.0% > 6d194555-f6eb-41d0-c000-0003 rack1 > UN 127.0.0.2 155.48 KiB 16 59.3% > 6d194555-f6eb-41d0-c000-0002 rack1 > UN 127.0.0.1 75.74 KiB 16 64.7% > 6d194555-f6eb-41d0-c000-0001 rack1 > {code} > On previous tests of this I have created a table with a replication factor of > 1, inserted some data before the swap. After the swap the data on nodes 2 > and 3 is now missing. > One theory I have is that I am using different port numbers for the different > nodes, and I am only swapping the ip addresses and not the port numbers, so > the ip:port still looks unique > i.e. 127.0.0.2:9043 becomes 127.0.0.2:9044 > and 127.0.0.3:9044 becomes 127.0.0.3:9043 > -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838215#comment-17838215 ] Stefan Miklosovic commented on CASSANDRA-19566: --- It prints it via same means as a raw timestamp is. So if timestamp covers your edge case, tojson will too. If it does not, it wont. > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Core, Legacy/CQL >Reporter: Bowen Song >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838209#comment-17838209 ] Bowen Song commented on CASSANDRA-19566: Does it work with more extreme cases? E.g. something like 0001-01-01, -01-01 or even 500 BC These may not have practical use cases (or do they?), but for the sake of correctness, as long as the value is accepted by Cassandra, the JSON and non-JSON value should always match. > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Core, Legacy/CQL >Reporter: Bowen Song >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic edited comment on CASSANDRA-19566 at 4/17/24 1:53 PM: It does not. I wonder what the reason might be. {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} Going into negative values before Jan 1 1970 works {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. edit: so, it starts to happen between {code:java} cqlsh:test> select id, tounixtimestamp(ts), ts, tojson(ts) from tbl where id in (13,16, 15, 17, 18); id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-07-30 22:40:00.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-09-26 19:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" {code} 29th October 1582 and 26th September 1582. In 1582, there was Gregorian calendar reform introduced: Date Adjustment: To realign the date of the vernal equinox to March 21st (which was the date of the equinox at the time of the First Council of Nicaea in AD 325 and critical for determining the date of Easter), Thursday, October 4, 1582, was followed by Friday, October 15, 1582. This correction effectively "skipped" 10 days in the calendar. There is indeed drift of 10 days. So what happens here is that when we go to negative values, whatever we use for date parsing does not correctly parse negative values when Gregorian calendar reform was introduced. was (Author: smiklosovic): It does not. I wonder what the reason might be. {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} Going into negative values before Jan 1 1970 works {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. edit: so, it starts to happen between {code:java} cqlsh:test> select id, tounixtimestamp(ts), ts, tojson(ts) from tbl where id in (13,16, 15, 17, 18); id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-07-30 22:40:00.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-09-26 19:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" {code} 29th October 1582 and 26th September 1582. In 1582, there was Gregorian calendar introduced: Date Adjustment: To realign the date of the vernal equinox to March 21st (which was the date of the equinox at the time of the First Council of Nicaea in AD 325 and critical for determining the date of Easter), Thursday, October 4, 1582, was followed by Friday, October 1
[jira] [Updated] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Miklosovic updated CASSANDRA-19566: -- Test and Documentation Plan: CI Status: Patch Available (was: In Progress) this solves it for 5.0 https://github.com/apache/cassandra/pull/3262 I ll run a build to be sure all plays fine. {code} cqlsh> select id, tounixtimestamp(ts), ts, tojson(ts) from test.tbl; id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 5 |-125900 | 1571-01-15 09:46:40.00+ | "1571-01-15 09:46:40.000Z" 10 |-125000 | 1573-11-22 01:46:40.00+ | "1573-11-22 01:46:40.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 11 |-124000 | 1577-01-22 11:33:20.00+ | "1577-01-22 11:33:20.000Z" 1 |-125500 | 1572-04-22 08:53:20.00+ | "1572-04-22 08:53:20.000Z" 8 |-125200 | 1573-04-04 14:13:20.00+ | "1573-04-04 14:13:20.000Z" 2 |-125600 | 1571-12-28 15:06:40.00+ | "1571-12-28 15:06:40.000Z" 4 |-125800 | 1571-05-11 03:33:20.00+ | "1571-05-11 03:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-08-09 22:40:00.000Z" 20 | 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" 7 |-125300 | 1572-12-09 20:26:40.00+ | "1572-12-09 20:26:40.000Z" 6 |-125400 | 1572-08-16 02:40:00.00+ | "1572-08-16 02:40:00.000Z" 9 |-125100 | 1573-07-29 08:00:00.00+ | "1573-07-29 08:00:00.000Z" 14 |-122500 | 1581-10-24 14:13:20.00+ | "1581-10-24 14:13:20.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-10-06 19:33:20.000Z" 12 |-123000 | 1580-03-24 21:20:00.00+ | "1580-03-24 21:20:00.000Z" 3 |-125700 | 1571-09-03 21:20:00.00+ | "1571-09-03 21:20:00.000Z" {code} > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Core, Legacy/CQL >Reporter: Bowen Song >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandr
[jira] [Updated] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Miklosovic updated CASSANDRA-19566: -- Bug Category: Parent values: Correctness(12982) Complexity: Normal Component/s: Legacy/Core Legacy/CQL Discovered By: Adhoc Test Fix Version/s: 5.0.x 5.x Severity: Low Assignee: Stefan Miklosovic Status: Open (was: Triage Needed) > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug > Components: Legacy/Core, Legacy/CQL >Reporter: Bowen Song >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 5.0.x, 5.x > > Time Spent: 10m > Remaining Estimate: 0h > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic edited comment on CASSANDRA-19566 at 4/17/24 1:46 PM: It does not. I wonder what the reason might be. {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} Going into negative values before Jan 1 1970 works {code:java} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. edit: so, it starts to happen between {code:java} cqlsh:test> select id, tounixtimestamp(ts), ts, tojson(ts) from tbl where id in (13,16, 15, 17, 18); id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-07-30 22:40:00.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-09-26 19:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" {code} 29th October 1582 and 26th September 1582. In 1582, there was Gregorian calendar introduced: Date Adjustment: To realign the date of the vernal equinox to March 21st (which was the date of the equinox at the time of the First Council of Nicaea in AD 325 and critical for determining the date of Easter), Thursday, October 4, 1582, was followed by Friday, October 15, 1582. This correction effectively "skipped" 10 days in the calendar. There is indeed drift of 10 days. So what happens here is that when we go to negative values, whatever we use for date parsing does not correctly parse negative values when Gregorian calendar was introduced. was (Author: smiklosovic): It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. Going into negative values before Jan 1 1970 works {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. edit: so, it starts to happen between {code} cqlsh:test> select id, tounixtimestamp(ts), ts, tojson(ts) from tbl where id in (13,16, 15, 17, 18); id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-07-30 22:40:00.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-09-26 19:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" {code} 29th October 1582 and 26th September 1582. In 1582, there was Gregorian calendar introduced: Date Adjustment: To realign the date of the vernal equinox to March 21st (which was the date of the equinox at the time of the First Council of Nicaea in AD 325 and critical for determining the date of Easter), Thursday,
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838178#comment-17838178 ] Berenguer Blasi commented on CASSANDRA-19565: - ^I checked code to some extent and things look the same to me C* wise. I agree jna versions is probably the difference hence why updating seems like a good idea besides the cosmetics of the failure. Having said that jna is always a hairy subject so if we'd prefer to not update it I'd be fine as well with that. I'm on the fence on this one, though I'd personally update it. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-14572) Expose all table metrics in virtual table
[ https://issues.apache.org/jira/browse/CASSANDRA-14572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838166#comment-17838166 ] Maxim Muzafarov commented on CASSANDRA-14572: - [~maedhroz] Do you have a dump or any CI result links? The patch might increase the heap usage a bit. While trying to keep the memory footprint low, it was necessary to add the Map> to keep track of metric alias names (they usually appear when a metric is renamed) and use them to deregister the metrics when it's needed (e.g. a keyspace drop). Due to the number of metrics, it may take some extra heap space, but shouldn't be too much. My calculations and tests that I'd made showed that for each metric we need ~ 10x8bytes for refs + some space for extra strings that survive the eden gc ( e.g. ~10 symbols string 2 byte each x 6 fields) = 200 bytes for each metric. Considering 77k metrics in my tests above it will be ~ 15Mb extra heap space (a JFR is attached in [the comment above|https://issues.apache.org/jira/browse/CASSANDRA-14572?focusedCommentId=17809602&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17809602] for the reference). I might be miscalculated something, of course. We can create a new issue to fix that, and I'll try to find a solution for that. As for the feature flag, I don't think it's necessary since the flag can't help the way we handle metrics internally, plus a new keyspace and virtual tables stand apart the core components. What we can do here in addition is to note in the documentation that these new tables are experimental in the next upcoming release. > Expose all table metrics in virtual table > - > > Key: CASSANDRA-14572 > URL: https://issues.apache.org/jira/browse/CASSANDRA-14572 > Project: Cassandra > Issue Type: New Feature > Components: Legacy/Observability, Observability/Metrics >Reporter: Chris Lohfink >Assignee: Maxim Muzafarov >Priority: Low > Labels: virtual-tables > Fix For: 5.1 > > Attachments: flight_recording_1270017199_13.jfr, keyspayces_group > responses times.png, keyspayces_group summary.png, select keyspaces_group by > string prefix.png, select keyspaces_group compare with wo.png, select > keyspaces_group without value.png, systemv_views.metrics_dropped_message.png, > thread_pools benchmark.png > > Time Spent: 7h 50m > Remaining Estimate: 0h > > While we want a number of virtual tables to display data in a way thats great > and intuitive like in nodetool. There is also much for being able to expose > the metrics we have for tooling via CQL instead of JMX. This is more for the > tooling and adhoc advanced users who know exactly what they are looking for. > *Schema:* > Initial idea is to expose data via {{((keyspace, table), metric)}} with a > column for each metric value. Could also use a Map or UDT instead of the > column based that can be a bit more specific to each metric type. To that end > there can be a {{metric_type}} column and then a UDT for each metric type > filled in, or a single value with more of a Map style. I am > purposing the column type though as with {{ALLOW FILTERING}} it does allow > more extensive query capabilities. > *Implementations:* > * Use reflection to grab all the metrics from TableMetrics (see: > CASSANDRA-7622 impl). This is easiest and least abrasive towards new metric > implementors... but its reflection and a kinda a bad idea. > * Add a hook in TableMetrics to register with this virtual table when > registering > * Pull from the CassandraMetrics registery (either reporter or iterate > through metrics query on read of virtual table) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic edited comment on CASSANDRA-19566 at 4/17/24 12:46 PM: - It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. Going into negative values before Jan 1 1970 works {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. edit: so, it starts to happen between {code} cqlsh:test> select id, tounixtimestamp(ts), ts, tojson(ts) from tbl where id in (13,16, 15, 17, 18); id | system.tounixtimestamp(ts) | ts | system.tojson(ts) ++-+ 13 |-122000 | 1583-05-26 07:06:40.00+ | "1583-05-26 07:06:40.000Z" 15 |-122250 | 1582-08-09 22:40:00.00+ | "1582-07-30 22:40:00.000Z" 16 |-122150 | 1582-12-03 16:26:40.00+ | "1582-12-03 16:26:40.000Z" 17 |-122200 | 1582-10-06 19:33:20.00+ | "1582-09-26 19:33:20.000Z" 18 |-122180 | 1582-10-29 23:06:40.00+ | "1582-10-29 23:06:40.000Z" {code} 29th October 1582 and 26th September 1582. In 1582, there was Gregorian calendar introduced: Date Adjustment: To realign the date of the vernal equinox to March 21st (which was the date of the equinox at the time of the First Council of Nicaea in AD 325 and critical for determining the date of Easter), Thursday, October 4, 1582, was followed by Friday, October 15, 1582. This correction effectively "skipped" 10 days in the calendar. There is indeed drift of 10 days. So what happens here is that when we go to negative values, whatever we use for date parsing does not correctly parse negative values when Gregorian calendar was introduced. was (Author: smiklosovic): It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. Going into negative values before Jan 1 1970 works {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug >Reporter: Bowen Song >Priority: Normal > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create tab
[jira] [Comment Edited] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic edited comment on CASSANDRA-19566 at 4/17/24 12:28 PM: - It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. Going into negative values before Jan 1 1970 works {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=2; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1000 | 1969-12-31 23:59:59.00+ | "1969-12-31 23:59:59.000Z" {code} but, obviously, for some big negative number is starts to break. was (Author: smiklosovic): It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug >Reporter: Bowen Song >Priority: Normal > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic commented on CASSANDRA-19566: --- It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug >Reporter: Bowen Song >Priority: Normal > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838148#comment-17838148 ] Thomas De Keulenaer commented on CASSANDRA-19565: - The [article|https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html] gives a very detailed explaination: {quote} The usual solution seems to be to write the code into a file and then use something like mmap() to load it again into read-only-but-executable pages. In order to do this we need some temporary space that isn’t mounted noexec. [...] That tells us that libffi will sometimes create temporary executable files, and it will always create them when running under SELinux. It’s important to note that this is completely independent of the fact that JNA creates temporary executable files: libffi is a language-independent library for calling foreign functions so it doesn’t know anything about Java and therefore doesn’t have access to the Java system properties java.io.tmpdir and jna.tmpdir which control where JNA does its work. Instead, on Linux libffi tries to create its temporary executable files in various places in the following order of preference: * $LIBFFI_TMPDIR * $TMPDIR * /tmp * /var/tmp * /dev/shm * $HOME {quote} But I admit, it is also unclear to me why this is only an issue in Cassandra 4.1.4 and not on 4.0.12. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838156#comment-17838156 ] Brandon Williams edited comment on CASSANDRA-19565 at 4/17/24 12:26 PM: bq. But I admit, it is also unclear to me why this is only an issue in Cassandra 4.1.4 and not on 4.0.12. I think the difference must be in the jna versions. I've seen noexec on tmp many times, and the solution has always been to redefine java.io.tmpdir to a more forgiving path, but it seems newer jna will try to write other locations too (and then segfault if they don't work.) In any case, if fixing the home dir perms solves this I'm not opposed to that; even though we haven't needed it in the past it's the most correct thing to do. was (Author: brandon.williams): bq. But I admit, it is also unclear to me why this is only an issue in Cassandra 4.1.4 and not on 4.0.12. I think the difference must be in the jna versions. I've seen noexec on tmp many times, and the solution has always been to redefine java.io.tmpdir to a more forgiving path, but it seems newer jna will try to write other locations too (and then segfault if they don't work.) > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838156#comment-17838156 ] Brandon Williams commented on CASSANDRA-19565: -- bq. But I admit, it is also unclear to me why this is only an issue in Cassandra 4.1.4 and not on 4.0.12. I think the difference must be in the jna versions. I've seen noexec on tmp many times, and the solution has always been to redefine java.io.tmpdir to a more forgiving path, but it seems newer jna will try to write other locations too (and then segfault if they don't work.) > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838154#comment-17838154 ] Stefan Miklosovic edited comment on CASSANDRA-19566 at 4/17/24 12:24 PM: - It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} I wonder what sense it makes to enable timestamp to be a negative value. was (Author: smiklosovic): It does not. I wonder what the reason might be. {code} cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" 1376701920 | 2406-04-05 12:00:00.00+ | "2406-04-05 12:00:00.000Z" {code} > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug >Reporter: Bowen Song >Priority: Normal > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838137#comment-17838137 ] Berenguer Blasi edited comment on CASSANDRA-19565 at 4/17/24 12:10 PM: --- {quote}Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception{quote} Correct. But it's better than just barfing imo. {quote}do we know why this is?{quote} That's the tricky bit. Getting to the bottom of it but without a repro it may be really really hard. So far I have only seen mapping of C functions. I haven't found anything suggesting this might be linked to permissions or folders like the ES case. Maybe if some gclib is not reachable or... I'm still pulling the thread. was (Author: bereng): {quote}Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception{quote} Correct. But it's better than just barfing imo. {quote}do we know why this is?{quote} That's the tricky bit. Getting to the bottom of it but without a repro it may be really really hard. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838136#comment-17838136 ] Brandon Williams commented on CASSANDRA-19565: -- I'm not opposed to changing the home dir owner/perms, but bq. Setting the cassandra user's home directory (/var/lib/cassandra) to read+write+executable and owned by cassandra, solves the issue. do we know why this is? The owner/perms has been this way a long time, is your TMPDIR set to the home dir? > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
[ https://issues.apache.org/jira/browse/CASSANDRA-19566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838150#comment-17838150 ] Stefan Miklosovic commented on CASSANDRA-19566: --- Does it also happen when ts is not negative value? > JSON encoded timestamp value does not always match non-JSON encoded value > - > > Key: CASSANDRA-19566 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 > Project: Cassandra > Issue Type: Bug >Reporter: Bowen Song >Priority: Normal > > Description: > "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different > date than "SELECT ..." for some timestamp type values. > > Steps to reproduce: > {code:java} > $ sudo docker pull cassandra:4.1.4 > $ sudo docker create --name cass cassandra:4.1.4 > $ sudo docker start cass > $ # wait for the Cassandra instance becomes ready > $ sudo docker exec -ti cass cqlsh > Connected to Test Cluster at 127.0.0.1:9042 > [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] > Use HELP for help. > cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', > 'replication_factor': 1}; > cqlsh> use test; > cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); > cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); > cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; > system.tounixtimestamp(ts) | ts | > system.tojson(ts) > +-+ > -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 > 12:00:00.000Z" > (1 rows) > cqlsh:test> select json * from tbl where id=1; > [json] > - > {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} > (1 rows) > {code} > > Expected behaviour: > The "select ts", "select tojson(ts)" and "select json *" should all produce > the same date. > > Actual behaviour: > The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" > and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838133#comment-17838133 ] Brandon Williams commented on CASSANDRA-19565: -- Hmm, that was added as part of CASSANDRA-17470. I suspect the config/noreplace is what is preventing it, and I think that was chosen to not modify any existing installs that may have changed it, but it's still curious how there's no effect on new installations. Let me run some tests and see what's going on. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838127#comment-17838127 ] Thomas De Keulenaer edited comment on CASSANDRA-19565 at 4/17/24 11:50 AM: --- bq. %attr(750,%{username},%{username}) %config(noreplace) /var/lib/%{username}/* The {quote}/*{quote} at the end of line 166 is the culprit was (Author: twdkeule): strange, i would expect this line to fix that: bq. %attr(750,%{username},%{username}) %config(noreplace) /var/lib/%{username}/* > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838127#comment-17838127 ] Thomas De Keulenaer commented on CASSANDRA-19565: - strange, i would expect this line to fix that: bq. %attr(750,%{username},%{username}) %config(noreplace) /var/lib/%{username}/* > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838137#comment-17838137 ] Berenguer Blasi edited comment on CASSANDRA-19565 at 4/17/24 12:06 PM: --- {quote}Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception{quote} Correct. But it's better than just barfing imo. {quote}do we know why this is?{quote} That's the tricky bit. Getting to the bottom of it but without a repro it may be really really hard. was (Author: bereng): {quote}Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception{quote} Correct. But it's better than just barfing imo. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838137#comment-17838137 ] Berenguer Blasi commented on CASSANDRA-19565: - {quote}Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception{quote} Correct. But it's better than just barfing imo. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838125#comment-17838125 ] Thomas De Keulenaer commented on CASSANDRA-19565: - bq. drwxr-xr-x 6 root root 68 Apr 17 11:13 . This should be owned by cassandra:cassandra, not root:root. If root owns the HOME dir, the program cannot write to it. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838135#comment-17838135 ] Norbert Schultz commented on CASSANDRA-19401: - Hi [~smiklosovic] I tested your patch it it works for the nodetool import use case :) Thanks alot! > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h 10m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838134#comment-17838134 ] Brandon Williams commented on CASSANDRA-19565: -- bq. at the end of line 166 is the culprit Ah, that is because we were only changing the subdirs to not be world-writable, we only changed the umask of what was there before. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-18050) Update JNA
[ https://issues.apache.org/jira/browse/CASSANDRA-18050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838118#comment-17838118 ] Thomas De Keulenaer commented on CASSANDRA-18050: - Hi, There is an issue with JNA and libffi when the TMP and HOME dirs are not writable nor executable: * https://issues.apache.org/jira/browse/CASSANDRA-19565 * https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html With JNA >= 5.10.0 you can specify LIBFFI_TMPDIR. I think it is best to set this to the same value as jna.tmpdir. Perhaps in cassandra-env.sh? > Update JNA > -- > > Key: CASSANDRA-18050 > URL: https://issues.apache.org/jira/browse/CASSANDRA-18050 > Project: Cassandra > Issue Type: Task > Components: Build >Reporter: Ekaterina Dimitrova >Assignee: Brandon Williams >Priority: Normal > Fix For: 5.0-alpha1, 5.0 > > > JNA needs to be updated for JDK 17 support. There was an update pre-4.0 but > not to the latest version (confirmed with the people involved, there was no > particular reason not to update to the latest version, just missed on rebase > of older PR) > More details to come, opening this as a placeholder for now -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838115#comment-17838115 ] Stefan Miklosovic commented on CASSANDRA-19401: --- 4.1 build seems reasonabley fine [CASSANDRA-19401-4.1|https://github.com/instaclustr/cassandra/tree/CASSANDRA-19401-4.1] {noformat} java11_pre-commit_tests ✓ j11_build1m 28s ✓ j11_cqlsh_dtests_py3 5m 24s ✓ j11_cqlsh_dtests_py311 5m 25s ✓ j11_cqlsh_dtests_py311_vnode 6m 14s ✓ j11_cqlsh_dtests_py385m 47s ✓ j11_cqlsh_dtests_py38_vnode 5m 57s ✓ j11_cqlsh_dtests_py3_vnode 5m 33s ✓ j11_cqlshlib_cython_tests6m 55s ✓ j11_cqlshlib_tests6m 9s ✓ j11_dtests 33m 37s ✓ j11_jvm_dtests 19m 2s ✓ j11_jvm_dtests_vnode11m 38s ✓ j11_unit_tests_repeat 8m 4s ✕ j11_dtests_vnode33m 39s rebuild_test.TestRebuild test_simple_rebuild ✕ j11_unit_tests 7m 34s org.apache.cassandra.cql3.MemtableSizeTest testSize[skiplist] {noformat} [java11_pre-commit_tests|https://app.circleci.com/pipelines/github/instaclustr/cassandra/4187/workflows/9164b340-3472-4ba7-a1ae-6ab5cbf695cb] > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 50m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838114#comment-17838114 ] Thomas De Keulenaer commented on CASSANDRA-19565: - I see the trunk is already at JNA 5.13: https://github.com/apache/cassandra/blob/trunk/.build/parent-pom-template.xml#L543 idem for 5.0 > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838116#comment-17838116 ] Brandon Williams commented on CASSANDRA-19565: -- The directory is created by useradd [here|https://github.com/apache/cassandra/blob/trunk/redhat/cassandra.spec#L140], and this is what the directory listing looks like on a fresh install: {noformat} ]# ls -al /var/lib/cassandra/ total 0 drwxr-xr-x 6 root root 68 Apr 17 11:13 . drwxr-xr-x 1 root root 131 Apr 17 11:13 .. drwxr-xr-x 2 cassandra cassandra 6 Jan 23 19:52 commitlog drwxr-xr-x 2 cassandra cassandra 6 Jan 23 19:52 data drwxr-xr-x 2 cassandra cassandra 6 Jan 23 19:52 hints drwxr-xr-x 2 cassandra cassandra 6 Jan 23 19:52 saved_caches {noformat} Maybe you have a modified /etc/login.defs with a different umask or similar? > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
(cassandra) branch cassandra-5.0 updated: Update in-tree docker versions
This is an automated email from the ASF dual-hosted git repository. brandonwilliams pushed a commit to branch cassandra-5.0 in repository https://gitbox.apache.org/repos/asf/cassandra.git The following commit(s) were added to refs/heads/cassandra-5.0 by this push: new 1a66347e70 Update in-tree docker versions 1a66347e70 is described below commit 1a66347e70efe611200eef4641e7f7941c418fe6 Author: Brandon Williams AuthorDate: Wed Apr 17 05:54:00 2024 -0500 Update in-tree docker versions --- .build/docker/ubuntu2004_test.docker | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/.build/docker/ubuntu2004_test.docker b/.build/docker/ubuntu2004_test.docker index 614173312f..3e15b114d7 100644 --- a/.build/docker/ubuntu2004_test.docker +++ b/.build/docker/ubuntu2004_test.docker @@ -127,8 +127,8 @@ RUN /bin/bash -c "source ${BUILD_HOME}/env3.8/bin/activate && \ # this can be checked with: # `curl -s https://downloads.apache.org/cassandra/ | grep -oP '(?<=href=\")[0-9]+\.[0-9]+\.[0-9]+(?=)' | sort -V | uniq -w 3` RUN bash -c 'source ~/env3.8/bin/activate && \ -for i in {1..11} ; do echo $i ; ccm create --quiet -n 1 -v binary:4.0.$i test && ccm remove test ; done && \ -for i in {1..3} ; do echo $i ; ccm create --quiet -n 1 -v binary:4.1.$i test && ccm remove test ; done' +for i in {1..12} ; do echo $i ; ccm create --quiet -n 1 -v binary:4.0.$i test && ccm remove test ; done && \ +for i in {1..4} ; do echo $i ; ccm create --quiet -n 1 -v binary:4.1.$i test && ccm remove test ; done' # 5+ requires java11 RUN sudo update-java-alternatives --set java-1.11.0-openjdk-$(dpkg --print-architecture) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-18050) Update JNA
[ https://issues.apache.org/jira/browse/CASSANDRA-18050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838118#comment-17838118 ] Thomas De Keulenaer edited comment on CASSANDRA-18050 at 4/17/24 11:19 AM: --- Hi, There is an issue with JNA and libffi when the TMP and HOME dirs are not writable nor executable: * https://issues.apache.org/jira/browse/CASSANDRA-19565 * https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html With JNA >= 5.10.0 you can specify LIBFFI_TMPDIR. I think it is best to set this to the same value as jna.tmpdir. Perhaps in cassandra-env.sh? was (Author: twdkeule): Hi, There is an issue with JNA and libffi when the TMP and HOME dirs are not writable nor executable: * https://issues.apache.org/jira/browse/CASSANDRA-19565 * https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html With JNA >= 5.10.0 you can specify LIBFFI_TMPDIR. I think it is best to set this to the same value as jna.tmpdir. Perhaps in cassandra-env.sh? > Update JNA > -- > > Key: CASSANDRA-18050 > URL: https://issues.apache.org/jira/browse/CASSANDRA-18050 > Project: Cassandra > Issue Type: Task > Components: Build >Reporter: Ekaterina Dimitrova >Assignee: Brandon Williams >Priority: Normal > Fix For: 5.0-alpha1, 5.0 > > > JNA needs to be updated for JDK 17 support. There was an update pre-4.0 but > not to the latest version (confirmed with the people involved, there was no > particular reason not to update to the latest version, just missed on rebase > of older PR) > More details to come, opening this as a placeholder for now -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838115#comment-17838115 ] Stefan Miklosovic edited comment on CASSANDRA-19401 at 4/17/24 11:18 AM: - 4.1 build seems reasonably fine [CASSANDRA-19401-4.1|https://github.com/instaclustr/cassandra/tree/CASSANDRA-19401-4.1] {noformat} java11_pre-commit_tests ✓ j11_build1m 28s ✓ j11_cqlsh_dtests_py3 5m 24s ✓ j11_cqlsh_dtests_py311 5m 25s ✓ j11_cqlsh_dtests_py311_vnode 6m 14s ✓ j11_cqlsh_dtests_py385m 47s ✓ j11_cqlsh_dtests_py38_vnode 5m 57s ✓ j11_cqlsh_dtests_py3_vnode 5m 33s ✓ j11_cqlshlib_cython_tests6m 55s ✓ j11_cqlshlib_tests6m 9s ✓ j11_dtests 33m 37s ✓ j11_jvm_dtests 19m 2s ✓ j11_jvm_dtests_vnode11m 38s ✓ j11_unit_tests_repeat 8m 4s ✕ j11_dtests_vnode33m 39s rebuild_test.TestRebuild test_simple_rebuild ✕ j11_unit_tests 7m 34s org.apache.cassandra.cql3.MemtableSizeTest testSize[skiplist] {noformat} [java11_pre-commit_tests|https://app.circleci.com/pipelines/github/instaclustr/cassandra/4187/workflows/9164b340-3472-4ba7-a1ae-6ab5cbf695cb] was (Author: smiklosovic): 4.1 build seems reasonabley fine [CASSANDRA-19401-4.1|https://github.com/instaclustr/cassandra/tree/CASSANDRA-19401-4.1] {noformat} java11_pre-commit_tests ✓ j11_build1m 28s ✓ j11_cqlsh_dtests_py3 5m 24s ✓ j11_cqlsh_dtests_py311 5m 25s ✓ j11_cqlsh_dtests_py311_vnode 6m 14s ✓ j11_cqlsh_dtests_py385m 47s ✓ j11_cqlsh_dtests_py38_vnode 5m 57s ✓ j11_cqlsh_dtests_py3_vnode 5m 33s ✓ j11_cqlshlib_cython_tests6m 55s ✓ j11_cqlshlib_tests6m 9s ✓ j11_dtests 33m 37s ✓ j11_jvm_dtests 19m 2s ✓ j11_jvm_dtests_vnode11m 38s ✓ j11_unit_tests_repeat 8m 4s ✕ j11_dtests_vnode33m 39s rebuild_test.TestRebuild test_simple_rebuild ✕ j11_unit_tests 7m 34s org.apache.cassandra.cql3.MemtableSizeTest testSize[skiplist] {noformat} [java11_pre-commit_tests|https://app.circleci.com/pipelines/github/instaclustr/cassandra/4187/workflows/9164b340-3472-4ba7-a1ae-6ab5cbf695cb] > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 1h > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable
[jira] [Updated] (CASSANDRA-19538) Test Failure: test_assassinate_valid_node
[ https://issues.apache.org/jira/browse/CASSANDRA-19538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Tunnicliffe updated CASSANDRA-19538: Status: Ready to Commit (was: Review In Progress) +1 LGTM. Using an incorrect {{lastModified}} was causing updates to gossip state not to fire, because it looked to the listener like nothing had changed. > Test Failure: test_assassinate_valid_node > - > > Key: CASSANDRA-19538 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19538 > Project: Cassandra > Issue Type: Bug > Components: CI, Test/dtest/python >Reporter: Ekaterina Dimitrova >Assignee: Marcus Eriksson >Priority: Normal > Fix For: 5.x > > Attachments: ci_summary.html > > Time Spent: 10m > Remaining Estimate: 0h > > Failing consistently on trunk: > {code:java} > ccmlib.node.TimeoutError: 03 Apr 2024 19:39:32 [node1] after 120.11/120 > seconds Missing: ['127.0.0.4:7000.* is now UP'] not found in system.log: > Head: INFO [Messaging-EventLoop-3-1] 2024-04-03 19:37:3 > Tail: ... some nodes were not ready > INFO [OptionalTasks:1] 2024-04-03 19:39:30,454 CassandraRoleManager.java:484 > - Setup task failed with error, rescheduling > self = > def test_assassinate_valid_node(self): > """ > @jira_ticket CASSANDRA-16588 > Test that after taking two non-seed nodes down and assassinating > one of them, the other can come back up. > """ > cluster = self.cluster > > cluster.populate(5).start() > node1 = cluster.nodelist()[0] > node3 = cluster.nodelist()[2] > > self.cluster.set_configuration_options({ > 'seed_provider': [{'class_name': > 'org.apache.cassandra.locator.SimpleSeedProvider', >'parameters': [{'seeds': node1.address()}] > }] > }) > > non_seed_nodes = cluster.nodelist()[-2:] > for node in non_seed_nodes: > node.stop() > > assassination_target = non_seed_nodes[0] > logger.debug("Assassinating non-seed node > {}".format(assassination_target.address())) > out, err, _ = node1.nodetool("assassinate > {}".format(assassination_target.address())) > assert_stderr_clean(err) > > logger.debug("Starting non-seed nodes") > for node in non_seed_nodes: > > node.start() > gossip_test.py:78: > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:915: in start > node.watch_log_for_alive(self, from_mark=mark) > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:684: in > watch_log_for_alive > self.watch_log_for(tofind, from_mark=from_mark, timeout=timeout, > filename=filename) > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:608: in watch_log_for > TimeoutError.raise_if_passed(start=start, timeout=timeout, node=self.name, > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > start = 1712173052.8186479, timeout = 120 > msg = "Missing: ['127.0.0.4:7000.* is now UP'] not found in system.log:\n > Head: INFO [Messaging-EventLoop-3-1] 2024-04-03 1...[OptionalTasks:1] > 2024-04-03 19:39:30,454 CassandraRoleManager.java:484 - Setup task failed > with error, rescheduling\n" > node = 'node1' > @staticmethod > def raise_if_passed(start, timeout, msg, node=None): > if start + timeout < time.time(): > > raise TimeoutError.create(start, timeout, msg, node) > E ccmlib.node.TimeoutError: 03 Apr 2024 19:39:32 [node1] after > 120.11/120 seconds Missing: ['127.0.0.4:7000.* is now UP'] not found in > system.log: > EHead: INFO [Messaging-EventLoop-3-1] 2024-04-03 19:37:3 > ETail: ... some nodes were not ready > E INFO [OptionalTasks:1] 2024-04-03 19:39:30,454 > CassandraRoleManager.java:484 - Setup task failed with error, rescheduling > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:56: TimeoutError > {code} > https://app.circleci.com/pipelines/github/ekaterinadimitrova2/cassandra/2680/workflows/8b1c0d0a-7458-4b43-9bba-ac96b9bfe64f/jobs/58929/tests#failed-test-0 > https://ci-cassandra.apache.org/job/Cassandra-trunk/1859/#showFailuresLink -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838114#comment-17838114 ] Thomas De Keulenaer edited comment on CASSANDRA-19565 at 4/17/24 11:15 AM: --- I see the trunk is already at JNA 5.13: https://issues.apache.org/jira/browse/CASSANDRA-18050 idem for 5.0 was (Author: twdkeule): I see the trunk is already at JNA 5.13: https://github.com/apache/cassandra/blob/trunk/.build/parent-pom-template.xml#L543 idem for 5.0 > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838113#comment-17838113 ] Thomas De Keulenaer commented on CASSANDRA-19565: - If you do upgrade to JNA 5.10, you should probably set LIBFFI_TMPDIR to the same value as jna.tmpdir: bq. -Djna.tmpdir=/cassandra/tmp -Dio.netty.native.workdir=/cassandra/tmp Like https://github.com/elastic/elasticsearch/issues/77014 > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
(cassandra) 01/01: Merge branch 'cassandra-5.0' into trunk
This is an automated email from the ASF dual-hosted git repository. brandonwilliams pushed a commit to branch trunk in repository https://gitbox.apache.org/repos/asf/cassandra.git commit f345370f35dea1256ef0401a46bad8342d224ea5 Merge: f194c0772c 1a66347e70 Author: Brandon Williams AuthorDate: Wed Apr 17 05:54:06 2024 -0500 Merge branch 'cassandra-5.0' into trunk .build/docker/ubuntu2004_test.docker | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
(cassandra) branch trunk updated (f194c0772c -> f345370f35)
This is an automated email from the ASF dual-hosted git repository. brandonwilliams pushed a change to branch trunk in repository https://gitbox.apache.org/repos/asf/cassandra.git from f194c0772c Merge branch 'cassandra-5.0' into trunk new 1a66347e70 Update in-tree docker versions new f345370f35 Merge branch 'cassandra-5.0' into trunk The 2 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference. Summary of changes: .build/docker/ubuntu2004_test.docker | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Created] (CASSANDRA-19566) JSON encoded timestamp value does not always match non-JSON encoded value
Bowen Song created CASSANDRA-19566: -- Summary: JSON encoded timestamp value does not always match non-JSON encoded value Key: CASSANDRA-19566 URL: https://issues.apache.org/jira/browse/CASSANDRA-19566 Project: Cassandra Issue Type: Bug Reporter: Bowen Song Description: "SELECT JSON ..." and "toJson(...)" on Cassandra 4.1.4 produces different date than "SELECT ..." for some timestamp type values. Steps to reproduce: {code:java} $ sudo docker pull cassandra:4.1.4 $ sudo docker create --name cass cassandra:4.1.4 $ sudo docker start cass $ # wait for the Cassandra instance becomes ready $ sudo docker exec -ti cass cqlsh Connected to Test Cluster at 127.0.0.1:9042 [cqlsh 6.1.0 | Cassandra 4.1.4 | CQL spec 3.4.6 | Native protocol v5] Use HELP for help. cqlsh> create keyspace test WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}; cqlsh> use test; cqlsh:test> create table tbl (id int, ts timestamp, primary key (id)); cqlsh:test> insert into tbl (id, ts) values (1, -1376701920); cqlsh:test> select tounixtimestamp(ts), ts, tojson(ts) from tbl where id=1; system.tounixtimestamp(ts) | ts | system.tojson(ts) +-+ -1376701920 | 1533-09-28 12:00:00.00+ | "1533-09-18 12:00:00.000Z" (1 rows) cqlsh:test> select json * from tbl where id=1; [json] - {"id": 1, "ts": "1533-09-18 12:00:00.000Z"} (1 rows) {code} Expected behaviour: The "select ts", "select tojson(ts)" and "select json *" should all produce the same date. Actual behaviour: The "select ts" produced the "1533-09-28" date but the "select tojson(ts)" and "select json *" produced the "1533-09-18" date. -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838052#comment-17838052 ] Thomas De Keulenaer commented on CASSANDRA-19565: - Setting the cassandra user's home directory (/var/lib/cassandra) to read+write+executable and owned by cassandra, solves the issue. I suspect this is not done by default by the redhat package installation. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Assigned] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Miklosovic reassigned CASSANDRA-19401: - Assignee: Stefan Miklosovic > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Assignee: Stefan Miklosovic >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 50m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838054#comment-17838054 ] Stefan Miklosovic commented on CASSANDRA-19401: --- [~nob13] this one is more complete patch [https://github.com/apache/cassandra/pull/3259] would you verify that? I will create patches for other branches as well. > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 50m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19401) Nodetool import expects directory structure
[ https://issues.apache.org/jira/browse/CASSANDRA-19401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Miklosovic updated CASSANDRA-19401: -- Authors: Stefan Miklosovic Test and Documentation Plan: CI Status: Patch Available (was: In Progress) > Nodetool import expects directory structure > --- > > Key: CASSANDRA-19401 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19401 > Project: Cassandra > Issue Type: Bug > Components: Local/SSTable >Reporter: Norbert Schultz >Priority: Normal > Fix For: 4.1.x, 5.0.x, 5.x > > Time Spent: 50m > Remaining Estimate: 0h > > According to the > [documentation|https://cassandra.apache.org/doc/4.1/cassandra/operating/bulk_loading.html] > the nodetool import should not rely on the folder structure of the imported > sst files: > {quote} > Because the keyspace and table are specified on the command line for nodetool > import, there is not the same requirement as with sstableloader, to have the > SSTables in a specific directory path. When importing snapshots or > incremental backups with nodetool import, the SSTables don’t need to be > copied to another directory. > {quote} > However when importing old cassandra snapshots, we figured out, that sstables > still need to be in a directory called like $KEYSPACE/$TABLENAME files, even > when keyspace and table name are already present as parameters for the > nodetool import call. > Call we used: > {code} > nodetool import --copy-data mykeyspace mytable /full_path_to/test1 > {code} > Log: > {code} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,565 > SSTableImporter.java:72 - Loading new SSTables for mykeyspace/mytable: > Options{srcPaths='[/full_path_to/test1]', resetLevel=true, > clearRepaired=true, verifySSTables=true, verifyTokens=true, > invalidateCaches=true, extendedVerify=false, copyData= true} > INFO [RMI TCP Connection(21)-127.0.0.1] 2024-02-15 10:41:06,566 > SSTableImporter.java:173 - No new SSTables were found for mykeyspace/mytable > {code} > However, when we move the sstables (.db-Files) to > {{alternative/mykeyspace/mytable}} > and import with > {code} > nodetool import --copy-data mykeyspace mytable > /fullpath/alternative/mykeyspace/mytable > {code} > the import works > {code} > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:177 - Loading new SSTables and building secondary > indexes for mykeyspace/mytable: > [BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-2-big-Data.db'), > > BigTableReader(path='/mnt/ramdisk/cassandra4/data/mykeyspace/mytable-561a12d0cbe611eead78fbfd293cee40/me-1-big-Data.db')] > INFO [RMI TCP Connection(23)-127.0.0.1] 2024-02-15 10:43:36,093 > SSTableImporter.java:190 - Done loading load new SSTables for > mykeyspace/mytable > {code} > We experienced this in Cassandra 4.1.3 on Java 11 (Linux) -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838067#comment-17838067 ] Thomas De Keulenaer commented on CASSANDRA-19565: - bq. I wonder what's the proposed solution here: https://access.redhat.com/solutions/6991416 I cannot access that issue. bq. Also should we upgrade JNA Brandon Williams as in my PR or better leave it alone, wdyt? FYI: Upgrading JNA wont actually solve the issue, it will only prevent the SIGSEGV + core dump and replace it with a java exception. > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838062#comment-17838062 ] Berenguer Blasi commented on CASSANDRA-19565: - I wonder what's the proposed solution here: https://access.redhat.com/solutions/6991416 > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Commented] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838061#comment-17838061 ] Berenguer Blasi commented on CASSANDRA-19565: - Oh Icwym. [~brandon.williams] is familiar with the releases code so pinging him. Also should we upgrade JNA [~brandon.williams] as in my PR or better leave it alone, wdyt? > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
Re: [PR] CASSANDRA-19562: DefaultLoadBalancingPolicy considers a TimeOut as a response [cassandra-java-driver]
adutra commented on code in PR #1927: URL: https://github.com/apache/cassandra-java-driver/pull/1927#discussion_r1568473082 ## core/src/main/java/com/datastax/oss/driver/internal/core/loadbalancing/DefaultLoadBalancingPolicy.java: ## @@ -252,7 +253,9 @@ public void onNodeError( @NonNull DriverExecutionProfile executionProfile, @NonNull Node node, @NonNull String logPrefix) { -updateResponseTimes(node); +if (!(error instanceof DriverTimeoutException)) { Review Comment: @SiyaoIsHiding are you sure this can happen? My recollection is that when a driver timeout happens, the query is aborted and `RequestTracker.onError()` is invoked, not `RequestTracker.onNodeError()`. https://github.com/apache/cassandra-java-driver/blob/16260261d3df50fcf24fac1fc2d37896c4a111bf/core/src/main/java/com/datastax/oss/driver/internal/core/cql/CqlRequestHandler.java#L209-L211 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Comment Edited] (CASSANDRA-19565) SIGSEGV on Cassandra v4.1.4
[ https://issues.apache.org/jira/browse/CASSANDRA-19565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17838033#comment-17838033 ] Berenguer Blasi edited comment on CASSANDRA-19565 at 4/17/24 8:49 AM: -- ^that is exactly what I was trying to get to... Bc C* code for {{NativeLibraryLinux}} looks identical 4.0 vs 4.1 was (Author: bereng): ^that is exactly what I was trying to get to... > SIGSEGV on Cassandra v4.1.4 > --- > > Key: CASSANDRA-19565 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19565 > Project: Cassandra > Issue Type: Bug >Reporter: Thomas De Keulenaer >Priority: Normal > Fix For: 4.1.x > > Attachments: hs_err_pid1116450.log > > > Hello, > Since upgrading to v4.1. we cannat run CAssandra any more. Each start > immediately crashes: > {{Apr 17 08:58:34 SVALD108 cassandra[1116450]: # A fatal error has been > detected by the Java Runtime Environment: > Apr 17 08:58:34 SVALD108 cassandra[1116450]: # SIGSEGV (0xb) at > pc=0x7fccaab4d152, pid=1116450, tid=1116451}} > I have added the log from the coe dump. > This issue is perhaps related to > https://davecturner.github.io/2021/08/30/seven-year-old-segfault.html ? -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org
[jira] [Updated] (CASSANDRA-19538) Test Failure: test_assassinate_valid_node
[ https://issues.apache.org/jira/browse/CASSANDRA-19538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Tunnicliffe updated CASSANDRA-19538: Reviewers: Sam Tunnicliffe, Sam Tunnicliffe Sam Tunnicliffe, Sam Tunnicliffe (was: Sam Tunnicliffe) Status: Review In Progress (was: Patch Available) > Test Failure: test_assassinate_valid_node > - > > Key: CASSANDRA-19538 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19538 > Project: Cassandra > Issue Type: Bug > Components: CI, Test/dtest/python >Reporter: Ekaterina Dimitrova >Assignee: Marcus Eriksson >Priority: Normal > Fix For: 5.x > > Attachments: ci_summary.html > > Time Spent: 10m > Remaining Estimate: 0h > > Failing consistently on trunk: > {code:java} > ccmlib.node.TimeoutError: 03 Apr 2024 19:39:32 [node1] after 120.11/120 > seconds Missing: ['127.0.0.4:7000.* is now UP'] not found in system.log: > Head: INFO [Messaging-EventLoop-3-1] 2024-04-03 19:37:3 > Tail: ... some nodes were not ready > INFO [OptionalTasks:1] 2024-04-03 19:39:30,454 CassandraRoleManager.java:484 > - Setup task failed with error, rescheduling > self = > def test_assassinate_valid_node(self): > """ > @jira_ticket CASSANDRA-16588 > Test that after taking two non-seed nodes down and assassinating > one of them, the other can come back up. > """ > cluster = self.cluster > > cluster.populate(5).start() > node1 = cluster.nodelist()[0] > node3 = cluster.nodelist()[2] > > self.cluster.set_configuration_options({ > 'seed_provider': [{'class_name': > 'org.apache.cassandra.locator.SimpleSeedProvider', >'parameters': [{'seeds': node1.address()}] > }] > }) > > non_seed_nodes = cluster.nodelist()[-2:] > for node in non_seed_nodes: > node.stop() > > assassination_target = non_seed_nodes[0] > logger.debug("Assassinating non-seed node > {}".format(assassination_target.address())) > out, err, _ = node1.nodetool("assassinate > {}".format(assassination_target.address())) > assert_stderr_clean(err) > > logger.debug("Starting non-seed nodes") > for node in non_seed_nodes: > > node.start() > gossip_test.py:78: > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:915: in start > node.watch_log_for_alive(self, from_mark=mark) > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:684: in > watch_log_for_alive > self.watch_log_for(tofind, from_mark=from_mark, timeout=timeout, > filename=filename) > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:608: in watch_log_for > TimeoutError.raise_if_passed(start=start, timeout=timeout, node=self.name, > _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ > _ > start = 1712173052.8186479, timeout = 120 > msg = "Missing: ['127.0.0.4:7000.* is now UP'] not found in system.log:\n > Head: INFO [Messaging-EventLoop-3-1] 2024-04-03 1...[OptionalTasks:1] > 2024-04-03 19:39:30,454 CassandraRoleManager.java:484 - Setup task failed > with error, rescheduling\n" > node = 'node1' > @staticmethod > def raise_if_passed(start, timeout, msg, node=None): > if start + timeout < time.time(): > > raise TimeoutError.create(start, timeout, msg, node) > E ccmlib.node.TimeoutError: 03 Apr 2024 19:39:32 [node1] after > 120.11/120 seconds Missing: ['127.0.0.4:7000.* is now UP'] not found in > system.log: > EHead: INFO [Messaging-EventLoop-3-1] 2024-04-03 19:37:3 > ETail: ... some nodes were not ready > E INFO [OptionalTasks:1] 2024-04-03 19:39:30,454 > CassandraRoleManager.java:484 - Setup task failed with error, rescheduling > ../env3.8/lib/python3.8/site-packages/ccmlib/node.py:56: TimeoutError > {code} > https://app.circleci.com/pipelines/github/ekaterinadimitrova2/cassandra/2680/workflows/8b1c0d0a-7458-4b43-9bba-ac96b9bfe64f/jobs/58929/tests#failed-test-0 > https://ci-cassandra.apache.org/job/Cassandra-trunk/1859/#showFailuresLink -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org