Re: Nodes show different number of tokens than initially

2018-01-29 Thread Oleksandr Shulgin
On Tue, Jan 30, 2018 at 5:13 AM, kurt greaves  wrote:

> Shouldn't happen. Can you send through nodetool ring output from one of
> those nodes? Also, did the logs have anything to say about tokens when you
> started the 3 seed nodes?​
>

Hi Kurt,

I cannot run nodetool ring anymore, since these test nodes are long gone.
However I've grepped the logs and this is what I've found:

Jan 25 08:57:18 ip-172-31-128-41 docker/cf3ea463915a[854]: INFO 08:57:18
Nodes /172.31.128.31 and /172.31.128.41 have the same token
-9223372036854775808. Ignoring /172.31.128.31
Jan 25 08:57:18 ip-172-31-128-41 docker/cf3ea463915a[854]: INFO 08:57:18
Nodes /172.31.144.32 and /172.31.128.41 have the same token
-8454757700450211158. Ignoring /172.31.144.32
Jan 25 08:58:30 ip-172-31-144-41 docker/48fba443d99f[852]: INFO 08:58:30
Nodes /172.31.128.41 and /172.31.128.31 have the same token
-9223372036854775808. /172.31.128.41 is the new owner
Jan 25 08:58:30 ip-172-31-144-41 docker/48fba443d99f[852]: INFO 08:58:30
Nodes /172.31.144.32 and /172.31.128.41 have the same token
-8454757700450211158. Ignoring /172.31.144.32
Jan 25 08:59:45 ip-172-31-160-41 docker/cced70e132f2[849]: INFO 08:59:45
Nodes /172.31.128.41 and /172.31.128.31 have the same token
-9223372036854775808. /172.31.128.41 is the new owner
Jan 25 08:59:45 ip-172-31-160-41 docker/cced70e132f2[849]: INFO 08:59:45
Nodes /172.31.144.32 and /172.31.128.41 have the same token
-8454757700450211158. Ignoring /172.31.144.32

Since we are allocating the tokens for seed nodes manually, it appears that
the first seed node in the new ring (172.31.128.41) gets the same first
token (-9223372036854775808) as the node in the old ring (172.31.128.31).
The same goes for the 3rd token of the new seed node (-8454757700450211158).

What is beyond me is why would that matter and why would token ownership
change at all, while these nodes are in the *different virtual DCs*?  To me
this sounds like a paticularly nasty bug...

-- 
Oleksandr "Alex" Shulgin | Database Engineer | Zalando SE | Tel: +49 176
127-59-707


Re: TWCS not deleting expired sstables

2018-01-29 Thread kurt greaves
Likely a read repair caused old data to be brought into a newer SSTable.
Try running sstableexpiredblockers to find out if there's a newer SSTable
blocking that one from being dropped.​


Re: Nodes show different number of tokens than initially

2018-01-29 Thread kurt greaves
Shouldn't happen. Can you send through nodetool ring output from one of
those nodes? Also, did the logs have anything to say about tokens when you
started the 3 seed nodes?​


Re: group by select queries

2018-01-29 Thread kurt greaves
What consistency were you querying at? Can you retry with CONSISTENCY ALL?

​


Re: Nodetool hanging - Cassandra 3.10, Oracle Java 1.8.0_131

2018-01-29 Thread James Lovato
I’ve resolved the issue.   These hosts were initially created in a different 
VLAN.  There was a holdover entry in /etc/hosts pointing to the old IP Addr for 
this host.

So, the return back to the client never happened because it was sending the 
connection to the old IP to a host that did not exist.

Thanks for looking!
James

From: James Lovato 
Reply-To: "user@cassandra.apache.org" 
Date: Thursday, January 25, 2018 at 10:44 AM
To: "user@cassandra.apache.org" 
Subject: Re: Nodetool hanging - Cassandra 3.10, Oracle Java 1.8.0_131

It starts up, I can telnet to the port.  Also the log states so:

^C[root@dfprdbcdlpcs5 logs]# grep JMX cassandra.log
INFO  [main] 2018-01-25 10:52:54,176 JMXServerUtils.java:249 - Configured JMX 
server at: service:jmx:rmi://0.0.0.0/jndi/rmi://0.0.0.0:7199/jmxrmi
INFO  [main] 2018-01-25 10:52:54,379 StartupChecks.java:164 - JMX is enabled to 
receive remote connections on port: 7199
INFO  [main] 2018-01-25 10:55:25,751 JMXServerUtils.java:249 - Configured JMX 
server at: service:jmx:rmi://0.0.0.0/jndi/rmi://0.0.0.0:7199/jmxrmi
INFO  [main] 2018-01-25 10:55:25,957 StartupChecks.java:164 - JMX is enabled to 
receive remote connections on port: 7199

From: Roger Brown 
Reply-To: "user@cassandra.apache.org" 
Date: Thursday, January 25, 2018 at 9:51 AM
To: "user@cassandra.apache.org" 
Subject: Re: Nodetool hanging - Cassandra 3.10, Oracle Java 1.8.0_131

My first thought is restart cassandra and monitor its log to make sure it 
starts up.

On Thu, Jan 25, 2018 at 9:15 AM Jeff Jirsa 
mailto:jji...@gmail.com>> wrote:
Can you telnet to the 7199 port?

--
Jeff Jirsa


On Jan 25, 2018, at 8:00 AM, James Lovato 
mailto:james.lov...@iacapps.com>> wrote:
I have a 15 node cluster, 5 in each of 3 dcs.  One host just recently started 
hanging doing any `nodetool` requests.  I’ve enabled remote JMX on all these 
hosts and it was working fine until this week. No other changes have been made. 
All the Cassandra-env.sh files are the same among the nodes, and only seeds and 
listen_address are different for Cassandra.yaml files.

Nothing in debug log about why the Connection timed out. This is what I get 
running nodetool from the same host:

[root@host5 conf]# nodetool status
nodetool: Failed to connect to 'xx.xx.xx.xx:7199' - ConnectException: 
'Connection timed out (Connection timed out)'.

Any thoughts or how to log nodetool’s actions? Or see why it’s not accepting 
the request?  Telnet to the port from localhost as well as another node work 
fine.

Thanks,
James


Re: Cassandra nodes are down

2018-01-29 Thread Jeff Jirsa
Something is invoking the shutdown hook (calling kill). May be your config 
management or similar.

-- 
Jeff Jirsa


> On Jan 29, 2018, at 9:38 AM, Mokkapati, Bhargav (Nokia - IN/Chennai) 
>  wrote:
> 
> Hi Team,
>  
> I’m getting the below warnings. Please help me out to clear these issues.
>  
> Apache Cassandra version : 3.0.13, 5 Node cluster
>  
> INFO  [main] 2018-01-29 16:58:19,487 NativeLibrary.java:167 - JNA mlockall 
> successful
> WARN  [main] 2018-01-29 16:58:19,488 StartupChecks.java:121 - jemalloc shared 
> library could not be preloaded to speed up memory allocations
> INFO  [main] 2018-01-29 16:58:19,488 StartupChecks.java:160 - JMX is enabled 
> to receive remote connections on port: 8002
> WARN  [main] 2018-01-29 16:58:19,488 StartupChecks.java:178 - OpenJDK is not 
> recommended. Please upgrade to the newest Oracle Java release
> INFO  [main] 2018-01-29 16:58:19,490 SigarLibrary.java:44 - Initializing 
> SIGAR library
> WARN  [main] 2018-01-29 16:58:19,498 SigarLibrary.java:174 - Cassandra server 
> running in degraded mode. Is swap disabled? : true,  Address space adequate? 
> : true,  nofile limit adequate? : false, nproc limit adequate? : true
> WARN  [main] 2018-01-29 16:58:19,500 StartupChecks.java:246 - Maximum number 
> of memory map areas per process (vm.max_map_count) 65530 is too low, 
> recommended value: 1048575, you can change it with sysctl.
>  
> WARN  [main] 2018-01-29 17:05:07,844 SystemKeyspace.java:1042 - No host ID 
> found, created 2dc59352-e98e-4e77-a5f2-289697e467c7 (Note: This should happen 
> exactly once per node).
> INFO  [main] 2018-01-29 17:05:16,421 Server.java:160 - Starting listening for 
> CQL clients on /10.50.21.22:9042 (unencrypted)...
> INFO  [main] 2018-01-29 17:05:16,449 CassandraDaemon.java:488 - Not starting 
> RPC server as requested. Use JMX (StorageService->startRPCServer()) or 
> nodetool (enablethrift) to start it
> INFO  [OptionalTasks:1] 2018-01-29 17:05:18,443 CassandraRoleManager.java:350 
> - Created default superuser role 'cassandra'
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,737 
> HintsService.java:212 - Paused hints dispatch
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,740 Server.java:180 - 
> Stop listening for CQL clients
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,740 Gossiper.java:1490 
> - Announcing shutdown
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,741 
> StorageService.java:1991 - Node /10.50.21.22 state jump to shutdown
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:57,743 
> MessagingService.java:811 - Waiting for messaging service to quiesce
> INFO  [ACCEPT-/10.50.21.22] 2018-01-29 17:09:57,743 
> MessagingService.java:1110 - MessagingService has terminated the accept() 
> thread
> INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:57,797 
> HintsService.java:212 - Paused hints dispatch
>  
> Thanks,
> Bhargav M.


Cassandra nodes are down

2018-01-29 Thread Mokkapati, Bhargav (Nokia - IN/Chennai)
Hi Team,

I'm getting the below warnings. Please help me out to clear these issues.

Apache Cassandra version : 3.0.13, 5 Node cluster

INFO  [main] 2018-01-29 16:58:19,487 NativeLibrary.java:167 - JNA mlockall 
successful
WARN  [main] 2018-01-29 16:58:19,488 StartupChecks.java:121 - jemalloc shared 
library could not be preloaded to speed up memory allocations
INFO  [main] 2018-01-29 16:58:19,488 StartupChecks.java:160 - JMX is enabled to 
receive remote connections on port: 8002
WARN  [main] 2018-01-29 16:58:19,488 StartupChecks.java:178 - OpenJDK is not 
recommended. Please upgrade to the newest Oracle Java release
INFO  [main] 2018-01-29 16:58:19,490 SigarLibrary.java:44 - Initializing SIGAR 
library
WARN  [main] 2018-01-29 16:58:19,498 SigarLibrary.java:174 - Cassandra server 
running in degraded mode. Is swap disabled? : true,  Address space adequate? : 
true,  nofile limit adequate? : false, nproc limit adequate? : true
WARN  [main] 2018-01-29 16:58:19,500 StartupChecks.java:246 - Maximum number of 
memory map areas per process (vm.max_map_count) 65530 is too low, recommended 
value: 1048575, you can change it with sysctl.

WARN  [main] 2018-01-29 17:05:07,844 SystemKeyspace.java:1042 - No host ID 
found, created 2dc59352-e98e-4e77-a5f2-289697e467c7 (Note: This should happen 
exactly once per node).
INFO  [main] 2018-01-29 17:05:16,421 Server.java:160 - Starting listening for 
CQL clients on /10.50.21.22:9042 (unencrypted)...
INFO  [main] 2018-01-29 17:05:16,449 CassandraDaemon.java:488 - Not starting 
RPC server as requested. Use JMX (StorageService->startRPCServer()) or nodetool 
(enablethrift) to start it
INFO  [OptionalTasks:1] 2018-01-29 17:05:18,443 CassandraRoleManager.java:350 - 
Created default superuser role 'cassandra'
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,737 
HintsService.java:212 - Paused hints dispatch
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,740 Server.java:180 - 
Stop listening for CQL clients
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,740 Gossiper.java:1490 - 
Announcing shutdown
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:55,741 
StorageService.java:1991 - Node /10.50.21.22 state jump to shutdown
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:57,743 
MessagingService.java:811 - Waiting for messaging service to quiesce
INFO  [ACCEPT-/10.50.21.22] 2018-01-29 17:09:57,743 MessagingService.java:1110 
- MessagingService has terminated the accept() thread
INFO  [StorageServiceShutdownHook] 2018-01-29 17:09:57,797 
HintsService.java:212 - Paused hints dispatch

Thanks,
Bhargav M.