[ https://issues.apache.org/jira/browse/CASSANDRA-12905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15703341#comment-15703341 ]
Benjamin Roth commented on CASSANDRA-12905: ------------------------------------------- Version: 3.10, trunk from 2016-11-08 Yes, production but only on the node I am about to bootstrap - where this issue occured. Side effects: None so far, but node is still bootstrapping. IMHO the changes are quite overseeable. But as stated, I now run into other problems: 1. Bootstrap is still f**** slow due to streaming write path (see comment above) 2. Hints throw tons of WTE when not paused during bootstrap (see comment above) > Retry acquire MV lock on failure instead of throwing WTE on streaming > --------------------------------------------------------------------- > > Key: CASSANDRA-12905 > URL: https://issues.apache.org/jira/browse/CASSANDRA-12905 > Project: Cassandra > Issue Type: Bug > Components: Streaming and Messaging > Environment: centos 6.7 x86_64 > Reporter: Nir Zilka > Priority: Critical > Fix For: 3.9 > > > Hello, > I performed two upgrades to the current cluster (currently 15 nodes, 1 DC, > private VLAN), > first it was 2.2.5.1 and repair worked flawlessly, > second upgrade was to 3.0.9 (with upgradesstables) and also repair worked > well, > then i upgraded 2 weeks ago to 3.9 - and the repair problems started. > there are several errors types from the system.log (different nodes) : > - Sync failed between /xxx.xxx.xxx.xxx and /xxx.xxx.xxx.xxx > - Streaming error occurred on session with peer xxx.xxx.xxx.xxx Operation > timed out - received only 0 responses > - Remote peer xxx.xxx.xxx.xxx failed stream session > - Session completed with the following error > org.apache.cassandra.streaming.StreamException: Stream failed > ---- > i use 3.9 default configuration with the cluster settings adjustments (3 > seeds, GossipingPropertyFileSnitch). > streaming_socket_timeout_in_ms is the default (86400000). > i'm afraid from consistency problems while i'm not performing repair. > Any ideas? > Thanks, > Nir. -- This message was sent by Atlassian JIRA (v6.3.4#6332)