[ https://issues.apache.org/jira/browse/GEODE-7072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16912718#comment-16912718 ]
Bruce Schuchardt commented on GEODE-7072: ----------------------------------------- The failure that Aaron mentioned is in the same test and is a deserialization issue but does not concern locator recovery. Instead it is a member running v1.3.0 receiving a FinalCheckPassedMessage, but that class didn't exist until v1.4.0 {noformat} [error 2019/08/20 21:29:02.924 GMT <unicast receiver,c25ef1397f7e-51259> tid=0x2f] Exception deserializing message payload: [dst: 172.17.0.13<v1>:32770, src: 172.17.0.13<v12>:41002 (2 headers), size=107 bytes, flags=OOB|DONT_BUNDLE|NO_FC|SKIP_BARRIER] org.apache.geode.internal.DSFIDNotFoundException: Unknown DataSerializableFixedID: -158 at org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:1003) at org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693) at org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961) at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger.deserializeMessage(JGroupsMessenger.java:1121) at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger.readJGMessage(JGroupsMessenger.java:1013) at org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1279) at org.jgroups.JChannel.invokeCallback(JChannel.java:816) at org.jgroups.JChannel.up(JChannel.java:741) at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030) at org.jgroups.protocols.FRAG2.up(FRAG2.java:165) at org.jgroups.protocols.FlowControl.up(FlowControl.java:390) at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1070) at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:785) {noformat} A coding error in GMSHealthMonitor causes this message to be sent to a suspect-initiator if an availability check passes for the suspect and the suspect-initiator is running with v1.3.0 or above. That needs to be changed to v1.4.0 and above. > CI Failure: WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo > > EventProcessingMixedSiteOneCurrentSiteTwo[from_v130] FAILED > ------------------------------------------------------------------------------------------------------------------------------------ > > Key: GEODE-7072 > URL: https://issues.apache.org/jira/browse/GEODE-7072 > Project: Geode > Issue Type: Test > Components: wan > Reporter: Owen Nichols > Assignee: Bruce Schuchardt > Priority: Major > Fix For: 1.11.0 > > Time Spent: 20m > Remaining Estimate: 0h > > {noformat} > org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo > > EventProcessingMixedSiteOneCurrentSiteTwo[from_v130] FAILED > org.apache.geode.test.dunit.RMIException: While invoking > org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo$$Lambda$47/1509632157.run > in VM 0 running on Host aac3b458d9ea with 7 VMs with version 130 > at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:579) > at org.apache.geode.test.dunit.VM.invoke(VM.java:406) > at > org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.EventProcessingMixedSiteOneCurrentSiteTwo(WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.java:63) > Caused by: > org.apache.geode.InternalGemFireException: Unable to recover previous > membership view from locator26547view.dat > at > org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recoverFromFile(GMSLocator.java:462) > at > org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recover(GMSLocator.java:387) > at > org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.init(GMSLocator.java:146) > at > org.apache.geode.distributed.internal.InternalLocator$PrimaryHandler.init(InternalLocator.java:1225) > at > org.apache.geode.distributed.internal.tcpserver.TcpServer.start(TcpServer.java:232) > at > org.apache.geode.distributed.internal.InternalLocator.startTcpServer(InternalLocator.java:517) > at > org.apache.geode.distributed.internal.InternalLocator.startPeerLocation(InternalLocator.java:575) > at > org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:321) > at > org.apache.geode.distributed.Locator.startLocator(Locator.java:253) > at > org.apache.geode.distributed.Locator.startLocatorAndDS(Locator.java:140) > at > org.apache.geode.cache.wan.WANRollingUpgradeDUnitTest.startLocator(WANRollingUpgradeDUnitTest.java:105) > at > org.apache.geode.cache.wan.WANRollingUpgradeDUnitTest.startLocator(WANRollingUpgradeDUnitTest.java:97) > at > org.apache.geode.cache.wan.WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.lambda$EventProcessingMixedSiteOneCurrentSiteTwo$6f8ee815$1(WANRollingUpgradeEventProcessingMixedSiteOneCurrentSiteTwo.java:63) > Caused by: > org.apache.geode.SerializationException: Could not create an > instance of org.apache.geode.distributed.internal.membership.NetView . > at > org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2381) > at > org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:986) > at > org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693) > at > org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961) > at > org.apache.geode.distributed.internal.membership.gms.locator.GMSLocator.recoverFromFile(GMSLocator.java:440) > ... 12 more > Caused by: > org.apache.geode.SerializationException: Could not create an > instance of org.apache.geode.distributed.internal.membership.gms.GMSMember . > at > org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2381) > at > org.apache.geode.internal.DSFIDFactory.create(DSFIDFactory.java:986) > at > org.apache.geode.internal.InternalDataSerializer.basicReadObject(InternalDataSerializer.java:2693) > at > org.apache.geode.DataSerializer.readObject(DataSerializer.java:2961) > at > org.apache.geode.distributed.internal.membership.NetView.fromData(NetView.java:603) > at > org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2370) > ... 16 more > Caused by: > java.io.IOException: While reading an InetAddress > at > org.apache.geode.DataSerializer.readInetAddress(DataSerializer.java:474) > at > org.apache.geode.distributed.internal.membership.gms.GMSMember.readEssentialData(GMSMember.java:512) > at > org.apache.geode.distributed.internal.membership.gms.GMSMember.fromData(GMSMember.java:495) > at > org.apache.geode.internal.InternalDataSerializer.invokeFromData(InternalDataSerializer.java:2370) > ... 21 more > Caused by: > java.net.UnknownHostException: addr is of illegal > length > at > java.net.InetAddress.getByAddress(InetAddress.java:1043) > at > java.net.InetAddress.getByAddress(InetAddress.java:1440) > at > org.apache.geode.DataSerializer.readInetAddress(DataSerializer.java:467) > ... 24 more > {noformat} > seen in > https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/UpgradeTestOpenJDK8/builds/969 > =-=-=-=-=-=-=-=-=-=-=-=-=-=-= Test Results URI > =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-= > http://files.apachegeode-ci.info/builds/apache-develop-main/1.11.0-SNAPSHOT.0023/test-results/upgradeTest/1565373787/ > =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-= > Test report artifacts from this job are available at: > http://files.apachegeode-ci.info/builds/apache-develop-main/1.11.0-SNAPSHOT.0023/test-artifacts/1565373787/upgradetestfiles-OpenJDK8-1.11.0-SNAPSHOT.0023.tgz -- This message was sent by Atlassian Jira (v8.3.2#803003)