[ 
https://issues.apache.org/jira/browse/GEODE-8240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17148076#comment-17148076
 ] 

ASF GitHub Bot commented on GEODE-8240:
---------------------------------------

Bill commented on a change in pull request #5273:
URL: https://github.com/apache/geode/pull/5273#discussion_r447181500



##########
File path: 
geode-core/src/upgradeTest/java/org/apache/geode/internal/cache/rollingupgrade/RollingUpgradeDUnitTest.java
##########
@@ -191,20 +207,26 @@ void doTestRollAll(String regionType, String objectType, 
String startingVersion)
       }
 
       putAndVerify(objectType, server1, regionName, 0, 10, server2);
-      locator = rollLocatorToCurrent(locator, hostName, locatorPorts[0], 
getTestMethodName(),
-          locatorString);
+      locator =
+          rollLocatorToCurrent(locator, hostName, locatorPort, locatorProps, 
getTestMethodName(),
+              locatorString);
 
       server1 = rollServerToCurrentAndCreateRegion(server1, regionType, 
testingDirs[0],
-          shortcutName, regionName, locatorPorts);
+          shortcutName, regionName, new int[] {locatorPort});
       verifyValues(objectType, regionName, 0, 10, server1);
       putAndVerify(objectType, server1, regionName, 5, 15, server2);
       putAndVerify(objectType, server2, regionName, 10, 20, server1);
 
       server2 = rollServerToCurrentAndCreateRegion(server2, regionType, 
testingDirs[1],
-          shortcutName, regionName, locatorPorts);
+          shortcutName, regionName, ports);
       verifyValues(objectType, regionName, 0, 10, server2);
       putAndVerify(objectType, server2, regionName, 15, 25, server1);
 
+      String shutDownCommand = "shutdown --include-locators=true";

Review comment:
       My latest commit eliminates `gfsh` entirely so the issue with 
`shutDownCommand` is moot.
   
   I eliminated use of `gfsh` because I found the test had become flaky on 
versions 1.3.0-1.8.0, at least on macOS. On Linux (CI) I did not see the 
flakiness. But rather than figuring that out, I decided to grab the view more 
directly: by accessing `InternalDistributedSystem` in the locator JVM.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> View has old locator version number after rolling upgrade
> ---------------------------------------------------------
>
>                 Key: GEODE-8240
>                 URL: https://issues.apache.org/jira/browse/GEODE-8240
>             Project: Geode
>          Issue Type: Bug
>          Components: client/server, membership
>            Reporter: Ernest Burghardt
>            Assignee: Bill Burcham
>            Priority: Major
>
> as shown in [https://github.com/apache/geode/pull/5224]
> locator upgrade from version 1.12.0 doesn't seem to occur 
> {{testRollServersOnPartitionedRegion_dataserializable}}  failure results:
> Expecting:
>  <"Member Count : 3
>  Name | Id
>  ---- | 
> -------------------------------------------------------------------------------
>  vm2 | 127.0.0.1(vm2:35019:locator)<ec><v17>:41000(version:GEODE 1.12.0) 
> [Coordinator]
>  vm0 | 10.0.0.111(vm0:35025)<v27>:41001
>  vm1 | 10.0.0.111(vm1:35030)<v29>:41002
>  ">
>  not to contain:
>  <"1.12.0">
> This problem was introduced in 1.12.0 and is present in all lines derived 
> from that one, including 9.10, 1.13, and current develop/1.14
> What's actually happening is that the locator _is_ upgraded to a newer 
> version. It joins with an older coordinator (that's running e.g. 1.12.0) and 
> that coordinator produces a view showing the new locator/member as running 
> the same version, in this case 1.12.0, as the coordinator.
> Eventually, all locators will be upgraded. But the view carries the incorrect 
> version indication.
> The root cause seems to be that when {{GMSMemberData.setVersionObject(short 
> versionOrdinal)}} sees a version ordinal that is unknown, i.e. a version 
> ordinal corresponding to a new line of development: 1.13, 1.14, … that method 
> throws away that version ordinal and replaces it with the one for the 1.12 
> line.
> Since the current {{support/1.13}} and {{develop}} branches have the bug 
> upgrading a current 1.13 to 1.14 or a current development/1.14 to 1.15 would 
> exhibit the same behavior (locator apparently stuck at the older version in 
> the view.)
> Ramifications of this incorrect version indication in the view are TBD.
> Whether or not this situation resolves itself after _another_ round of 
> restarts is TBD.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to