ParallelPhoenixConnectionFailureTest.testExecuteQueryChainFailure also fails too often, especially when the test host is slow and/or the load is high. On my fast laptop, I can semi-reliably break it by running mvn clean verify -am -pl phoenix-core -DnumForkedUT=20
On Fri, Feb 9, 2024 at 9:19 AM Istvan Toth <st...@apache.org> wrote: > We're making progress. > I can see that Viraj has just landed PHOENIX-7601, and Rajeshbabu has > released Omid 1.1.1. > Thank you! > > At the moment, the following outstanding issues are on my radar: > > https://issues.apache.org/jira/browse/PHOENIX-7191 > https://issues.apache.org/jira/browse/PHOENIX-7193 > > These are bugs in my non-ZK registry implementation, which were found > during HBase 3 work. > I have some PRs up, but they may not be complete. I will push for reviews > once I have the HBase 3 tests passing, and possibly updated them based on > that. > > We also have a number of very flakey tests, see: > > > https://ci-hadoop.apache.org/job/Phoenix/job/Phoenix-mulitbranch/job/master/test_results_analyzer/ > > > > On Tue, Jan 30, 2024 at 7:09 AM Istvan Toth <st...@cloudera.com> wrote: > >> As Viraj wrote, those are just plans. >> If HBase 3 won't be released by the time the other features are ready, >> then it won't make it into 5.3. >> If other major features are ready by that time, then they will be >> included. (though we are not aware of any now) >> >> As for the new major version, in the past Phoenix didn't have a >> compatibility module system, >> so a new branch was required, which didn't support older HBases. Also, >> the API changes between HBase 1.x and 2.x were much larger, >> The HBase 2 and 3 API are pretty similar, apart from the removal of >> deprecated 1.x APIs. (and the protobuf/protocol thing, which requires a >> rather ugly hack). >> >> I will start the discussion on how we can add HBase 3 support as soon as >> I have a working POC patch. >> >> We could call 5.3 6.0 instead, after all Phoenix isn't using a strict >> semantic versioning, but then 6.0 would also support HBase 2. >> If we do not come to a consensus on the version name, we can always have >> a vote on it. >> >> I think that the main motivation is that the community wants to maintain >> as few branches as possible. >> >> Istvan >> >> On Mon, Jan 29, 2024 at 10:02 PM Stephen Jiang <syuanjiang...@gmail.com> >> wrote: >> >>> I am not sure how close HBase 3.0 is. Even if it is only less than one >>> year away, the adoption would be low at the beginning. I don't think 5.3 >>> should wait for that. And traditionally, Phoenix would have a major >>> release to support the HBase major release (4.x for HBase 1.x and 5.x for >>> HBase 2.x), in this case, we are talking about Phoenix 6.0 for HBase 3.0. >>> >>> Maybe we should adopt the HBase release model: master branch for next >>> major >>> release (6.0) and branch-5.x branch for next 5.x minor release and >>> branch-5.2 for 5.2 minor release. >>> >>> Thanks >>> Stephen >>> >>> >>> >>> On Thu, Jan 25, 2024 at 11:50 PM Viraj Jasani <vjas...@apache.org> >>> wrote: >>> >>> > Sounds good. >>> > >>> > Planned major changes for 5.3.0: >>> > >>> > 1. JSON support. >>> > 2. HBase 3.0 support. >>> > 3. CDC feature (leveraging uncovered global index framework and JSON >>> > support). >>> > >>> > >>> > On Wed, Jan 24, 2024 at 8:22 PM Kadir Ozdemir >>> > <kozde...@salesforce.com.invalid> wrote: >>> > >>> > > I suggest including another major change for 5.3, Phoenix CDC, >>> > > https://issues.apache.org/jira/browse/PHOENIX-7001. The PR for it >>> will >>> > be >>> > > posted soon. >>> > > >>> > > On Thu, Jan 25, 2024 at 5:35 AM Viraj Jasani <vjas...@apache.org> >>> wrote: >>> > > >>> > > > Thanks Istvan! I agree with your points. It’s really been a while >>> we >>> > are >>> > > > talking about releasing 5.2.0 and yet due to bandwidth issues, >>> unable >>> > to >>> > > do >>> > > > so. >>> > > > >>> > > > I agree to getting these fixes out, cut 5.2 and start the release >>> work >>> > > and >>> > > > in the meantime I also need to prepare 5.1 backport. >>> > > > >>> > > > For 5.3, let's plan HBase 3.0 support and JSON as major changes. >>> > > > >>> > > > >>> > > > On Mon, Jan 22, 2024 at 9:17 PM Istvan Toth <st...@apache.org> >>> wrote: >>> > > > >>> > > > > Hi! >>> > > > > >>> > > > > In my opinion cutting 5.2 now only makes sense IF we DO NOT plan >>> to >>> > > > release >>> > > > > the outstanding big features (like JSON) in 5.2. , otherwise it's >>> > just >>> > > > > extra work to maintain more branches. >>> > > > > >>> > > > > Having said that, releasing a 5.2 and 5.1.4 with the data >>> integrity >>> > > fixes >>> > > > > real soon, and then releasing 5.3 in a few months with JSON, and >>> any >>> > > > other >>> > > > > outstanding big features >>> > > > > that are close to being finished (and HBase 3.0 support, if it's >>> > ready >>> > > by >>> > > > > then) would not be a bad idea. >>> > > > > >>> > > > > On the CLDR side the only outstanding big feature which could >>> impact >>> > > > > Viraj's integrity work is HBase 3.0 support, and even that is >>> only >>> > > > because >>> > > > > it may require some larger refactors of existing code, not >>> because it >>> > > > would >>> > > > > change the actual behaviour or algorithms. >>> > > > > >>> > > > > Phoenix used to have several minor releases per year, the current >>> > state >>> > > > of >>> > > > > extreme longevity of 5.1 and several big new features being >>> added to >>> > it >>> > > > > (like uncovered indexes) is not ideal. >>> > > > > Releasing 5.2 and 5.3 relatively close together could be a >>> return to >>> > a >>> > > > > quicker cadence for minor releases, which could also help with >>> the >>> > > public >>> > > > > image and adoption of Phoenix. >>> > > > > >>> > > > > We were talking about releasing 5.2 at least a year ago, and I >>> have >>> > > > started >>> > > > > working on that then, but then emergencies have come up at >>> $dayjob, >>> > > and I >>> > > > > could not see that through. >>> > > > > (So I am in part responsible for the lack of minor releases) >>> > > > > >>> > > > > regards >>> > > > > Istvan >>> > > > > >>> > > > > >>> > > > > On Mon, Jan 22, 2024 at 11:35 PM Viraj Jasani < >>> vjas...@apache.org> >>> > > > wrote: >>> > > > > >>> > > > > > Sorry for the late reply. >>> > > > > > >>> > > > > > > Do you think cutting 5.2.0 now makes sense? >>> > > > > > >>> > > > > > No problem with that. I can cut 5.2 branch by the end of this >>> week >>> > or >>> > > > at >>> > > > > > the start of next week. >>> > > > > > >>> > > > > > If there is any very big change or feature ready for merge to >>> > master >>> > > > > branch >>> > > > > > with PR approvals already in place, please do let me know so >>> that I >>> > > can >>> > > > > > help collaborate on how best we can get it merged without >>> impacting >>> > > 5.2 >>> > > > > > release if required. My main motivation was for any big change >>> to >>> > go >>> > > > > > through newly introduced tests so that we know that anything >>> > > additional >>> > > > > is >>> > > > > > not broken, and also to prioritize for upcoming 5.2.0 and 5.1.4 >>> > > > releases. >>> > > > > > Moreover, there are several PRs getting merged on the master >>> > branch, >>> > > we >>> > > > > can >>> > > > > > continue that as long as they are not very big changes, which >>> might >>> > > > > require >>> > > > > > significant time to understand any correlation with data >>> integrity >>> > > > > issues. >>> > > > > > >>> > > > > > The PR is also ready for review with some additional cases >>> fixed >>> > last >>> > > > > week: >>> > > > > > >>> > > > >>> > > >>> > >>> https://urldefense.com/v3/__https://github.com/apache/phoenix/pull/1736__;!!DCbAVzZNrAf4!D4OVjUp2EWW2BqhGnBxsapDX_AHsibRphIpoFBWfgRsd3dsAikrFLo6PGxdTzGbSXJJ2fJ0j9mcz3asXMXo$ >>> > > > > > >>> > > > > > Depending on the review bandwidth, I am hopeful we should be >>> good >>> > to >>> > > > land >>> > > > > > them sooner. >>> > > > > > >>> > > > > > >>> > > > > > On Wed, Jan 17, 2024 at 11:31 AM Rushabh Shah >>> > > > > > <rushabh.s...@salesforce.com.invalid> wrote: >>> > > > > > >>> > > > > > > Thank you Viraj for initiating this thread. >>> > > > > > > >>> > > > > > > > Given the critical nature of these issues, I would like to >>> > > propose >>> > > > > that >>> > > > > > > we >>> > > > > > > treat this as a high priority for the upcoming 5.2.0 >>> release, and >>> > > not >>> > > > > > > include any other feature or big change to master branch >>> until we >>> > > > merge >>> > > > > > > this. >>> > > > > > > >>> > > > > > > Do you think cutting 5.2.0 now makes sense? This will enable >>> > other >>> > > > > > > developers to merge features into master branch (5.3.0) and >>> you >>> > can >>> > > > > take >>> > > > > > > some more time to make sure we cover all the corner cases >>> for the >>> > > > data >>> > > > > > > integrity issues that you uncovered. >>> > > > > > > >>> > > > > > > >>> > > > > > > On Fri, Jan 5, 2024 at 6:38 PM Viraj Jasani < >>> vjas...@apache.org> >>> > > > > wrote: >>> > > > > > > >>> > > > > > > > Sounds good, thanks Rajeshbabu. I will try to get the >>> first PR >>> > > out >>> > > > > next >>> > > > > > > > week and while reviews happen in parallel, will try to get >>> 5.1 >>> > PR >>> > > > > soon. >>> > > > > > > > >>> > > > > > > > >>> > > > > > > > On Wed, Jan 3, 2024 at 8:49 PM rajeshb...@apache.org < >>> > > > > > > > chrajeshbab...@gmail.com> wrote: >>> > > > > > > > >>> > > > > > > > > Hi Viraj, >>> > > > > > > > > >>> > > > > > > > > Would be better to include the changes in 5.1.4 as in >>> any >>> > way >>> > > it >>> > > > > > will >>> > > > > > > > take >>> > > > > > > > > at least 3-4 days to complete the omid release. >>> > > > > > > > > >>> > > > > > > > > Thanks, >>> > > > > > > > > Rajeshbabu. >>> > > > > > > > > >>> > > > > > > > > >>> > > > > > > > > On Thu, Jan 4, 2024 at 5:06 AM Viraj Jasani < >>> > > vjas...@apache.org> >>> > > > > > > wrote: >>> > > > > > > > > >>> > > > > > > > > > Thank you Kadir and Geoffrey for your replies!! >>> > > > > > > > > > >>> > > > > > > > > > > How does this affect 5.1.4, which is also listed as >>> a Fix >>> > > > > Version >>> > > > > > > for >>> > > > > > > > > > > PHOENIX-7106? >>> > > > > > > > > > >>> > > > > > > > > > Yes, it also needs to be ported to 5.1. Once the >>> master PR >>> > is >>> > > > up >>> > > > > > for >>> > > > > > > > > final >>> > > > > > > > > > review, I would start working on the backport PR. >>> > > > > > > > > > We just need some more additional testing to ensure old >>> > > client >>> > > > > > (e.g. >>> > > > > > > > > 5.1.3) >>> > > > > > > > > > is compatible with the new server with the changes. >>> > > > > > > > > > >>> > > > > > > > > > Hence, yes it is now a blocker for upcoming 5.1.4 as >>> well >>> > > since >>> > > > > > 5.1.4 >>> > > > > > > > RC >>> > > > > > > > > > preparation is still pending (while Omid release is in >>> > > > progress). >>> > > > > > > > > > Otherwise if 5.1.4 was ready for release, I would have >>> > > proposed >>> > > > > > > > immediate >>> > > > > > > > > > 5.1.5 release to include the changes proposed with >>> > > > PHOENIX-7106. >>> > > > > > > > > > >>> > > > > > > > > > >>> > > > > > > > > > On Wed, Jan 3, 2024 at 3:08 PM Geoffrey Jacoby < >>> > > > > gjac...@apache.org >>> > > > > > > >>> > > > > > > > > wrote: >>> > > > > > > > > > >>> > > > > > > > > > > I agree that data integrity issues are a higher >>> priority >>> > > than >>> > > > > > > feature >>> > > > > > > > > > > development, so I also support the decision. The fact >>> > that >>> > > > > > several >>> > > > > > > of >>> > > > > > > > > the >>> > > > > > > > > > > major remaining 5.2 features are currently being >>> > developed >>> > > in >>> > > > > > > > > > long-running >>> > > > > > > > > > > feature branches also helps, as work can continue >>> there >>> > at >>> > > > the >>> > > > > > cost >>> > > > > > > > of >>> > > > > > > > > a >>> > > > > > > > > > > rebase later. >>> > > > > > > > > > > >>> > > > > > > > > > > How does this affect 5.1.4, which is also listed as >>> a Fix >>> > > > > Version >>> > > > > > > for >>> > > > > > > > > > > PHOENIX-7106? From the bug description it also sounds >>> > like >>> > > > > 5.1.3 >>> > > > > > > and >>> > > > > > > > > the >>> > > > > > > > > > > forthcoming .4 are affected, since we have >>> server-side >>> > > paging >>> > > > > in >>> > > > > > > 5.1. >>> > > > > > > > > > (Feel >>> > > > > > > > > > > free to move that to a separate thread if you feel it >>> > > should >>> > > > > be a >>> > > > > > > > > > separate >>> > > > > > > > > > > discussion.) Should this be a blocker for releasing >>> > 5.1.4? >>> > > > > > > > > > > >>> > > > > > > > > > > Geoffrey >>> > > > > > > > > > > >>> > > > > > > > > > > >>> > > > > > > > > > > On Wed, Jan 3, 2024 at 5:06 PM Kadir Ozdemir < >>> > > > > > > > > > > ka...@gsuite.cloud.apache.org> >>> > > > > > > > > > > wrote: >>> > > > > > > > > > > >>> > > > > > > > > > > > Being a database, Phoenix has to make sure that the >>> > data >>> > > > > stays >>> > > > > > on >>> > > > > > > > > disk >>> > > > > > > > > > > > intact and its queries return correct data. In this >>> > case, >>> > > > > > Phoenix >>> > > > > > > > > fails >>> > > > > > > > > > > to >>> > > > > > > > > > > > return correct data for some queries if their scans >>> > > > > experience >>> > > > > > > > region >>> > > > > > > > > > > > movement. Now that we know these data integrity >>> issues >>> > > and >>> > > > > how >>> > > > > > to >>> > > > > > > > > > > reproduce >>> > > > > > > > > > > > them, fixing them should be our first priority. >>> So, I >>> > > fully >>> > > > > > > support >>> > > > > > > > > > this >>> > > > > > > > > > > > proposal. >>> > > > > > > > > > > > >>> > > > > > > > > > > > On Wed, Jan 3, 2024 at 10:58 PM Viraj Jasani < >>> > > > > > vjas...@apache.org >>> > > > > > > > >>> > > > > > > > > > wrote: >>> > > > > > > > > > > > >>> > > > > > > > > > > > > Hello, >>> > > > > > > > > > > > > >>> > > > > > > > > > > > > I would like to bring PHOENIX-7106 >>> > > > > > > > > > > > > < >>> > > > > > > > >>> > > > > > > >>> > > > > > >>> > > > > >>> > > > >>> > > >>> > >>> https://urldefense.com/v3/__https://issues.apache.org/jira/browse/PHOENIX-7106__;!!DCbAVzZNrAf4!FZG5sv55IC1NqItQLY7GKWgUG2Do0gSta01gOiSdd36Dx3XHGtQx4M3c9visVXIt9DctPQzS-orob9vhzrCfVA$ >>> > > > > > > > > to everyone's >>> > > > > > > > > > > > > attention here and brief about the data integrity >>> > > issues >>> > > > > that >>> > > > > > > we >>> > > > > > > > > have >>> > > > > > > > > > > in >>> > > > > > > > > > > > > various coprocessors. Majority of the issues are >>> > > related >>> > > > to >>> > > > > > the >>> > > > > > > > > fact >>> > > > > > > > > > > that >>> > > > > > > > > > > > > we do not return valid rowkey for certain >>> queries. If >>> > > any >>> > > > > > > region >>> > > > > > > > > > moves >>> > > > > > > > > > > in >>> > > > > > > > > > > > > the middle of the scan, the HBase client relies >>> on >>> > the >>> > > > last >>> > > > > > > > > returned >>> > > > > > > > > > > > rowkey >>> > > > > > > > > > > > > and accordingly changes the scan boundaries >>> while the >>> > > > > scanner >>> > > > > > > is >>> > > > > > > > > > > getting >>> > > > > > > > > > > > > reset to continue the scan operation. If the >>> region >>> > > does >>> > > > > not >>> > > > > > > > move, >>> > > > > > > > > > scan >>> > > > > > > > > > > > is >>> > > > > > > > > > > > > not expected to return invalid data, however if >>> the >>> > > > region >>> > > > > > > moves >>> > > > > > > > in >>> > > > > > > > > > the >>> > > > > > > > > > > > > middle of ongoing scan operation, scan would >>> return >>> > > > > > > > > invalid/incorrect >>> > > > > > > > > > > > data >>> > > > > > > > > > > > > causing data integrity issues. >>> > > > > > > > > > > > > >>> > > > > > > > > > > > > Given the critical nature of these issues, I >>> would >>> > like >>> > > > to >>> > > > > > > > propose >>> > > > > > > > > > that >>> > > > > > > > > > > > we >>> > > > > > > > > > > > > treat this as a high priority for the upcoming >>> 5.2.0 >>> > > > > release, >>> > > > > > > and >>> > > > > > > > > not >>> > > > > > > > > > > > > include any other feature or big change to master >>> > > branch >>> > > > > > until >>> > > > > > > we >>> > > > > > > > > > merge >>> > > > > > > > > > > > > this. The PR is still not ready as additional >>> changes >>> > > are >>> > > > > > still >>> > > > > > > > in >>> > > > > > > > > my >>> > > > > > > > > > > > > local, requiring rebase with the current master. >>> > > > > > > > > > > > > >>> > > > > > > > > > > > > I would get back to this discuss thread as soon >>> as >>> > the >>> > > PR >>> > > > > and >>> > > > > > > the >>> > > > > > > > > doc >>> > > > > > > > > > > are >>> > > > > > > > > > > > > updated with the latest findings so far. The >>> changes >>> > > > > include >>> > > > > > > many >>> > > > > > > > > of >>> > > > > > > > > > > our >>> > > > > > > > > > > > > coproc scanner implementations and hence it would >>> > > require >>> > > > > > > > > significant >>> > > > > > > > > > > > > review as well. >>> > > > > > > > > > > > > It would be great if we can hold on to merging >>> any >>> > > > feature >>> > > > > or >>> > > > > > > big >>> > > > > > > > > > > change >>> > > > > > > > > > > > to >>> > > > > > > > > > > > > master branch until this gets in so as to not >>> > > complicate >>> > > > > > > > > > > > merging/rebasing. >>> > > > > > > > > > > > > Once this is merged to the master branch, I would >>> > like >>> > > to >>> > > > > cut >>> > > > > > > 5.2 >>> > > > > > > > > > > branch >>> > > > > > > > > > > > > from master and we can move forward with 5.2.0 >>> > release. >>> > > > > > > > > > > > > >>> > > > > > > > > > > > > Please let me know if this looks good or if you >>> have >>> > > any >>> > > > > > other >>> > > > > > > > high >>> > > > > > > > > > > > > priority work for 5.2.0. >>> > > > > > > > > > > > > >>> > > > > > > > > > > > >>> > > > > > > > > > > >>> > > > > > > > > > >>> > > > > > > > > >>> > > > > > > > >>> > > > > > > >>> > > > > > >>> > > > > >>> > > > >>> > > >>> > >>> >> >> >> -- >> *István Tóth* | Sr. Staff Software Engineer >> *Email*: st...@cloudera.com >> cloudera.com <https://www.cloudera.com> >> [image: Cloudera] <https://www.cloudera.com/> >> [image: Cloudera on Twitter] <https://twitter.com/cloudera> [image: >> Cloudera on Facebook] <https://www.facebook.com/cloudera> [image: >> Cloudera on LinkedIn] <https://www.linkedin.com/company/cloudera> >> ------------------------------ >> ------------------------------ >> >