Hello, Are there any plans to release an accumulo 2.1 patch to fix this issue?
Thanks, On Sat, Dec 17, 2022 at 7:36 PM Vincent Russell <vincent.russ...@gmail.com> wrote: > No worries. I'm glad I was able to help in some small way. > > Thank you for fixing the issue. > > On Sat, Dec 17, 2022 at 11:49 AM Christopher Shannon < > christopher.l.shan...@gmail.com> wrote: > >> I created https://github.com/apache/accumulo/pull/3134 and that should >> fix >> this issue. >> >> Thanks Vincent for pointing out the issue in TServerUtils, your testing >> made this easy to track down and fix. >> >> >> On Sat, Dec 17, 2022 at 9:00 AM Christopher Shannon < >> christopher.l.shan...@gmail.com> wrote: >> >> > I was able to reproduce the issue by setting the value size for a >> mutation >> > to size 16384001 to make sure it's greater than the default value for >> > Thrift and it fails immediately. I will work on a fix now that we know >> how >> > to reproduce it. >> > >> > On Fri, Dec 16, 2022 at 2:31 PM Christopher <ctubb...@apache.org> >> wrote: >> > >> >> I don't think it's intentional. This might be the source of the >> problem. >> >> >> >> On Thu, Dec 15, 2022 at 3:39 PM Vincent Russell >> >> <vincent.russ...@gmail.com> wrote: >> >> > >> >> > Also in TserverUtils:270, when the TNonblockingServerSocket is >> created >> >> it >> >> > looks like it ends up using the default frame size. I am not sure if >> >> this >> >> > is intentional or not. >> >> > >> >> > On Thu, Dec 15, 2022 at 3:26 PM Vincent Russell < >> >> vincent.russ...@gmail.com> >> >> > wrote: >> >> > >> >> > > Christopher, >> >> > > >> >> > > I am not sure if this issue is related to 3042 or not. >> >> > > >> >> > > On the client side it does look like TConfiguration ends up being >> >> called >> >> > > with the default constructor. I am not sure if this is intentional >> >> or not. >> >> > > >> >> > > On the server side I see this stack, so it also looks like: >> >> > > >> >> > > at org.apache.thrift.TConfiguration.<init>(TConfiguration.java:36) >> >> > > at >> >> org.apache.thrift.TConfiguration$Builder.build(TConfiguration.java:99) >> >> > > at >> org.apache.thrift.TConfiguration.<clinit>(TConfiguration.java:65) >> >> > > at >> >> > > >> >> >> org.apache.thrift.transport.TNonblockingSocket.<init>(TNonblockingSocket.java:74) >> >> > > at >> >> > > >> >> >> org.apache.thrift.transport.TNonblockingSocket.<init>(TNonblockingSocket.java:68) >> >> > > at >> >> > > >> >> >> org.apache.thrift.transport.TNonblockingServerSocket.accept(TNonblockingServerSocket.java:135) >> >> > > at >> >> > > >> >> >> org.apache.thrift.transport.TNonblockingServerSocket.accept(TNonblockingServerSocket.java:36) >> >> > > at >> >> > > >> >> >> org.apache.thrift.server.TNonblockingServer$SelectAcceptThread.handleAccept(TNonblockingServer.java:218) >> >> > > at >> >> > > >> >> >> org.apache.thrift.server.TNonblockingServer$SelectAcceptThread.select(TNonblockingServer.java:186) >> >> > > at >> >> > > >> >> >> org.apache.thrift.server.TNonblockingServer$SelectAcceptThread.run(TNonblockingServer.java:142) >> >> > > >> >> > > >> >> > > I see this in the server log so it does look like it should be >> using >> >> 1G: >> >> > > 2022-09-01 16:59:41 INFO >> [org.apache.accumulo.tserver.TabletServer] >> >> > > ServerUtil:124 - general.server.message.size.max = 1G >> >> > > >> >> > > Thanks, >> >> > > Vincent >> >> > > >> >> > > On Thu, Dec 15, 2022 at 12:26 PM Vincent Russell < >> >> > > vincent.russ...@gmail.com> wrote: >> >> > > >> >> > >> >> >> > >> >> >> > >> I had to make a stack trace with hacking together a remote debug >> >> instance: >> >> > >> >> >> > >> at >> >> > >> >> >> >> org.apache.thrift.server.AbstractNonblockingServer$FrameBuffer.read(AbstractNonblockingServer.java:334) >> >> > >> at >> >> > >> >> >> >> org.apache.accumulo.server.rpc.CustomNonBlockingServer$CustomFrameBuffer.read(CustomNonBlockingServer.java:134) >> >> > >> at >> >> > >> >> >> >> org.apache.thrift.server.AbstractNonblockingServer$AbstractSelectThread.handleRead(AbstractNonblockingServer.java:187) >> >> > >> at >> >> > >> >> >> >> org.apache.thrift.server.TNonblockingServer$SelectAcceptThread.select(TNonblockingServer.java:189) >> >> > >> at >> >> > >> >> >> >> org.apache.thrift.server.TNonblockingServer$SelectAcceptThread.run(TNonblockingServer.java:142) >> >> > >> >> >> > >> On Thu, Dec 15, 2022 at 12:52 AM Christopher <ctubb...@apache.org >> > >> >> wrote: >> >> > >> >> >> > >>> From the numbers in the message, it looks like you're sending an >> >> 18MB >> >> > >>> payload but something in Thrift is limiting things to 16384000 >> >> > >>> (16000KB). I doubt you've overridden the default >> >> > >>> general.server.message.size.max to be anything that low (the >> default >> >> > >>> is 1G). Unless you're flushing after every mutation, it would >> not be >> >> > >>> surprising to exceed the 16MB max frame size indicated in the >> error >> >> > >>> message quite quickly. >> >> > >>> >> >> > >>> This value of 16384000 seemed weird. It looks like it's not using >> >> our >> >> > >>> configuration, but using the built-in default value of >> >> > >>> org.apache.thrift.TConfiguration.DEFAULT_MAX_FRAME_SIZE. It looks >> >> like >> >> > >>> this can happen whenever `new TConfiguration()` is called without >> >> > >>> parameters... and there's a fair amount of internal code, mostly >> in >> >> > >>> libthrift itself, that does that. It's a bit tricky to track down >> >> the >> >> > >>> one causing this particular issue. If you have a full stack >> trace, >> >> it >> >> > >>> could help. >> >> > >>> >> >> > >>> Also, this might be the same issue seen reported in >> >> > >>> https://github.com/apache/accumulo/issues/3042 >> >> > >>> >> >> > >>> On Wed, Dec 14, 2022 at 8:53 PM Vincent Russell >> >> > >>> <vincent.russ...@gmail.com> wrote: >> >> > >>> > >> >> > >>> > I was able to work out all of my compilation issues; however >> when >> >> I >> >> > >>> run an >> >> > >>> > integration test with the Mini Accumulo Cluster that tests >> writing >> >> > >>> > mutations with values of 5mb the flush hangs forever >> >> > >>> > and I see the following logs in the TabletServer logs: >> >> > >>> > >> >> > >>> > 20:41:02.306 [Thread-7] ERROR >> >> > >>> > o.a.a.s.r.CustomNonBlockingServer$CustomFrameBuffer - Read a >> frame >> >> > >>> size of >> >> > >>> > 18874697, which is bigger than the maximum allowable frame size >> >> > >>> 16384000 >> >> > >>> > for ALL connections. >> >> > >>> > 20:41:03.582 [Thread-7] ERROR >> >> > >>> > o.a.a.s.r.CustomNonBlockingServer$CustomFrameBuffer - Read a >> frame >> >> > >>> size of >> >> > >>> > 18874697, which is bigger than the maximum allowable frame size >> >> > >>> 16384000 >> >> > >>> > for ALL connections. >> >> > >>> > 20:41:05.079 [Thread-7] ERROR >> >> > >>> > o.a.a.s.r.CustomNonBlockingServer$CustomFrameBuffer - Read a >> frame >> >> > >>> size of >> >> > >>> > 18874697, which is bigger than the maximum allowable frame size >> >> > >>> 16384000 >> >> > >>> > for ALL connections. >> >> > >>> > >> >> > >>> > Other tests that write smaller amounts of data appear to work >> >> fine. >> >> > >>> > >> >> > >>> > Any idea what the issue might be? >> >> > >>> > >> >> > >>> > Thank you, >> >> > >>> > Vincent >> >> > >>> > >> >> > >>> > On Tue, Dec 13, 2022 at 4:43 PM Vincent Russell < >> >> > >>> vincent.russ...@gmail.com> >> >> > >>> > wrote: >> >> > >>> > >> >> > >>> > > Thank you both for your responses. >> >> > >>> > > >> >> > >>> > > We are using an event store library from a sister project >> that >> >> was >> >> > >>> written >> >> > >>> > > for accumulo 1.10., which I have already upgraded to 2.0. >> >> > >>> > > >> >> > >>> > > I'll spend some time investigating how bad the usage of the >> >> internal >> >> > >>> > > packages are and get back to you. >> >> > >>> > > >> >> > >>> > > Thanks again, >> >> > >>> > > >> >> > >>> > > On Tue, Dec 13, 2022 at 3:20 PM Christopher < >> >> ctubb...@apache.org> >> >> > >>> wrote: >> >> > >>> > > >> >> > >>> > >> To add to Dave's answer, the public API is defined at >> >> > >>> > >> https://accumulo.apache.org/api/ >> >> > >>> > >> Anything else is not public and is subject to change without >> >> notice >> >> > >>> on >> >> > >>> > >> any release without any attempt to retain compatibility. >> >> > >>> > >> >> >> > >>> > >> On Tue, Dec 13, 2022 at 3:10 PM Dave Marion < >> >> dmario...@gmail.com> >> >> > >>> wrote: >> >> > >>> > >> > >> >> > >>> > >> > There is no guide. You are using implementation classes >> (see >> >> > >>> clientImpl >> >> > >>> > >> in >> >> > >>> > >> > the package name) vs. using the client api. If you can use >> >> the >> >> > >>> client >> >> > >>> > >> api >> >> > >>> > >> > directly, then this should insulate you from changes in >> the >> >> future >> >> > >>> > >> (except >> >> > >>> > >> > during major versions). We can try and find where things >> >> might >> >> > >>> have >> >> > >>> > >> moved, >> >> > >>> > >> > but a class may have been split into multiple pieces. If >> you >> >> could >> >> > >>> > >> provide >> >> > >>> > >> > class and method, that would be easier. >> >> > >>> > >> > >> >> > >>> > >> > On Tue, Dec 13, 2022 at 2:45 PM Vincent Russell < >> >> > >>> > >> vincent.russ...@gmail.com> >> >> > >>> > >> > wrote: >> >> > >>> > >> > >> >> > >>> > >> > > Is there a guide that shows where classes may have been >> >> moved >> >> > >>> with >> >> > >>> > >> moving >> >> > >>> > >> > > from 2.0 to 2.1? For instance, I am having issues >> >> compiling, >> >> > >>> because >> >> > >>> > >> the >> >> > >>> > >> > > following class doesn't exist: >> >> > >>> > >> > > import org.apache.accumulo.core.clientImpl.Tables; >> >> > >>> > >> > > >> >> > >>> > >> > > I'm just getting started so I'm sure there are others. >> >> > >>> > >> > > >> >> > >>> > >> > > Thanks, >> >> > >>> > >> > > Vincent >> >> > >>> > >> > > >> >> > >>> > >> > > >> >> > >>> > >> > > >> >> > >>> > >> > > On Fri, Dec 9, 2022 at 9:02 AM Vincent Russell < >> >> > >>> > >> vincent.russ...@gmail.com> >> >> > >>> > >> > > wrote: >> >> > >>> > >> > > >> >> > >>> > >> > > > I mean Christopher. >> >> > >>> > >> > > > >> >> > >>> > >> > > > Thanks again. >> >> > >>> > >> > > > >> >> > >>> > >> > > > On Fri, Dec 9, 2022 at 9:01 AM Vincent Russell < >> >> > >>> > >> > > vincent.russ...@gmail.com> >> >> > >>> > >> > > > wrote: >> >> > >>> > >> > > > >> >> > >>> > >> > > >> Thank you Chris. >> >> > >>> > >> > > >> >> >> > >>> > >> > > >> Will will upgrade to Accumulo 2.1 and ZooKeeper 3.7 >> or >> >> > >>> later as >> >> > >>> > >> soon as >> >> > >>> > >> > > >> possible. >> >> > >>> > >> > > >> >> >> > >>> > >> > > >> On Thu, Dec 8, 2022 at 8:44 PM Christopher < >> >> > >>> ctubb...@apache.org> >> >> > >>> > >> wrote: >> >> > >>> > >> > > >> >> >> > >>> > >> > > >>> Hi Vincent, >> >> > >>> > >> > > >>> >> >> > >>> > >> > > >>> Version 2.0.1 is end of life as of the 2.1.0 LTM >> >> release, >> >> > >>> and 2.0 >> >> > >>> > >> is >> >> > >>> > >> > > >>> not expected to receive any further updates. Version >> >> 2.1.0 >> >> > >>> may >> >> > >>> > >> work >> >> > >>> > >> > > >>> with ZooKeeper 3.4, but was developed and tested >> >> against >> >> > >>> 3.5 and >> >> > >>> > >> later >> >> > >>> > >> > > >>> versions. I believe the ZooKeeper community is >> >> currently >> >> > >>> > >> considering >> >> > >>> > >> > > >>> whether to make 3.6 end-of-life themselves, so I >> would >> >> > >>> recommend >> >> > >>> > >> using >> >> > >>> > >> > > >>> Accumulo 2.1.0 with the latest ZooKeeper 3.7 or >> later >> >> to >> >> > >>> have the >> >> > >>> > >> best >> >> > >>> > >> > > >>> chance of any kind of support, including JDK 17 >> >> support. >> >> > >>> > >> > > >>> >> >> > >>> > >> > > >>> As for your specific issues: >> >> > >>> > >> > > >>> >> >> > >>> > >> > > >>> 1. This is already fixed in 2.1.0 >> >> > >>> > >> > > >>> 2/3. These issues are likely fixed in newer >> ZooKeeper >> >> > >>> versions. I >> >> > >>> > >> > > >>> haven't seen them anytime recently, anyway. Bugs in >> >> > >>> ZooKeeper >> >> > >>> > >> itself >> >> > >>> > >> > > >>> are out of scope for the Accumulo developers, but I >> >> have >> >> > >>> tried >> >> > >>> > >> > > >>> building Accumulo 2.1.0 with JDK 17 and ZooKeeper >> >> 3.8.0 and >> >> > >>> > >> haven't >> >> > >>> > >> > > >>> observed any unresolved issues. However, it's >> >> difficult to >> >> > >>> > >> actually >> >> > >>> > >> > > >>> run it because I don't think Hadoop has good JDK 17 >> >> support >> >> > >>> yet. >> >> > >>> > >> So, >> >> > >>> > >> > > >>> MiniAccumuloCluster seems to work with JDK 17, as >> does >> >> > >>> Accumulo >> >> > >>> > >> and ZK >> >> > >>> > >> > > >>> 3.8, but I don't think a full Hadoop cluster would >> >> (yet). >> >> > >>> > >> > > >>> >> >> > >>> > >> > > >>> On Thu, Dec 8, 2022 at 12:28 PM Vincent Russell >> >> > >>> > >> > > >>> <vincent.russ...@gmail.com> wrote: >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > Hello, >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > We are currently using accumulo 2.0.1. >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > We are in the process of upgrading our source code >> >> to use >> >> > >>> jdk 17 >> >> > >>> > >> > > >>> however we >> >> > >>> > >> > > >>> > are running into some problems with our tests and >> the >> >> > >>> > >> > > >>> MiniAccumuloCluster. >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > One of our developer encountered the following >> >> issues: >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > 1. The MiniAccumumluoClusterImpl._exec is >> >> hardcoded >> >> > >>> with the >> >> > >>> > >> JVM >> >> > >>> > >> > > arg >> >> > >>> > >> > > >>> > -XX:+IUseConcMarkSweepGC, which is no longer >> >> tolerated >> >> > >>> with >> >> > >>> > >> JDK17. >> >> > >>> > >> > > >>> > 2. In Zookeeper 3.4.14, ConetStringParser uses >> >> > >>> > >> createUnresolved to >> >> > >>> > >> > > >>> > make IPAddresses. >> >> > >>> > >> > > >>> >> SaslServerPrincipal.WrapperInetSocketAddress.getAddress >> >> > >>> > >> > > >>> > uses InetSocketAddess.getAddress, which returns >> >> null >> >> > >>> because >> >> > >>> > >> it's >> >> > >>> > >> > > >>> not >> >> > >>> > >> > > >>> > resolved, resulting in a failure to connect to >> the >> >> > >>> > >> newly-started >> >> > >>> > >> > > >>> zookeeper. >> >> > >>> > >> > > >>> > 3. StaticHostProvider.getHostString() tries to >> >> extract >> >> > >>> he >> >> > >>> > >> hostname >> >> > >>> > >> > > >>> by >> >> > >>> > >> > > >>> > calling toString on the address and taking >> >> everything >> >> > >>> before >> >> > >>> > >> the >> >> > >>> > >> > > >>> colon, but >> >> > >>> > >> > > >>> > in JDK17, the string format changed to >> >> > >>> > >> > > "localhost/<unresolved->:xx" >> >> > >>> > >> > > >>> (where >> >> > >>> > >> > > >>> > XX is still the port number). That's incorrect >> >> and it >> >> > >>> can't >> >> > >>> > >> > > >>> resolve the >> >> > >>> > >> > > >>> > names. >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > Has anyone come across/resolved these kinds of >> >> issues? >> >> > >>> Is it >> >> > >>> > >> not >> >> > >>> > >> > > >>> possible >> >> > >>> > >> > > >>> > to use java17 from a client perspective? Will >> >> upgrading >> >> > >>> to >> >> > >>> > >> accumulo >> >> > >>> > >> > > >>> 2.1 >> >> > >>> > >> > > >>> > help? >> >> > >>> > >> > > >>> > >> >> > >>> > >> > > >>> > Thanks, >> >> > >>> > >> > > >>> > Vincent >> >> > >>> > >> > > >>> >> >> > >>> > >> > > >> >> >> > >>> > >> > > >> >> > >>> > >> >> >> > >>> > > >> >> > >>> >> >> > >> >> >> >> > >> >