Sorry for posting the vote late. I have now started a vote thread for
BP-60:
https://lists.apache.org/thread/oy0goq9b8bl4gs5gn281rmm6h64102pk

Wenbing

Enrico Olivelli <eolive...@gmail.com> wrote on Wed, Jan 11, 2023 at 16:10:

> Hello,
> someone merged the PR
> https://github.com/apache/bookkeeper/pull/3719
>
> this is a BP, so it needs a formal VOTE
>
> Please hold off on committing PRs related to open BPs.
>
> Enrico
>
> On Wed, Jan 11, 2023 at 08:43, steven lu
> <lushiji2...@gmail.com> wrote:
> >
> > +1
> > Thanks for your great work!
> >
> > Wenbing Shen <oliver.shen...@gmail.com> wrote on Mon, Dec 26, 2022 at 17:29:
> >
> > > Hi BookKeepers, I've changed the default value of limitStatsLogging
> > > from false to true:
> > > BP-60 <https://github.com/apache/bookkeeper/issues/3718>
> > >
> > > Motivation
> > >
> > > We run an efficient online bookie cluster with hundreds of bookie
> > > nodes deployed on SSD disks.
> > >
> > > We deploy the AutoRecovery cluster and the Bookie cluster separately
> > > so that each can be managed independently.
> > >
> > > I observed that our AutoRecovery cluster runs GC very frequently.
> > > After investigating, I found that limitStatsLogging is disabled by
> > > default in the BookKeeper client's PCBC, so a large number of
> > > per-channel monitoring metrics are generated. Because the bookie
> > > cluster has so many nodes, this metric data occupies a large amount
> > > of heap memory.
> > >
> > > A single StringWriter object occupies 16MB of memory, and nearly 70
> > > StringWriter objects are waiting to be collected by the next GC,
> > > occupying 1GB+ of heap memory.
> > >
> > > Proposal
> > >
> > > In my own use, I have not found these PCBC monitoring metrics
> > > useful, at least so far.
> > >
> > > If AutoRecovery and the Bookie run in the same process, these large
> > > objects will affect the performance and stability of the Bookie
> > > cluster.
> > >
> > > Since I can't see what these metrics are useful for by default, I
> > > suggest changing the default value of limitStatsLogging to true.
> > >
> > > Everyone can still choose to turn it on or off. However, with the
> > > current default it is hard for users to discover what effect this
> > > parameter has, so by the time their cluster has grown to hundreds or
> > > thousands of nodes and they notice the problem, they may need to do
> > > a rolling restart of hundreds to thousands of bookies.
> > >
> > > At the same time, I observed that in Pulsar the various BookKeeper
> > > client metrics are turned off by default because they noticeably
> > > affect the performance of the Pulsar service. This suggests that we
> > > should change the default as well, especially for the highly
> > > redundant metrics that are created per channel.
> > >
> > > Compatibility, Deprecation, and Migration Plan
> > >
> > > Clients that rely on PCBC metrics need to pay attention to this
> > > upgrade. The change does not affect the client's actual
> > > functionality, only the metrics data, and users can choose to
> > > enable the metrics again.
> > >
> > >
> > > What do you think about it?
> > >
> > > Best.
> > > Wenbing
> > >
>
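
As a rough check of the heap figures quoted in the proposal, the sketch
below multiplies the reported object count by the reported per-object
size. The class and method names are hypothetical and are not part of
BookKeeper. It assumes that "16MB" means 16 MiB per StringWriter and
that exactly 70 objects are live, whereas the message says "nearly 70".

```java
import java.util.Locale;

// Hypothetical helper for illustration only; not part of BookKeeper.
final class HeapEstimate {
    // Total size in MiB of `count` buffers of `sizeMiB` MiB each.
    static long totalMiB(long count, long sizeMiB) {
        return Math.multiplyExact(count, sizeMiB);
    }

    // Convert MiB to GiB (1 GiB = 1024 MiB).
    static double toGiB(long mib) {
        return mib / 1024.0;
    }

    public static void main(String[] args) {
        // Assumed inputs taken from the proposal: 70 objects, 16 MiB each.
        long total = totalMiB(70, 16);
        System.out.printf(Locale.ROOT, "%d MiB (about %.2f GiB)%n",
                total, toGiB(total));
    }
}
```

Under these assumptions the total is 70 x 16 = 1120 MiB, a little over
1 GiB, which is consistent with the "1GB+" figure in the proposal.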
