To clarify, I mean the historical profile that is persisted in ZK.

On Thu, Mar 17, 2022 at 11:54 AM Joe F <joefranc...@gmail.com> wrote:

> IIRC, there is a historical load profile for topics that feeds into
> decisions by the load balancer.
>
> What happens during a cluster startup with this new proposal?
>
> On Thu, Mar 17, 2022 at 7:50 AM PengHui Li <peng...@apache.org> wrote:
>
>> > But which brokers will own that topic?
>> > In a Pulsar cluster with a high level of tenant isolation, we must
>> > ensure that:
>> > - at least one broker is allowed to own the topic
>> > - brokers dedicated to tenants do not own the topic
>> > With the current approach the data is on ZooKeeper, and this is shared
>> > among all the brokers.
>>
>> We have the "pulsar/system" namespace, which can be used to maintain
>> system topics. If users consider broker isolation, it's all transparent.
>>
>> Using a topic, we can also share the data among all brokers.
>> Any broker that wants a copy of the data only needs to create a reader
>> when it starts.
>> And we have introduced table view, which will make it easier to cache
>> the load data and update the load cache.
>>
>> > Another point:
>> > will users be allowed to produce/consume this topic? How do we deal
>> > with permissions?
>>
>> Good point. We should not allow users' producers/consumers; only
>> the super user should be able to access the system topic.
>>
>> Thanks,
>> Penghui
>>
>> On Thu, Mar 17, 2022 at 10:08 PM Enrico Olivelli <eolive...@gmail.com>
>> wrote:
>>
>> > On Thu, Mar 17, 2022 at 02:42, PengHui Li
>> > <peng...@apache.org> wrote:
>> > >
>> > > > we do not know
>> > > > anything about the availability of the owner of the topic.
>> > >
>> > > If the owner broker is not available, other brokers will take over.
>> > >
>> > > > We could make it simpler and when a broker wants to push its data,
>> > > > it looks up the REST address of the "leader broker" and then pushes
>> > > > the data to it, I mean, without involving a "topic"
>> > >
>> > > Any broker may become the leader broker. In this case, the brokers
>> > > need to know the addresses of all the brokers in the cluster. With
>> > > the topic approach, they only need to know the topic name.
>> >
>> > I thought about this a little more.
>> > Using a non-persistent topic makes sense. So I am closer to being
>> > convinced about this move.
>> >
>> > But which brokers will own that topic?
>> > In a Pulsar cluster with a high level of tenant isolation, we must
>> > ensure that:
>> > - at least one broker is allowed to own the topic
>> > - brokers dedicated to tenants do not own the topic
>> > With the current approach the data is on ZooKeeper, and this is shared
>> > among all the brokers.
>> >
>> > Another point:
>> > will users be allowed to produce/consume this topic? How do we deal
>> > with permissions?
>> >
>> > Enrico
>> >
>> > >
>> > > Penghui
>> > >
>> > > On Thu, Mar 17, 2022 at 12:35 AM Enrico Olivelli
>> > > <eolive...@gmail.com> wrote:
>> > >
>> > > > But in order to read from a topic you need a broker that is the
>> > > > owner of the special "temporary topic".
>> > > >
>> > > > While the metadata service (ZooKeeper) is already a central point
>> > > > and it is meant to be available (otherwise Pulsar doesn't work),
>> > > > we do not know anything about the availability of the owner of
>> > > > the topic.
>> > > >
>> > > > Or do you mean to create a special topic that is always owned by
>> > > > the "leader broker"?
>> > > >
>> > > > We could make it simpler and when a broker wants to push its data,
>> > > > it looks up the REST address of the "leader broker" and then pushes
>> > > > the data to it, I mean, without involving a "topic".
>> > > >
>> > > > Enrico
>> > > >
>> > > > On Wed, Mar 16, 2022, 12:55 PengHui Li <peng...@apache.org> wrote:
>> > > >
>> > > > > +1
>> > > > >
>> > > > > The load data doesn't need to be persisted to the storage layer,
>> > > > > so using a non-persistent topic is more efficient.
>> > > > >
>> > > > > Thanks,
>> > > > > Penghui
>> > > > >
>> > > > > On Wed, Mar 16, 2022 at 2:14 PM Kai Wang
>> > > > > <kw...@streamnative.io.invalid> wrote:
>> > > > >
>> > > > > > Hi Pulsar Community,
>> > > > > >
>> > > > > > Currently, the Pulsar LoadManager uses ZooKeeper to store the
>> > > > > > local broker data: the LoadReportUpdaterTask reports the local
>> > > > > > load data to ZooKeeper, and the leader broker collects the load
>> > > > > > data and stores it in ZooKeeper.
>> > > > > >
>> > > > > > When we have a lot of brokers and bundles, this load data puts
>> > > > > > some pressure on ZooKeeper.
>> > > > > >
>> > > > > > Since the load data is not strongly consistent, we can use
>> > > > > > non-persistent topics to sync the load data. This will reduce
>> > > > > > our dependence on ZooKeeper.
>> > > > > >
>> > > > > > If this proposal is acceptable, I will draft a PIP.
>> > > > > >
>> > > > > > Any suggestions are appreciated.
>> > > > > >
>> > > > > > Thanks,
>> > > > > > Kai