[GitHub] [pulsar-test-infra] nodece closed issue #67: Simplify invocation of docbot action

2022-08-24 Thread GitBox


nodece closed issue #67: Simplify invocation of docbot action
URL: https://github.com/apache/pulsar-test-infra/issues/67


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] nodece merged pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


nodece merged PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread PengHui Li
Hi Tison,

> But any proposer can take their proposal number once proposed and a
committer clicks the merge button. Still, I'm concerned about adding such a
step to exclusively allocate a proposal number.

Here is the previous discussion
https://lists.apache.org/thread/vw25onjyo18trfysd7x9k9wrs2tyqfok
about this part. Without the request, approval steps for the PIP number.
I didn't think of any other better way to help us manage the proposal
number better.
The proposal number should be solemn and not very casual.

> I would suggest you collect the suggestions in this discussion thread

Yes, I agree. This is exactly what I'm currently doing.
Sorry, the PR https://github.com/apache/pulsar/pull/17270 introduced the
confusion.
Have this one just for that we are able to review detailed changes based on
the old one.
So I pushed the PR.

Best,
Penghui

On Thu, Aug 25, 2022 at 2:22 PM PengHui Li  wrote:

> Hi all,
>
> I have pushed out a PR https://github.com/apache/pulsar/pull/17270
> which wants to copy the PIP and proposal template to the codebase.
>
> The PR doesn't want to make some changes to the proposal process
> before we have approval. Since we will make some changes to the
> current PIP process, I want to copy the old one to the codebase so that
> all the subsequent updates will have historical records.
>
> Finally, the PIP process changes should be done with a new VOTE email.
> With https://github.com/apache/pulsar/pull/17270, we can have a PR to show
> all the details which can help the reviewer to easily review.
>
> Thanks,
> Penghui
>
>
> On Thu, Aug 25, 2022 at 12:57 PM tison  wrote:
>
>> Hi Penghui,
>>
>> > IMO, the final status of the proposal should be summarized in the
>> proposal.
>>
>> Agree. It's not exclusive. What I propose is that if you want to track
>> proposals in a project, track issues. But it seems your opinion is not to
>> do so. Then it's a nonissue. Flink FLIP tracks proposal status with the
>> proposal content and it works well. Although, I'd still highlight that
>> maintainer efforts are required to maintain the metadata.
>>
>> > As I know, if we have an issue tracking all the proposal status, not all
>> the contributors make contributions. Only the committers and the author?
>>
>> I don't propose to track the status _within_ the issue. But if you want to
>> track in a project, only committer have the privilege :)
>>
>> >  if the discussion of the motivation doesn't have objections and is
>> discussed enough.
>>
>> This is quite vague and hard to follow.
>>
>> Rust RFCs have a nonmandatory pre-RFC stage. I think we can encourage the
>> proposer to discuss on the mailing list or any channel with committers
>> first. But any proposer can take their proposal number once proposed and a
>> committer clicks the merge button. Still, I'm concerned about adding such
>> a
>> step to exclusively allocate a proposal number.
>>
>> I would suggest you collect the suggestions in this discussion thread, and
>> propose a process as shown in
>>
>> * https://github.com/rust-lang/rfcs#readme
>> * https://github.com/pingcap/tidb/tree/master/docs/design#readme
>>
>> So that we can have a vote. Instead of rushing into making several PRs to
>> move content fragmentedly.
>>
>> Best,
>> tison.
>>
>>
>> PengHui Li  于2022年8月25日周四 10:59写道:
>>
>> > Hi Tison,
>> >
>> > Thanks for providing more context.
>> >
>> > > Rust RFCs hold in a separate repo so that they can use the PR number
>> as
>> > the
>> > proposal number to prevent duplicates. It's a bit overhead for Pulsar to
>> > hold a separate repo. So I'd prefer to hold proposals in the same repo
>> > under the dev/proposals folder and resolve the PR number duplicates
>> issue
>> > as described above. Although, it can still be a bit awkward. But at
>> least
>> > we can quickly notice duplicate numbers and resolve them. Using the PR
>> > number as the proposal number keeps it unique, but the number will be
>> > discontinuous. Especially if we're proposing in the main repo, the
>> number
>> > will grow quickly.
>> >
>> > The PR number can provide unique numbers. Everyone can create PRs, maybe
>> > not related to PIP, but we are not able to revert it.
>> >
>> > > In this direction, we can associate a proposal with a tracking issue
>> and
>> > manipulate the proposal status represented by their tracking issue. It
>> > doesn't mean we keep the current issue discussion process. You can
>> regard
>> > it as an umbrella ticket for the proposal. See
>> > https://github.com/pingcap/tidb/issues/26022 as an example.
>> >
>> > > In this direction, we should always ensure that there're several eyes
>> on
>> > the project. Otherwise, we can easily end up with an out-of-sync
>> project.
>> >
>> > IMO, the final status of the proposal should be summarized in the
>> proposal.
>> > We can have some metadata for md file which allow us to have a page that
>> > can
>> > show all the proposals like the bookkeeper does
>> > https://bookkeeper.apache.org/community/bookkeeper-propo

Re: [DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread Enrico Olivelli
Thank you folks for your feedback

Il giorno gio 25 ago 2022 alle ore 05:56 Haiting Jiang
 ha scritto:
>
> Here is my idea of handing the read position shift if we want to go a bit
> further with this solution.
>
> Instead of only checking the exact match of entry range match
> (PendingReadKey in the PR),
> we can check if the requesting range is covered by several ongoing read
> requests, and merge
>  the result of these requests.
>
> For example, currently there are already inflight request reads of entries
> [10,20] and [20,30],
> and now comes a new request [15,25], it can just merge the result from
> [10,20] and [20,30],
> and avoids sending the request to BK.
>
> As for the complexity checking procedure, I think we can sort all the
> inflight requests by startEntryId
> and apply a binary search here, to reduce the time complexity to O(logN)
> where N is the
> max entry size in one ledger.

This is an interesting idea.
We are doing more testing to empirically understand how frequently we
have overlapping
ranges.
Because if it is not frequent to have such overlapping ranges then we
will waste resources.

Currently my explanation about this scenario is that groups of
subscriptions are close to each other
and nobody is able to leverage the entries cached by the others and
they try to read the same entries.

By default we have this default of 100 in dispatcherMaxReadBatchSize
and that limits the ranges to be up to 100 entries.

When there are no errors in reads and the consumers work well all the
subscriptions quickly reach the max value (100).
So if you are lucky they read the same ranges.
If you are not lucky, because they start from different positions then
this feature does not help.
By the way this means that it is likely that you have:
100...200,200...300,180...280
but not
100...200, 120...180
So it is unlikely that you see a range that is totally included in a
pending read.

a note about "merging":
the main problem is that we have to choose the "best way" of combining
what's going on and what you can do:
A) use parts of the pending reads (how many reads do you want to concatenate?)
B) add one (or two) new read to fill the missing entries and merge
C) start a new read from scratch

Also, if you go for B) and not for C) other new readers may miss a
chance to leverage your range

I will do further testing in order to understand if merging two
existing pending reads improves the situation.



Enrico

>
> Thanks,
> Haiting
>
> On Thu, Aug 25, 2022 at 11:40 AM Haiting Jiang 
> wrote:
>
> > +1 for this optimization for the specific scenario.
> >
> > This simple patch is really effective, in some tests it reduces a lot
> >> (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
> >> You may say that it is unlikely that the broker sends concurrent
> >> requests for exactly the same range of entries but empirically I have
> >> observed that it happens quite often, at least in the scenario I drew
> >> above.
> >
> >
> > This is quite an interesting test result. My guess is that all the
> > subscriptions
> > have exactly the same backlog and all the messages are consumed instantly
> > and successfully. So that they can read the same range of entries.
> > Once there are some consuming delays or retries due to failures, I believe
> > there will be read position shifts and the BK read requests will go up.
> >
> > Thanks,
> > Haiting
> >
> > On Thu, Aug 25, 2022 at 12:12 AM Michael Marshall 
> > wrote:
> >
> >> +1 I support merging the temporary solution.
> >>
> >> > Just one point, please make sure the change will not introduce too much
> >> > heap memory overhead.
> >>
> >> If we see that there is extra pressure, I wonder if we can dynamically
> >> determine when to deduplicate requests. A broker only needs this
> >> feature when a topic has multiple subscriptions. I haven't double
> >> checked if this information is easily accessible to the cache.
> >>
> >> One of the interesting benefits of this optimization is that it will
> >> decrease direct memory pressure. When we do not deduplicate reads, the
> >> same entry will be duplicated in the bookie's netty direct memory
> >> pool. With this design, an entry will be materialized once with many
> >> different ByteBuffer's objects pointing to it.
> >>
> >> Thanks,
> >> Michael
> >>
> >> On Wed, Aug 24, 2022 at 11:00 AM PengHui Li  wrote:
> >> >
> >> > +1 for we can have a short-term solution first.
> >> >
> >> > Just one point, please make sure the change will not introduce too much
> >> > heap memory overhead.
> >> >
> >> > Thanks,
> >> > Penghui
> >> >
> >> > On Wed, Aug 24, 2022 at 11:54 PM Enrico Olivelli 
> >> > wrote:
> >> >
> >> > > Hello folks,
> >> > > These days I am working on a use case in which I have many
> >> > > subscriptions on the same topic (~100).
> >> > > If you get to a situation with a big backlog, the consumers are no
> >> > > more able to leverage the broker cache then the broker starts to read
> >> > > from BookKeeper.
> >> > >

Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread PengHui Li
Hi all,

I have pushed out a PR https://github.com/apache/pulsar/pull/17270
which wants to copy the PIP and proposal template to the codebase.

The PR doesn't want to make some changes to the proposal process
before we have approval. Since we will make some changes to the
current PIP process, I want to copy the old one to the codebase so that
all the subsequent updates will have historical records.

Finally, the PIP process changes should be done with a new VOTE email.
With https://github.com/apache/pulsar/pull/17270, we can have a PR to show
all the details which can help the reviewer to easily review.

Thanks,
Penghui


On Thu, Aug 25, 2022 at 12:57 PM tison  wrote:

> Hi Penghui,
>
> > IMO, the final status of the proposal should be summarized in the
> proposal.
>
> Agree. It's not exclusive. What I propose is that if you want to track
> proposals in a project, track issues. But it seems your opinion is not to
> do so. Then it's a nonissue. Flink FLIP tracks proposal status with the
> proposal content and it works well. Although, I'd still highlight that
> maintainer efforts are required to maintain the metadata.
>
> > As I know, if we have an issue tracking all the proposal status, not all
> the contributors make contributions. Only the committers and the author?
>
> I don't propose to track the status _within_ the issue. But if you want to
> track in a project, only committer have the privilege :)
>
> >  if the discussion of the motivation doesn't have objections and is
> discussed enough.
>
> This is quite vague and hard to follow.
>
> Rust RFCs have a nonmandatory pre-RFC stage. I think we can encourage the
> proposer to discuss on the mailing list or any channel with committers
> first. But any proposer can take their proposal number once proposed and a
> committer clicks the merge button. Still, I'm concerned about adding such a
> step to exclusively allocate a proposal number.
>
> I would suggest you collect the suggestions in this discussion thread, and
> propose a process as shown in
>
> * https://github.com/rust-lang/rfcs#readme
> * https://github.com/pingcap/tidb/tree/master/docs/design#readme
>
> So that we can have a vote. Instead of rushing into making several PRs to
> move content fragmentedly.
>
> Best,
> tison.
>
>
> PengHui Li  于2022年8月25日周四 10:59写道:
>
> > Hi Tison,
> >
> > Thanks for providing more context.
> >
> > > Rust RFCs hold in a separate repo so that they can use the PR number as
> > the
> > proposal number to prevent duplicates. It's a bit overhead for Pulsar to
> > hold a separate repo. So I'd prefer to hold proposals in the same repo
> > under the dev/proposals folder and resolve the PR number duplicates issue
> > as described above. Although, it can still be a bit awkward. But at least
> > we can quickly notice duplicate numbers and resolve them. Using the PR
> > number as the proposal number keeps it unique, but the number will be
> > discontinuous. Especially if we're proposing in the main repo, the number
> > will grow quickly.
> >
> > The PR number can provide unique numbers. Everyone can create PRs, maybe
> > not related to PIP, but we are not able to revert it.
> >
> > > In this direction, we can associate a proposal with a tracking issue
> and
> > manipulate the proposal status represented by their tracking issue. It
> > doesn't mean we keep the current issue discussion process. You can regard
> > it as an umbrella ticket for the proposal. See
> > https://github.com/pingcap/tidb/issues/26022 as an example.
> >
> > > In this direction, we should always ensure that there're several eyes
> on
> > the project. Otherwise, we can easily end up with an out-of-sync project.
> >
> > IMO, the final status of the proposal should be summarized in the
> proposal.
> > We can have some metadata for md file which allow us to have a page that
> > can
> > show all the proposals like the bookkeeper does
> > https://bookkeeper.apache.org/community/bookkeeper-proposals
> > From my personal view, I prefer this way of the bookkeeper. This will
> allow
> > all the users find the proposals on the website, instead of going to
> search
> > in the
> > issue list.
> >
> > And the md file is the source of the truth. I think we can only provide
> one
> > way to
> > update the proposal status (from a PR). So that all the contributors can
> > contribute.
> > As I know, if we have an issue tracking all the proposal status, not all
> > the contributors
> > make contributions. Only the committers and the author? Maybe the author
> > can change
> > the policy but still can't involve more people to review.
> >
> > > It seems the confusion comes from we have to merge one PR first to
> > prevent
> > duplicate numbers. If it's a certain concern, see also my comment above
> > using the PR number as the proposal, or even host a separate proposal
> repo
> > as rust-lang/rfcs does and keep the number growing normally. I can
> provide
> > experience to maintain this sort of resource.
> >
> > Yes, it looks to re

[GitHub] [pulsar] Nintorac edited a comment on the discussion: Presto connector schema issue

2022-08-24 Thread GitBox


GitHub user Nintorac edited a comment on the discussion: Presto connector 
schema issue

And for logs in that folder I only see `logs/pulsar-standalone.log` and 
`logs/gc.log`

GitHub link: 
https://github.com/apache/pulsar/discussions/17234#discussioncomment-3471296


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



[GitHub] [pulsar] Nintorac added a comment to the discussion: Presto connector schema issue

2022-08-24 Thread GitBox


GitHub user Nintorac added a comment to the discussion: Presto connector schema 
issue

And for logs I only see `logs/pulsar-standalone.log` and `logs/gc.log`

GitHub link: 
https://github.com/apache/pulsar/discussions/17234#discussioncomment-3471296


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



[GitHub] [pulsar] Nintorac added a comment to the discussion: Presto connector schema issue

2022-08-24 Thread GitBox


GitHub user Nintorac added a comment to the discussion: Presto connector schema 
issue

[pulsar-trino.tar.gz](https://github.com/apache/pulsar/files/9421624/pulsar-trino.tar.gz)

Here is the setup -- all in docker compose. It is missing the pulsar presto 
connctor plugin folder found in the release for version `apache-pulsar-2.10.1`. 

I also changed the pulsar docker image to match `2.10.1` (it is `2.9.1` in the 
archive)  and tried again but same error

GitHub link: 
https://github.com/apache/pulsar/discussions/17234#discussioncomment-3471265


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



[GitHub] [pulsar-site] Anonymitaet closed issue #172: Dark/Light mode toggle cannot be clicked for develop-binary-protocol page

2022-08-24 Thread GitBox


Anonymitaet closed issue #172: Dark/Light mode toggle cannot be clicked for 
develop-binary-protocol page
URL: https://github.com/apache/pulsar-site/issues/172


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] PIP-Enable read-only partition, add read and write control for a single partition.

2022-08-24 Thread Gavin Gao
Where we store the metadata?
I suggest we use the system topic to store the metadata instead of zk 

On 2022/08/25 02:36:56 baomingyu_5 wrote:
> Hi all,
> 
> In the exclusive consumption mode of exclusive, failover, reader, etc. if a 
> large number of messages accumulate due to limited consumption capacity or 
> consumption logic problems, expanding partitions or modifying consumption 
> logic cannot reduce the consumption of new messages in time. Influence, a 
> certain proportion of newly added messages will still be produced in the 
> accumulated paritition, and new messages cannot be quickly consumed by 
> restarted consumers or new consumers. In this way, the time affected by the 
> accumulation factor may very long.
> 
> In order to reduce the impact on the business, through the control of read 
> and write permissions, configure some partitions as read-only, so that newly 
> produced messages can be sent to the partitions with less accumulation or 
> newly expanded, reducing the impact on the business caused by the 
> accumulation of messages. Reduce the overall time-consuming of 
> production--consumption.


[GitHub] [pulsar] gaoran10 added a comment to the discussion: Presto connector schema issue

2022-08-24 Thread GitBox


GitHub user gaoran10 added a comment to the discussion: Presto connector schema 
issue

Could you try to reproduce this problem in standalone mode? You should find log 
file `$PULSAR_HOME/lib/presto/var/log/server.log`.
If there are reproduce steps, it's better.

GitHub link: 
https://github.com/apache/pulsar/discussions/17234#discussioncomment-3471125


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



Re: [Discussion] PIP 198 - How to define [type] and [scope]?

2022-08-24 Thread tison
Anyway, it's a separate topic to discuss. If you want to discuss issue
types and whether to label components, please start another thread.

Best,
tison.


tison  于2022年8月25日周四 13:12写道:

> From current issue templates, we already sort issues into bug reports,
> improvements, doc changes, flaky tests, and PIPs. They're types. [type] and
> [component] described here are applied to commit messages, not for issues.
>
> For components, we may encourage contributors to try their best to sort
> out related components, but it's generally hard to do. I report a bug, how
> can I know which components are related? It is required I have to dig it
> out?
>
> Best,
> tison.
>
>
> tison  于2022年8月25日周四 13:08写道:
>
>> > Agree with applying the same rules ( [type] [scope] summary) for
>> writing issue titles.
>>
>> It cannot be guarded by check so I think it only increases contributors'
>> overhead.
>>
>> Instead, we can try to find out some integration if we can use the GitHub
>> issue forms dropdown widget to allow contributors to tag the issue.
>>
>> I don't know whether it's possible, but it's better than setting up title
>> rules. I can foresee that it's seldomly followed.
>>
>> Best,
>> tison.
>>
>>
>> Liu Yu  于2022年8月25日周四 12:59写道:
>>
>>> Thanks Max!
>>>
>>> Agree with applying the same rules ( [type] [scope] summary) for writing
>>> issue titles.
>>>
>>> On 2022/08/25 02:48:51 Max Xu wrote:
>>> > LGTM.
>>> >
>>> > And I think we should also update our issue templates.
>>> >
>>> > Best,
>>> > Max Xu
>>> >
>>> >
>>> > On Tue, Aug 23, 2022 at 6:04 PM Yu  wrote:
>>> >
>>> > > Hi team,
>>> > >
>>> > > Many thanks for your feedback! We've adjusted the convention based
>>> on your
>>> > > suggestions!
>>> > >
>>> > > Below is a brief summary of what we have reached a consensus on:
>>> > >
>>> > > 
>>> > >
>>> > > 1. Convention
>>> > >
>>> > > Continue to follow our existing convention (it's customized on
>>> Agular) [1]
>>> > >
>>> > > 
>>> > >
>>> > > 2. Definition
>>> > >
>>> > > [type] must be one of the following:
>>> > > - feat (abbr for "feature")
>>> > > - improve
>>> > > - fix
>>> > > - cleanup
>>> > > - refactor
>>> > > - revert
>>> > >
>>> > > [scope] must be one of the following:
>>> > > - admin
>>> > > - broker
>>> > > - cli (changes to CLI tools)
>>> > > - io
>>> > > - fn (abbr for "function")
>>> > > - meta (abbr for "metadata")
>>> > > - monitor
>>> > > - proxy
>>> > > - schema
>>> > > - sec (abbr for "security")
>>> > > - sql
>>> > > - storage
>>> > > - offload (changes to tiered storage)
>>> > > - txn
>>> > > - java
>>> > > - cpp
>>> > > - py
>>> > > - ws (changes to WebSocket)
>>> > > - test (changes to code tests)
>>> > > - ci (changes to CI workflow)
>>> > > - build (changes to dependencies, docker, build or release script)
>>> > > - misc
>>> > > - doc
>>> > > - blog
>>> > > - site
>>> > >
>>> > > For full details, see [Guide] Pulsar Pull Request Naming Convention
>>> [2]
>>> > >
>>> > > 
>>> > >
>>> > > If you have any concerns, feel free to comment before 13:00 August
>>> 25 (UTC
>>> > > +8).
>>> > >
>>> > > We'll start implementing it if there is no objection after that time.
>>> > >
>>> > > Thank you!
>>> > >
>>> > > 
>>> > >
>>> > > [1] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
>>> > > [2]
>>> > >
>>> > >
>>> https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno
>>> > >
>>> > > Yu and mangoGoForward
>>> > >
>>> > > On Tue, Aug 23, 2022 at 5:59 PM Yu  wrote:
>>> > >
>>> > > > Hi Jiuming, Yunze, tison,
>>> > > > Thanks for your vote!
>>> > > >
>>> > > > 
>>> > > >
>>> > > > Hi tison,
>>> > > >
>>> > > > > "packaging logics"
>>> > > > > For example, build the docker image, build & publish shell
>>> scripts.
>>> > > >
>>> > > > If you refer to these changes, they belong to [build] scope.
>>> > > >
>>> > > > Yu and Zixuan
>>> > > >
>>> > > > On Tue, Aug 23, 2022 at 1:25 PM tison 
>>> wrote:
>>> > > >
>>> > > >> Hi Yu,
>>> > > >>
>>> > > >> Reply inline:
>>> > > >>
>>> > > >> > Besides, the existing scope, [tool], refers to Pulsar CLI tools
>>> [1].
>>> > > >> > We're considering to rename it to [cli] since:
>>> > > >>
>>> > > >> Make sense.
>>> > > >>
>>> > > >> > "deployment logic" If so, can we ignore this?
>>> > > >>
>>> > > >> I saw you already remove [deploy] scope. No comment here. It
>>> should be
>>> > > >> fine.
>>> > > >>
>>> > > >> > "packaging logics"
>>> > > >>
>>> > > >> For example, build the docker image, build & publish shell
>>> scripts.
>>> > > >>
>>> > > >> >  How about defining [build] refer to the following?
>>> > > >>
>>> > > >> Make sense.
>>> > > >>
>>> > > >> > Two quick questions need your vote!
>>> > > >>
>>> > > >> To save letters, B & A.
>>> > > >>
>>> > > >> Best,
>>> > > >> tison.
>>> > > >>
>>> > > >
>>> > >
>>> >
>>>
>>


Re: [Discussion] PIP 198 - How to define [type] and [scope]?

2022-08-24 Thread tison
>From current issue templates, we already sort issues into bug reports,
improvements, doc changes, flaky tests, and PIPs. They're types. [type] and
[component] described here are applied to commit messages, not for issues.

For components, we may encourage contributors to try their best to sort out
related components, but it's generally hard to do. I report a bug, how can
I know which components are related? It is required I have to dig it out?

Best,
tison.


tison  于2022年8月25日周四 13:08写道:

> > Agree with applying the same rules ( [type] [scope] summary) for
> writing issue titles.
>
> It cannot be guarded by check so I think it only increases contributors'
> overhead.
>
> Instead, we can try to find out some integration if we can use the GitHub
> issue forms dropdown widget to allow contributors to tag the issue.
>
> I don't know whether it's possible, but it's better than setting up title
> rules. I can foresee that it's seldomly followed.
>
> Best,
> tison.
>
>
> Liu Yu  于2022年8月25日周四 12:59写道:
>
>> Thanks Max!
>>
>> Agree with applying the same rules ( [type] [scope] summary) for writing
>> issue titles.
>>
>> On 2022/08/25 02:48:51 Max Xu wrote:
>> > LGTM.
>> >
>> > And I think we should also update our issue templates.
>> >
>> > Best,
>> > Max Xu
>> >
>> >
>> > On Tue, Aug 23, 2022 at 6:04 PM Yu  wrote:
>> >
>> > > Hi team,
>> > >
>> > > Many thanks for your feedback! We've adjusted the convention based on
>> your
>> > > suggestions!
>> > >
>> > > Below is a brief summary of what we have reached a consensus on:
>> > >
>> > > 
>> > >
>> > > 1. Convention
>> > >
>> > > Continue to follow our existing convention (it's customized on
>> Agular) [1]
>> > >
>> > > 
>> > >
>> > > 2. Definition
>> > >
>> > > [type] must be one of the following:
>> > > - feat (abbr for "feature")
>> > > - improve
>> > > - fix
>> > > - cleanup
>> > > - refactor
>> > > - revert
>> > >
>> > > [scope] must be one of the following:
>> > > - admin
>> > > - broker
>> > > - cli (changes to CLI tools)
>> > > - io
>> > > - fn (abbr for "function")
>> > > - meta (abbr for "metadata")
>> > > - monitor
>> > > - proxy
>> > > - schema
>> > > - sec (abbr for "security")
>> > > - sql
>> > > - storage
>> > > - offload (changes to tiered storage)
>> > > - txn
>> > > - java
>> > > - cpp
>> > > - py
>> > > - ws (changes to WebSocket)
>> > > - test (changes to code tests)
>> > > - ci (changes to CI workflow)
>> > > - build (changes to dependencies, docker, build or release script)
>> > > - misc
>> > > - doc
>> > > - blog
>> > > - site
>> > >
>> > > For full details, see [Guide] Pulsar Pull Request Naming Convention
>> [2]
>> > >
>> > > 
>> > >
>> > > If you have any concerns, feel free to comment before 13:00 August 25
>> (UTC
>> > > +8).
>> > >
>> > > We'll start implementing it if there is no objection after that time.
>> > >
>> > > Thank you!
>> > >
>> > > 
>> > >
>> > > [1] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
>> > > [2]
>> > >
>> > >
>> https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno
>> > >
>> > > Yu and mangoGoForward
>> > >
>> > > On Tue, Aug 23, 2022 at 5:59 PM Yu  wrote:
>> > >
>> > > > Hi Jiuming, Yunze, tison,
>> > > > Thanks for your vote!
>> > > >
>> > > > 
>> > > >
>> > > > Hi tison,
>> > > >
>> > > > > "packaging logics"
>> > > > > For example, build the docker image, build & publish shell
>> scripts.
>> > > >
>> > > > If you refer to these changes, they belong to [build] scope.
>> > > >
>> > > > Yu and Zixuan
>> > > >
>> > > > On Tue, Aug 23, 2022 at 1:25 PM tison  wrote:
>> > > >
>> > > >> Hi Yu,
>> > > >>
>> > > >> Reply inline:
>> > > >>
>> > > >> > Besides, the existing scope, [tool], refers to Pulsar CLI tools
>> [1].
>> > > >> > We're considering to rename it to [cli] since:
>> > > >>
>> > > >> Make sense.
>> > > >>
>> > > >> > "deployment logic" If so, can we ignore this?
>> > > >>
>> > > >> I saw you already remove [deploy] scope. No comment here. It
>> should be
>> > > >> fine.
>> > > >>
>> > > >> > "packaging logics"
>> > > >>
>> > > >> For example, build the docker image, build & publish shell scripts.
>> > > >>
>> > > >> >  How about defining [build] refer to the following?
>> > > >>
>> > > >> Make sense.
>> > > >>
>> > > >> > Two quick questions need your vote!
>> > > >>
>> > > >> To save letters, B & A.
>> > > >>
>> > > >> Best,
>> > > >> tison.
>> > > >>
>> > > >
>> > >
>> >
>>
>


Re: [Discussion] PIP 198 - How to define [type] and [scope]?

2022-08-24 Thread tison
> Agree with applying the same rules ( [type] [scope] summary) for writing
issue titles.

It cannot be guarded by check so I think it only increases contributors'
overhead.

Instead, we can try to find out some integration if we can use the GitHub
issue forms dropdown widget to allow contributors to tag the issue.

I don't know whether it's possible, but it's better than setting up title
rules. I can foresee that it's seldomly followed.

Best,
tison.


Liu Yu  于2022年8月25日周四 12:59写道:

> Thanks Max!
>
> Agree with applying the same rules ( [type] [scope] summary) for writing
> issue titles.
>
> On 2022/08/25 02:48:51 Max Xu wrote:
> > LGTM.
> >
> > And I think we should also update our issue templates.
> >
> > Best,
> > Max Xu
> >
> >
> > On Tue, Aug 23, 2022 at 6:04 PM Yu  wrote:
> >
> > > Hi team,
> > >
> > > Many thanks for your feedback! We've adjusted the convention based on
> your
> > > suggestions!
> > >
> > > Below is a brief summary of what we have reached a consensus on:
> > >
> > > 
> > >
> > > 1. Convention
> > >
> > > Continue to follow our existing convention (it's customized on Agular)
> [1]
> > >
> > > 
> > >
> > > 2. Definition
> > >
> > > [type] must be one of the following:
> > > - feat (abbr for "feature")
> > > - improve
> > > - fix
> > > - cleanup
> > > - refactor
> > > - revert
> > >
> > > [scope] must be one of the following:
> > > - admin
> > > - broker
> > > - cli (changes to CLI tools)
> > > - io
> > > - fn (abbr for "function")
> > > - meta (abbr for "metadata")
> > > - monitor
> > > - proxy
> > > - schema
> > > - sec (abbr for "security")
> > > - sql
> > > - storage
> > > - offload (changes to tiered storage)
> > > - txn
> > > - java
> > > - cpp
> > > - py
> > > - ws (changes to WebSocket)
> > > - test (changes to code tests)
> > > - ci (changes to CI workflow)
> > > - build (changes to dependencies, docker, build or release script)
> > > - misc
> > > - doc
> > > - blog
> > > - site
> > >
> > > For full details, see [Guide] Pulsar Pull Request Naming Convention [2]
> > >
> > > 
> > >
> > > If you have any concerns, feel free to comment before 13:00 August 25
> (UTC
> > > +8).
> > >
> > > We'll start implementing it if there is no objection after that time.
> > >
> > > Thank you!
> > >
> > > 
> > >
> > > [1] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
> > > [2]
> > >
> > >
> https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno
> > >
> > > Yu and mangoGoForward
> > >
> > > On Tue, Aug 23, 2022 at 5:59 PM Yu  wrote:
> > >
> > > > Hi Jiuming, Yunze, tison,
> > > > Thanks for your vote!
> > > >
> > > > 
> > > >
> > > > Hi tison,
> > > >
> > > > > "packaging logics"
> > > > > For example, build the docker image, build & publish shell scripts.
> > > >
> > > > If you refer to these changes, they belong to [build] scope.
> > > >
> > > > Yu and Zixuan
> > > >
> > > > On Tue, Aug 23, 2022 at 1:25 PM tison  wrote:
> > > >
> > > >> Hi Yu,
> > > >>
> > > >> Reply inline:
> > > >>
> > > >> > Besides, the existing scope, [tool], refers to Pulsar CLI tools
> [1].
> > > >> > We're considering to rename it to [cli] since:
> > > >>
> > > >> Make sense.
> > > >>
> > > >> > "deployment logic" If so, can we ignore this?
> > > >>
> > > >> I saw you already remove [deploy] scope. No comment here. It should
> be
> > > >> fine.
> > > >>
> > > >> > "packaging logics"
> > > >>
> > > >> For example, build the docker image, build & publish shell scripts.
> > > >>
> > > >> >  How about defining [build] refer to the following?
> > > >>
> > > >> Make sense.
> > > >>
> > > >> > Two quick questions need your vote!
> > > >>
> > > >> To save letters, B & A.
> > > >>
> > > >> Best,
> > > >> tison.
> > > >>
> > > >
> > >
> >
>


[VOTE RESULT] PIP 198: Standardize PR Naming Convention using GitHub Actions

2022-08-24 Thread Yu
Hi team,

As discussed previously [1], we'll start implementing PIP 198: Standardize
PR Naming Convention using GitHub Actions [1] based on the results voted by
the community.

Below is a brief summary of the vote results.



1. Convention

Continue to follow our existing convention (it's customized on Agular) [3]



2. Definition

[type] must be one of the following:
- feat (abbr for "feature")
- improve
- fix
- cleanup
- refactor
- revert

[scope] must be one of the following:
- admin
- broker
- cli (changes to CLI tools)
- io
- fn (abbr for "function")
- meta (abbr for "metadata")
- monitor
- proxy
- schema
- sec (abbr for "security")
- sql
- storage
- offload (changes to tiered storage)
- txn
- java
- cpp
- py
- ws (changes to WebSocket)
- test (changes to code tests)
- ci (changes to CI workflow)
- build (changes to dependencies, docker, build or release script)
- misc
- doc
- blog
- site

For full details, see [Guide] Pulsar Pull Request Naming Convention [4]



[1] https://lists.apache.org/thread/11hpppllkh68zodromd7pzn173gb9hgl
[2]
https://docs.google.com/document/d/1sJlUNAHnYAbvu9UtEgCrn_oVTnVc1M5nHC19x1bFab4/edit?pli=1
[3] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
[4]
https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno



Yu, mangoGoForward, Max


Re: [Discussion] PIP 198 - How to define [type] and [scope]?

2022-08-24 Thread Liu Yu
Thanks Max! 

Agree with applying the same rules ( [type] [scope] summary) for writing issue 
titles.

On 2022/08/25 02:48:51 Max Xu wrote:
> LGTM.
> 
> And I think we should also update our issue templates.
> 
> Best,
> Max Xu
> 
> 
> On Tue, Aug 23, 2022 at 6:04 PM Yu  wrote:
> 
> > Hi team,
> >
> > Many thanks for your feedback! We've adjusted the convention based on your
> > suggestions!
> >
> > Below is a brief summary of what we have reached a consensus on:
> >
> > 
> >
> > 1. Convention
> >
> > Continue to follow our existing convention (it's customized on Agular) [1]
> >
> > 
> >
> > 2. Definition
> >
> > [type] must be one of the following:
> > - feat (abbr for "feature")
> > - improve
> > - fix
> > - cleanup
> > - refactor
> > - revert
> >
> > [scope] must be one of the following:
> > - admin
> > - broker
> > - cli (changes to CLI tools)
> > - io
> > - fn (abbr for "function")
> > - meta (abbr for "metadata")
> > - monitor
> > - proxy
> > - schema
> > - sec (abbr for "security")
> > - sql
> > - storage
> > - offload (changes to tiered storage)
> > - txn
> > - java
> > - cpp
> > - py
> > - ws (changes to WebSocket)
> > - test (changes to code tests)
> > - ci (changes to CI workflow)
> > - build (changes to dependencies, docker, build or release script)
> > - misc
> > - doc
> > - blog
> > - site
> >
> > For full details, see [Guide] Pulsar Pull Request Naming Convention [2]
> >
> > 
> >
> > If you have any concerns, feel free to comment before 13:00 August 25 (UTC
> > +8).
> >
> > We'll start implementing it if there is no objection after that time.
> >
> > Thank you!
> >
> > 
> >
> > [1] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
> > [2]
> >
> > https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno
> >
> > Yu and mangoGoForward
> >
> > On Tue, Aug 23, 2022 at 5:59 PM Yu  wrote:
> >
> > > Hi Jiuming, Yunze, tison,
> > > Thanks for your vote!
> > >
> > > 
> > >
> > > Hi tison,
> > >
> > > > "packaging logics"
> > > > For example, build the docker image, build & publish shell scripts.
> > >
> > > If you refer to these changes, they belong to [build] scope.
> > >
> > > Yu and Zixuan
> > >
> > > On Tue, Aug 23, 2022 at 1:25 PM tison  wrote:
> > >
> > >> Hi Yu,
> > >>
> > >> Reply inline:
> > >>
> > >> > Besides, the existing scope, [tool], refers to Pulsar CLI tools [1].
> > >> > We're considering to rename it to [cli] since:
> > >>
> > >> Make sense.
> > >>
> > >> > "deployment logic" If so, can we ignore this?
> > >>
> > >> I saw you already remove [deploy] scope. No comment here. It should be
> > >> fine.
> > >>
> > >> > "packaging logics"
> > >>
> > >> For example, build the docker image, build & publish shell scripts.
> > >>
> > >> >  How about defining [build] refer to the following?
> > >>
> > >> Make sense.
> > >>
> > >> > Two quick questions need your vote!
> > >>
> > >> To save letters, B & A.
> > >>
> > >> Best,
> > >> tison.
> > >>
> > >
> >
> 


Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread tison
Hi Penghui,

> IMO, the final status of the proposal should be summarized in the
proposal.

Agree. It's not exclusive. What I propose is that if you want to track
proposals in a project, track issues. But it seems your opinion is not to
do so. Then it's a nonissue. Flink FLIP tracks proposal status with the
proposal content and it works well. Although, I'd still highlight that
maintainer efforts are required to maintain the metadata.

> As I know, if we have an issue tracking all the proposal status, not all
the contributors make contributions. Only the committers and the author?

I don't propose to track the status _within_ the issue. But if you want to
track in a project, only committer have the privilege :)

>  if the discussion of the motivation doesn't have objections and is
discussed enough.

This is quite vague and hard to follow.

Rust RFCs have a nonmandatory pre-RFC stage. I think we can encourage the
proposer to discuss on the mailing list or any channel with committers
first. But any proposer can take their proposal number once proposed and a
committer clicks the merge button. Still, I'm concerned about adding such a
step to exclusively allocate a proposal number.

I would suggest you collect the suggestions in this discussion thread, and
propose a process as shown in

* https://github.com/rust-lang/rfcs#readme
* https://github.com/pingcap/tidb/tree/master/docs/design#readme

So that we can have a vote. Instead of rushing into making several PRs to
move content fragmentedly.

Best,
tison.


PengHui Li  于2022年8月25日周四 10:59写道:

> Hi Tison,
>
> Thanks for providing more context.
>
> > Rust RFCs hold in a separate repo so that they can use the PR number as
> the
> proposal number to prevent duplicates. It's a bit overhead for Pulsar to
> hold a separate repo. So I'd prefer to hold proposals in the same repo
> under the dev/proposals folder and resolve the PR number duplicates issue
> as described above. Although, it can still be a bit awkward. But at least
> we can quickly notice duplicate numbers and resolve them. Using the PR
> number as the proposal number keeps it unique, but the number will be
> discontinuous. Especially if we're proposing in the main repo, the number
> will grow quickly.
>
> The PR number can provide unique numbers. Everyone can create PRs, maybe
> not related to PIP, but we are not able to revert it.
>
> > In this direction, we can associate a proposal with a tracking issue and
> manipulate the proposal status represented by their tracking issue. It
> doesn't mean we keep the current issue discussion process. You can regard
> it as an umbrella ticket for the proposal. See
> https://github.com/pingcap/tidb/issues/26022 as an example.
>
> > In this direction, we should always ensure that there're several eyes on
> the project. Otherwise, we can easily end up with an out-of-sync project.
>
> IMO, the final status of the proposal should be summarized in the proposal.
> We can have some metadata for md file which allow us to have a page that
> can
> show all the proposals like the bookkeeper does
> https://bookkeeper.apache.org/community/bookkeeper-proposals
> From my personal view, I prefer this way of the bookkeeper. This will allow
> all the users find the proposals on the website, instead of going to search
> in the
> issue list.
>
> And the md file is the source of the truth. I think we can only provide one
> way to
> update the proposal status (from a PR). So that all the contributors can
> contribute.
> As I know, if we have an issue tracking all the proposal status, not all
> the contributors
> make contributions. Only the committers and the author? Maybe the author
> can change
> the policy but still can't involve more people to review.
>
> > It seems the confusion comes from we have to merge one PR first to
> prevent
> duplicate numbers. If it's a certain concern, see also my comment above
> using the PR number as the proposal, or even host a separate proposal repo
> as rust-lang/rfcs does and keep the number growing normally. I can provide
> experience to maintain this sort of resource.
>
> Yes, it looks to request a PIP number through a PR. IMO, it is necessary.
> The reviewer can only merge the PR (request the PIP number) if the
> discussion
> of the motivation doesn't have objections and is discussed enough.
> Without this step, maybe only the author thinks that "I can create a
> proposal for now" (maybe
> no response on the mailing list).
>
> Best,
> Penghui
>
> On Thu, Aug 25, 2022 at 10:31 AM PengHui Li  wrote:
>
> > > So that we can review the details and improve these 2 documents (I see
> > the last update
> > for these two documents are 2021 and 2020)
> >
> > Sorry, I think it's not a good idea.
> > We should move to the codebase 100% the same as the original
> documentation.
> > And use another PR to make improvements for it. If it is needed, we
> should
> > start with an email.
> > We need to have historical records.
> >
> > Penghui
> >
> >
> > On Thu, Aug 2

Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Max Xu
+1 (non-binding)

It's a cool idea!

Best,
Max Xu


On Wed, Aug 24, 2022 at 9:46 PM PengHui Li  wrote:

> +1 (binding)
>
> Best,
> Penghui
>
> On Wed, Aug 24, 2022 at 9:39 PM Dave Fisher  wrote:
>
> > +1 (binding)
> >
> > Best,
> > Dave
> >
> > Sent from my iPhone
> >
> > > On Aug 24, 2022, at 12:43 AM, Yunze Xu 
> > wrote:
> > >
> > > +1 (non binding)
> > >
> > > Thanks,
> > > Yunze
> > >
> > >
> > >
> > >
> > >> 2022年8月24日 15:38,Nicolò Boschi  写道:
> > >>
> > >> +1 (non binding)
> > >> Nicolò Boschi
> > >>
> > >>
> > >>> Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
> > >>> eolive...@gmail.com> ha scritto:
> > >>>
> > >>> Hello,
> > >>> this is the official thread VOTE for PIP-201 Extensions mechanism for
> > >>> Pulsar Admin CLI tools
> > >>>
> > >>> This is the PIP link https://github.com/apache/pulsar/issues/17155
> > >>> This is the discussion:
> > >>> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
> > >>>
> > >>> I am still working on the PR, that is the subject of the VOTE.
> > >>>
> > >>> Best regards
> > >>> Enrico Olivelli
> > >>>
> > >
> >
> >
>


[GitHub] [pulsar] liangyuanpeng added a comment to the discussion: I encountered a port error when using pulsar to integrate etcd

2022-08-24 Thread GitBox


GitHub user liangyuanpeng added a comment to the discussion: I encountered a 
port error when using pulsar to integrate etcd


I have tests on the docker image of  `docker pull lypgcs/pulsar:2022-08-23` and 
it's working for etcd cluster.

You need Change `etcd:node01:2379,node02:2379,node03:2379` to 
`etcd:http://node01:2379,http://node02:2379,http://node03:2379`

GitHub link: 
https://github.com/apache/pulsar/discussions/17240#discussioncomment-3471019


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



[GitHub] [pulsar-site] Anonymitaet commented on pull request #176: Generate client config docs from source code

2022-08-24 Thread GitBox


Anonymitaet commented on PR #176:
URL: https://github.com/apache/pulsar-site/pull/176#issuecomment-1226745537

   @urfreespace 
   Could you please review this PR from a technical perspective? 
   Thank you! 😊


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu commented on a diff in pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


maxsxu commented on code in PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#discussion_r954477706


##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra
+ref: master

Review Comment:
   Removed, let ref uses the default branch.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] Anonymitaet closed pull request #36: [Docbot] Add doc-added label

2022-08-24 Thread GitBox


Anonymitaet closed pull request #36: [Docbot] Add doc-added label
URL: https://github.com/apache/pulsar-test-infra/pull/36


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] Anonymitaet commented on pull request #36: [Docbot] Add doc-added label

2022-08-24 Thread GitBox


Anonymitaet commented on PR #36:
URL: https://github.com/apache/pulsar-test-infra/pull/36#issuecomment-1226743227

   Thanks for your contribution! Already resolved 
https://github.com/apache/pulsar/blob/b21f72886dc20362efe6808d690ddca32c5abd1e/.github/workflows/ci-documentbot.yml#L56


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread Haiting Jiang
Here is my idea of handing the read position shift if we want to go a bit
further with this solution.

Instead of only checking the exact match of entry range match
(PendingReadKey in the PR),
we can check if the requesting range is covered by several ongoing read
requests, and merge
 the result of these requests.

For example, currently there are already inflight request reads of entries
[10,20] and [20,30],
and now comes a new request [15,25], it can just merge the result from
[10,20] and [20,30],
and avoids sending the request to BK.

As for the complexity checking procedure, I think we can sort all the
inflight requests by startEntryId
and apply a binary search here, to reduce the time complexity to O(logN)
where N is the
max entry size in one ledger.

Thanks,
Haiting

On Thu, Aug 25, 2022 at 11:40 AM Haiting Jiang 
wrote:

> +1 for this optimization for the specific scenario.
>
> This simple patch is really effective, in some tests it reduces a lot
>> (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
>> You may say that it is unlikely that the broker sends concurrent
>> requests for exactly the same range of entries but empirically I have
>> observed that it happens quite often, at least in the scenario I drew
>> above.
>
>
> This is quite an interesting test result. My guess is that all the
> subscriptions
> have exactly the same backlog and all the messages are consumed instantly
> and successfully. So that they can read the same range of entries.
> Once there are some consuming delays or retries due to failures, I believe
> there will be read position shifts and the BK read requests will go up.
>
> Thanks,
> Haiting
>
> On Thu, Aug 25, 2022 at 12:12 AM Michael Marshall 
> wrote:
>
>> +1 I support merging the temporary solution.
>>
>> > Just one point, please make sure the change will not introduce too much
>> > heap memory overhead.
>>
>> If we see that there is extra pressure, I wonder if we can dynamically
>> determine when to deduplicate requests. A broker only needs this
>> feature when a topic has multiple subscriptions. I haven't double
>> checked if this information is easily accessible to the cache.
>>
>> One of the interesting benefits of this optimization is that it will
>> decrease direct memory pressure. When we do not deduplicate reads, the
>> same entry will be duplicated in the bookie's netty direct memory
>> pool. With this design, an entry will be materialized once with many
>> different ByteBuffer's objects pointing to it.
>>
>> Thanks,
>> Michael
>>
>> On Wed, Aug 24, 2022 at 11:00 AM PengHui Li  wrote:
>> >
>> > +1 for we can have a short-term solution first.
>> >
>> > Just one point, please make sure the change will not introduce too much
>> > heap memory overhead.
>> >
>> > Thanks,
>> > Penghui
>> >
>> > On Wed, Aug 24, 2022 at 11:54 PM Enrico Olivelli 
>> > wrote:
>> >
>> > > Hello folks,
>> > > These days I am working on a use case in which I have many
>> > > subscriptions on the same topic (~100).
>> > > If you get to a situation with a big backlog, the consumers are no
>> > > more able to leverage the broker cache then the broker starts to read
>> > > from BookKeeper.
>> > >
>> > > In 2.11 we have a first mechanism of catch-up reads cache for
>> > > backlogged consumers (when you have a few backlogged cursors we start
>> > > putting in the cache the entries that you read).
>> > >
>> > > That is not enough to deal with my case because if you have many
>> > > consumers (one per partition) that connect all together, for instance
>> > > after a major problem in the application, the cache is empty and the
>> > > broker starts to overwhelm the bookies with reads.
>> > >
>> > > As a short term solution I have created this patch
>> > > https://github.com/apache/pulsar/pull/17241
>> > >
>> > > The patch basically saves reads from BK by preventing concurrent reads
>> > > for the same entries (actually "ranges of entries").
>> > > If the broker starts a read for a range and then comes another read
>> > > for the same range before the read completes, then we skip sending the
>> > > new read and we use the result of the BK request.
>> > >
>> > > This simple patch is really effective, in some tests it reduces a lot
>> > > (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
>> > > You may say that it is unlikely that the broker sends concurrent
>> > > requests for exactly the same range of entries but empirically I have
>> > > observed that it happens quite often, at least in the scenario I drew
>> > > above.
>> > >
>> > > My proposal is to commit my patch as a short term solution.
>> > > I have already talked with PengHui and a better solution would be to
>> > > introduce explicitly a "Catch Up reads cache", but that is a mid term
>> > > implementation.
>> > >
>> > > WDYT ?
>> > >
>> > >
>> > > Enrico
>> > >
>>
>


[GitHub] [pulsar-test-infra] Anonymitaet commented on pull request #60: Remove unused legacy docbot

2022-08-24 Thread GitBox


Anonymitaet commented on PR #60:
URL: https://github.com/apache/pulsar-test-infra/pull/60#issuecomment-1226742311

   Hi @tisonkun @nodece thanks for your feedback!
   
   We (+ @maxsxu) prefer not to tag it. Reasons:
   
   1. We've searched and found that nearly no one uses this code.
   2. The quality of this code is not high.
   
   I can merge this PR if there are no more comments tomorrow. 

   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu commented on pull request #36: [Docbot] Add doc-added label

2022-08-24 Thread GitBox


maxsxu commented on PR #36:
URL: https://github.com/apache/pulsar-test-infra/pull/36#issuecomment-1226741194

   Already addressed in new docbot. /cc @Anonymitaet 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread Haiting Jiang
+1 for this optimization for the specific scenario.

This simple patch is really effective, in some tests it reduces a lot
> (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
> You may say that it is unlikely that the broker sends concurrent
> requests for exactly the same range of entries but empirically I have
> observed that it happens quite often, at least in the scenario I drew
> above.


This is quite an interesting test result. My guess is that all the
subscriptions
have exactly the same backlog and all the messages are consumed instantly
and successfully. So that they can read the same range of entries.
Once there are some consuming delays or retries due to failures, I believe
there will be read position shifts and the BK read requests will go up.

Thanks,
Haiting

On Thu, Aug 25, 2022 at 12:12 AM Michael Marshall 
wrote:

> +1 I support merging the temporary solution.
>
> > Just one point, please make sure the change will not introduce too much
> > heap memory overhead.
>
> If we see that there is extra pressure, I wonder if we can dynamically
> determine when to deduplicate requests. A broker only needs this
> feature when a topic has multiple subscriptions. I haven't double
> checked if this information is easily accessible to the cache.
>
> One of the interesting benefits of this optimization is that it will
> decrease direct memory pressure. When we do not deduplicate reads, the
> same entry will be duplicated in the bookie's netty direct memory
> pool. With this design, an entry will be materialized once with many
> different ByteBuffer's objects pointing to it.
>
> Thanks,
> Michael
>
> On Wed, Aug 24, 2022 at 11:00 AM PengHui Li  wrote:
> >
> > +1 for we can have a short-term solution first.
> >
> > Just one point, please make sure the change will not introduce too much
> > heap memory overhead.
> >
> > Thanks,
> > Penghui
> >
> > On Wed, Aug 24, 2022 at 11:54 PM Enrico Olivelli 
> > wrote:
> >
> > > Hello folks,
> > > These days I am working on a use case in which I have many
> > > subscriptions on the same topic (~100).
> > > If you get to a situation with a big backlog, the consumers are no
> > > more able to leverage the broker cache then the broker starts to read
> > > from BookKeeper.
> > >
> > > In 2.11 we have a first mechanism of catch-up reads cache for
> > > backlogged consumers (when you have a few backlogged cursors we start
> > > putting in the cache the entries that you read).
> > >
> > > That is not enough to deal with my case because if you have many
> > > consumers (one per partition) that connect all together, for instance
> > > after a major problem in the application, the cache is empty and the
> > > broker starts to overwhelm the bookies with reads.
> > >
> > > As a short term solution I have created this patch
> > > https://github.com/apache/pulsar/pull/17241
> > >
> > > The patch basically saves reads from BK by preventing concurrent reads
> > > for the same entries (actually "ranges of entries").
> > > If the broker starts a read for a range and then comes another read
> > > for the same range before the read completes, then we skip sending the
> > > new read and we use the result of the BK request.
> > >
> > > This simple patch is really effective, in some tests it reduces a lot
> > > (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
> > > You may say that it is unlikely that the broker sends concurrent
> > > requests for exactly the same range of entries but empirically I have
> > > observed that it happens quite often, at least in the scenario I drew
> > > above.
> > >
> > > My proposal is to commit my patch as a short term solution.
> > > I have already talked with PengHui and a better solution would be to
> > > introduce explicitly a "Catch Up reads cache", but that is a mid term
> > > implementation.
> > >
> > > WDYT ?
> > >
> > >
> > > Enrico
> > >
>


[GitHub] [pulsar-test-infra] tisonkun commented on a diff in pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


tisonkun commented on code in PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#discussion_r954466400


##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra
+ref: master

Review Comment:
   Fair enough. If we remove this config, the default branch is also master.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu commented on a diff in pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


maxsxu commented on code in PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#discussion_r954462336


##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra
+ref: master

Review Comment:
   The current branch ref is master: 
https://github.com/apache/pulsar-test-infra/tree/master



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] SignorMercurio opened a new issue, #177: Migrate all docs from `/tools` to `/reference`

2022-08-24 Thread GitBox


SignorMercurio opened a new issue, #177:
URL: https://github.com/apache/pulsar-site/issues/177

   According to 
https://apache-pulsar.slack.com/archives/C03MG1H2B3N/p1661220553497289, we can 
start working on migrate all docs from `/tools` to `/reference`. For instance, 
https://pulsar.apache.org/tools/pulsar-admin/2.11.0-SNAPSHOT/ will be moved to 
https://pulsar.apache.org/reference/ and can be viewed in the `Pulsar Admin` 
section of the side navbar, just like `Pulsar Configuration` right now.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu commented on a diff in pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


maxsxu commented on code in PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#discussion_r954459313


##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra

Review Comment:
   Fixed



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [Vote] PIP-192 New Pulsar Broker Load Balancer

2022-08-24 Thread Hang Chen
+1 (binding)

Awesome job!

Thanks,
Hang

Heesung Sohn  于2022年8月24日周三 05:29写道:
>
> Hi, Pulsar dev community,
>
> This vote has been open for 23 days, and we are closing this vote today as
> this PIP received three binding +1s.
>
> We are moving forward and starting implementation -- there will be a series
> of PRs and sub-PIPs(if further design discussions are required).
>
> Thank you,
> Heesung
>
> On Tue, Aug 9, 2022 at 4:56 AM guo jiwei  wrote:
>
> > +1
> >
> > Regards
> > Jiwei Guo (Tboy)
> >
> >
> > On Tue, Aug 9, 2022 at 3:12 PM Qiang Huang 
> > wrote:
> >
> > > +1 (non-binding)
> > >
> > > Kai Wang  于2022年8月5日周五 10:18写道:
> > >
> > > > +1 (non-binding)
> > > >
> > > > Thanks,
> > > > Kai
> > > >
> > > > Heesung Sohn  于2022年8月2日周二
> > > 08:50写道:
> > > >
> > > > > Dear Pulsar Community,
> > > > >
> > > > > Please review and vote on this PIP.
> > > > >
> > > > > PIP link: https://github.com/apache/pulsar/issues/16691
> > > > >
> > > > > Thank you,
> > > > > -Heesung
> > > > >
> > > >
> > >
> > >
> > > --
> > > BR,
> > > Qiang Huang
> > >
> >


Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread PengHui Li
Hi Tison,

Thanks for providing more context.

> Rust RFCs hold in a separate repo so that they can use the PR number as
the
proposal number to prevent duplicates. It's a bit overhead for Pulsar to
hold a separate repo. So I'd prefer to hold proposals in the same repo
under the dev/proposals folder and resolve the PR number duplicates issue
as described above. Although, it can still be a bit awkward. But at least
we can quickly notice duplicate numbers and resolve them. Using the PR
number as the proposal number keeps it unique, but the number will be
discontinuous. Especially if we're proposing in the main repo, the number
will grow quickly.

The PR number can provide unique numbers. Everyone can create PRs, maybe
not related to PIP, but we are not able to revert it.

> In this direction, we can associate a proposal with a tracking issue and
manipulate the proposal status represented by their tracking issue. It
doesn't mean we keep the current issue discussion process. You can regard
it as an umbrella ticket for the proposal. See
https://github.com/pingcap/tidb/issues/26022 as an example.

> In this direction, we should always ensure that there're several eyes on
the project. Otherwise, we can easily end up with an out-of-sync project.

IMO, the final status of the proposal should be summarized in the proposal.
We can have some metadata for md file which allow us to have a page that can
show all the proposals like the bookkeeper does
https://bookkeeper.apache.org/community/bookkeeper-proposals
>From my personal view, I prefer this way of the bookkeeper. This will allow
all the users find the proposals on the website, instead of going to search
in the
issue list.

And the md file is the source of the truth. I think we can only provide one
way to
update the proposal status (from a PR). So that all the contributors can
contribute.
As I know, if we have an issue tracking all the proposal status, not all
the contributors
make contributions. Only the committers and the author? Maybe the author
can change
the policy but still can't involve more people to review.

> It seems the confusion comes from we have to merge one PR first to prevent
duplicate numbers. If it's a certain concern, see also my comment above
using the PR number as the proposal, or even host a separate proposal repo
as rust-lang/rfcs does and keep the number growing normally. I can provide
experience to maintain this sort of resource.

Yes, it looks to request a PIP number through a PR. IMO, it is necessary.
The reviewer can only merge the PR (request the PIP number) if the
discussion
of the motivation doesn't have objections and is discussed enough.
Without this step, maybe only the author thinks that "I can create a
proposal for now" (maybe
no response on the mailing list).

Best,
Penghui

On Thu, Aug 25, 2022 at 10:31 AM PengHui Li  wrote:

> > So that we can review the details and improve these 2 documents (I see
> the last update
> for these two documents are 2021 and 2020)
>
> Sorry, I think it's not a good idea.
> We should move to the codebase 100% the same as the original documentation.
> And use another PR to make improvements for it. If it is needed, we should
> start with an email.
> We need to have historical records.
>
> Penghui
>
>
> On Thu, Aug 25, 2022 at 10:28 AM PengHui Li  wrote:
>
>> Hi Michael,
>>
>> > This raises the question of how to handle historical PIPs. We could do
>> some work to move PIPs out of the wiki and out of issues into the code
>> base. The wiki is just another git repo, so any committer should be
>> able to help migrate PIPs in the wiki to the apache/pulsar git repo.
>> This might also be an opportunity to double check the status on old
>> PIPs.
>>
>> Yes, I think it's a good idea. IMO, we can migrate them one by one.
>> To provide more context for each one like the discussion email, vote email
>> , and the approvals (Of course, very early proposals did not require
>> approval).
>> So that all the contributors can help with the migration process.
>>
>> I think I can move [0] [1] to the codebase first.
>>
>> [0]
>> https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-Template
>> [1]
>> https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-%28PIP%29
>>
>> So that we can review the details and improve these 2 documents (I see
>> the last update
>> for these two documents are 2021 and 2020)
>>
>> Thanks,
>> Penghui
>>
>> On Thu, Aug 25, 2022 at 9:44 AM tison  wrote:
>>
>>> Hi Penghui,
>>>
>>> Thanks for starting this discussion! +1 for holding PIPs in the codebase.
>>>
>>> I'd like to share some experience working on open-source projects holding
>>> proposals as code.
>>>
>>> 1. Rust RFCs: codebase https://github.com/rust-lang/rfcs
>>> 2. TiDB Proposals:
>>> https://github.com/pingcap/tidb/tree/master/docs/design#readme
>>>
>>> Reflect on topics in this thread:
>>>
>>> > any reviews should happen with a PR first
>>>
>>> The experience should be similar to reviewing on Google Docs while

[GitHub] [pulsar-test-infra] tisonkun commented on a diff in pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


tisonkun commented on code in PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#discussion_r954452284


##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra

Review Comment:
   ```suggestion
   repository: apache/pulsar-test-infra
   ```
   ?



##
docbot/action.yml:
##
@@ -8,6 +8,16 @@ branding:
 runs:
   using: composite
   steps:
-- run: go run main.go
+- name: Checkout
+  uses: actions/checkout@v3
+  with:
+repository: pulsar/pulsar-test-infra
+ref: master

Review Comment:
   ```suggestion
   ```
   



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] SignorMercurio opened a new pull request, #176: Generate client config docs from source code

2022-08-24 Thread GitBox


SignorMercurio opened a new pull request, #176:
URL: https://github.com/apache/pulsar-site/pull/176

   Related PR: apache/pulsar#17198.
   
   @urfreespace PTAL


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [Discussion] PIP 198 - How to define [type] and [scope]?

2022-08-24 Thread Max Xu
LGTM.

And I think we should also update our issue templates.

Best,
Max Xu


On Tue, Aug 23, 2022 at 6:04 PM Yu  wrote:

> Hi team,
>
> Many thanks for your feedback! We've adjusted the convention based on your
> suggestions!
>
> Below is a brief summary of what we have reached a consensus on:
>
> 
>
> 1. Convention
>
> Continue to follow our existing convention (it's customized on Agular) [1]
>
> 
>
> 2. Definition
>
> [type] must be one of the following:
> - feat (abbr for "feature")
> - improve
> - fix
> - cleanup
> - refactor
> - revert
>
> [scope] must be one of the following:
> - admin
> - broker
> - cli (changes to CLI tools)
> - io
> - fn (abbr for "function")
> - meta (abbr for "metadata")
> - monitor
> - proxy
> - schema
> - sec (abbr for "security")
> - sql
> - storage
> - offload (changes to tiered storage)
> - txn
> - java
> - cpp
> - py
> - ws (changes to WebSocket)
> - test (changes to code tests)
> - ci (changes to CI workflow)
> - build (changes to dependencies, docker, build or release script)
> - misc
> - doc
> - blog
> - site
>
> For full details, see [Guide] Pulsar Pull Request Naming Convention [2]
>
> 
>
> If you have any concerns, feel free to comment before 13:00 August 25 (UTC
> +8).
>
> We'll start implementing it if there is no objection after that time.
>
> Thank you!
>
> 
>
> [1] https://lists.apache.org/thread/90rcjf1dv0fbkb5hm31kmgr65fj0nfnn
> [2]
>
> https://docs.google.com/document/d/1d8Pw6ZbWk-_pCKdOmdvx9rnhPiyuxwq60_TrD68d7BA/edit?pli=1#bookmark=id.y8943h392zno
>
> Yu and mangoGoForward
>
> On Tue, Aug 23, 2022 at 5:59 PM Yu  wrote:
>
> > Hi Jiuming, Yunze, tison,
> > Thanks for your vote!
> >
> > 
> >
> > Hi tison,
> >
> > > "packaging logics"
> > > For example, build the docker image, build & publish shell scripts.
> >
> > If you refer to these changes, they belong to [build] scope.
> >
> > Yu and Zixuan
> >
> > On Tue, Aug 23, 2022 at 1:25 PM tison  wrote:
> >
> >> Hi Yu,
> >>
> >> Reply inline:
> >>
> >> > Besides, the existing scope, [tool], refers to Pulsar CLI tools [1].
> >> > We're considering to rename it to [cli] since:
> >>
> >> Make sense.
> >>
> >> > "deployment logic" If so, can we ignore this?
> >>
> >> I saw you already remove [deploy] scope. No comment here. It should be
> >> fine.
> >>
> >> > "packaging logics"
> >>
> >> For example, build the docker image, build & publish shell scripts.
> >>
> >> >  How about defining [build] refer to the following?
> >>
> >> Make sense.
> >>
> >> > Two quick questions need your vote!
> >>
> >> To save letters, B & A.
> >>
> >> Best,
> >> tison.
> >>
> >
>


[DISCUSS] PIP-Enable read-only partition, add read and write control for a single partition.

2022-08-24 Thread baomingyu_5
Hi all,

In the exclusive consumption mode of exclusive, failover, reader, etc. if a 
large number of messages accumulate due to limited consumption capacity or 
consumption logic problems, expanding partitions or modifying consumption logic 
cannot reduce the consumption of new messages in time. Influence, a certain 
proportion of newly added messages will still be produced in the accumulated 
paritition, and new messages cannot be quickly consumed by restarted consumers 
or new consumers. In this way, the time affected by the accumulation factor may 
very long.

In order to reduce the impact on the business, through the control of read and 
write permissions, configure some partitions as read-only, so that newly 
produced messages can be sent to the partitions with less accumulation or newly 
expanded, reducing the impact on the business caused by the accumulation of 
messages. Reduce the overall time-consuming of production--consumption.




This feature needs to be changed as follows:

1、PB protocl

message CommandPartitionedTopicMetadataResponse {

enum LookupType {

Success  = 0;

Failed   = 1;

}

optional uint32 partitions= 1;  // Optional in case of error

required uint64 request_id= 2;

optional LookupType response  = 3;

optional ServerError error= 4;

optional string message   = 5;

repeated KeyValue metadata = 6;// key parition index,value:r、w、rw

}




2、Modify choosePartition method in all implement class of MessageRouter, and it 
will not choose read-only partition to send msg.




3、ProducerImpl adds permission attributes, and if permission is read-only will 
not be used sending msg.




4、Modify admin interface updatePartitionedTopic to update metadata of topic

Thanks,  


Mingyu.




[DISCUSS] PIP-Enable read-only partition, add read and write control for a single partition.

2022-08-24 Thread baomingyu_5
Hi all,

In the exclusive consumption mode of exclusive, failover, reader, etc. if a 
large number of messages accumulate due to limited consumption capacity or 
consumption logic problems, expanding partitions or modifying consumption logic 
cannot reduce the consumption of new messages in time. Influence, a certain 
proportion of newly added messages will still be produced in the accumulated 
paritition, and new messages cannot be quickly consumed by restarted consumers 
or new consumers. In this way, the time affected by the accumulation factor may 
very long.

In order to reduce the impact on the business, through the control of read and 
write permissions, configure some partitions as read-only, so that newly 
produced messages can be sent to the partitions with less accumulation or newly 
expanded, reducing the impact on the business caused by the accumulation of 
messages. Reduce the overall time-consuming of production--consumption.

Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread PengHui Li
> So that we can review the details and improve these 2 documents (I see
the last update
for these two documents are 2021 and 2020)

Sorry, I think it's not a good idea.
We should move to the codebase 100% the same as the original documentation.
And use another PR to make improvements for it. If it is needed, we should
start with an email.
We need to have historical records.

Penghui


On Thu, Aug 25, 2022 at 10:28 AM PengHui Li  wrote:

> Hi Michael,
>
> > This raises the question of how to handle historical PIPs. We could do
> some work to move PIPs out of the wiki and out of issues into the code
> base. The wiki is just another git repo, so any committer should be
> able to help migrate PIPs in the wiki to the apache/pulsar git repo.
> This might also be an opportunity to double check the status on old
> PIPs.
>
> Yes, I think it's a good idea. IMO, we can migrate them one by one.
> To provide more context for each one like the discussion email, vote email
> , and the approvals (Of course, very early proposals did not require
> approval).
> So that all the contributors can help with the migration process.
>
> I think I can move [0] [1] to the codebase first.
>
> [0]
> https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-Template
> [1]
> https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-%28PIP%29
>
> So that we can review the details and improve these 2 documents (I see the
> last update
> for these two documents are 2021 and 2020)
>
> Thanks,
> Penghui
>
> On Thu, Aug 25, 2022 at 9:44 AM tison  wrote:
>
>> Hi Penghui,
>>
>> Thanks for starting this discussion! +1 for holding PIPs in the codebase.
>>
>> I'd like to share some experience working on open-source projects holding
>> proposals as code.
>>
>> 1. Rust RFCs: codebase https://github.com/rust-lang/rfcs
>> 2. TiDB Proposals:
>> https://github.com/pingcap/tidb/tree/master/docs/design#readme
>>
>> Reflect on topics in this thread:
>>
>> > any reviews should happen with a PR first
>>
>> The experience should be similar to reviewing on Google Docs while keeping
>> all conversations on GitHub and thus synced on the mailing list. You can
>> preview can some examples:
>>
>> * https://github.com/pingcap/tidb/pull/23673
>> * https://github.com/rust-lang/rfcs/pull/42
>>
>> Furthermore, continuous follow-up like PIP-198 can discuss implementation
>> details and update the proposal content one PR by another. The downside is
>> that even trivial typo fixes should go through a PR.
>>
>> > Certainly, all the votes should happen on the mailing list.
>>
>> Yes. We should have an explicit [VOTE] and [RESULT][VOTE] thread on the
>> mailing list while keeping a reference within the proposal content. We can
>> port the current PIP issue template to a proposal template as Rust RFCs to
>> highlight it.
>>
>> > We can just add a link that points to the PIPs dir to the contribution
>> guide
>> or README.
>>
>> Rust RFCs hold in a separate repo so that they can use the PR number as
>> the
>> proposal number to prevent duplicates. It's a bit overhead for Pulsar to
>> hold a separate repo. So I'd prefer to hold proposals in the same repo
>> under the dev/proposals folder and resolve the PR number duplicates issue
>> as described above. Although, it can still be a bit awkward. But at least
>> we can quickly notice duplicate numbers and resolve them. Using the PR
>> number as the proposal number keeps it unique, but the number will be
>> discontinuous. Especially if we're proposing in the main repo, the number
>> will grow quickly.
>>
>> > Should we track the status of PIPs as a project?
>>
>> In this direction, we can associate a proposal with a tracking issue and
>> manipulate the proposal status represented by their tracking issue. It
>> doesn't mean we keep the current issue discussion process. You can regard
>> it as an umbrella ticket for the proposal. See
>> https://github.com/pingcap/tidb/issues/26022 as an example.
>>
>> In this direction, we should always ensure that there're several eyes on
>> the project. Otherwise, we can easily end up with an out-of-sync project.
>>
>> To @Rajan @Penghui
>>
>> It seems the confusion comes from we have to merge one PR first to prevent
>> duplicate numbers. If it's a certain concern, see also my comment above
>> using the PR number as the proposal, or even host a separate proposal repo
>> as rust-lang/rfcs does and keep the number growing normally. I can provide
>> experience to maintain this sort of resource.
>>
>>
>> Best,
>> tison.
>>
>


Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread PengHui Li
Hi Michael,

> This raises the question of how to handle historical PIPs. We could do
some work to move PIPs out of the wiki and out of issues into the code
base. The wiki is just another git repo, so any committer should be
able to help migrate PIPs in the wiki to the apache/pulsar git repo.
This might also be an opportunity to double check the status on old
PIPs.

Yes, I think it's a good idea. IMO, we can migrate them one by one.
To provide more context for each one like the discussion email, vote email
, and the approvals (Of course, very early proposals did not require
approval).
So that all the contributors can help with the migration process.

I think I can move [0] [1] to the codebase first.

[0]
https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-Template
[1]
https://github.com/apache/pulsar/wiki/Pulsar-Improvement-Proposal-%28PIP%29

So that we can review the details and improve these 2 documents (I see the
last update
for these two documents are 2021 and 2020)

Thanks,
Penghui

On Thu, Aug 25, 2022 at 9:44 AM tison  wrote:

> Hi Penghui,
>
> Thanks for starting this discussion! +1 for holding PIPs in the codebase.
>
> I'd like to share some experience working on open-source projects holding
> proposals as code.
>
> 1. Rust RFCs: codebase https://github.com/rust-lang/rfcs
> 2. TiDB Proposals:
> https://github.com/pingcap/tidb/tree/master/docs/design#readme
>
> Reflect on topics in this thread:
>
> > any reviews should happen with a PR first
>
> The experience should be similar to reviewing on Google Docs while keeping
> all conversations on GitHub and thus synced on the mailing list. You can
> preview can some examples:
>
> * https://github.com/pingcap/tidb/pull/23673
> * https://github.com/rust-lang/rfcs/pull/42
>
> Furthermore, continuous follow-up like PIP-198 can discuss implementation
> details and update the proposal content one PR by another. The downside is
> that even trivial typo fixes should go through a PR.
>
> > Certainly, all the votes should happen on the mailing list.
>
> Yes. We should have an explicit [VOTE] and [RESULT][VOTE] thread on the
> mailing list while keeping a reference within the proposal content. We can
> port the current PIP issue template to a proposal template as Rust RFCs to
> highlight it.
>
> > We can just add a link that points to the PIPs dir to the contribution
> guide
> or README.
>
> Rust RFCs hold in a separate repo so that they can use the PR number as the
> proposal number to prevent duplicates. It's a bit overhead for Pulsar to
> hold a separate repo. So I'd prefer to hold proposals in the same repo
> under the dev/proposals folder and resolve the PR number duplicates issue
> as described above. Although, it can still be a bit awkward. But at least
> we can quickly notice duplicate numbers and resolve them. Using the PR
> number as the proposal number keeps it unique, but the number will be
> discontinuous. Especially if we're proposing in the main repo, the number
> will grow quickly.
>
> > Should we track the status of PIPs as a project?
>
> In this direction, we can associate a proposal with a tracking issue and
> manipulate the proposal status represented by their tracking issue. It
> doesn't mean we keep the current issue discussion process. You can regard
> it as an umbrella ticket for the proposal. See
> https://github.com/pingcap/tidb/issues/26022 as an example.
>
> In this direction, we should always ensure that there're several eyes on
> the project. Otherwise, we can easily end up with an out-of-sync project.
>
> To @Rajan @Penghui
>
> It seems the confusion comes from we have to merge one PR first to prevent
> duplicate numbers. If it's a certain concern, see also my comment above
> using the PR number as the proposal, or even host a separate proposal repo
> as rust-lang/rfcs does and keep the number growing normally. I can provide
> experience to maintain this sort of resource.
>
>
> Best,
> tison.
>


[GitHub] [pulsar-test-infra] maxsxu commented on pull request #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


maxsxu commented on PR #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68#issuecomment-1226695489

   /cc @tisonkun @Anonymitaet @lhotari 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu opened a new pull request, #68: [improve][docbot] Move the preceding steps into the action

2022-08-24 Thread GitBox


maxsxu opened a new pull request, #68:
URL: https://github.com/apache/pulsar-test-infra/pull/68

   Fixes #67 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu opened a new issue, #67: Simplify invocation of docbot action

2022-08-24 Thread GitBox


maxsxu opened a new issue, #67:
URL: https://github.com/apache/pulsar-test-infra/issues/67

   ### Motivation
   The [Current invocation of 
docbot](https://github.com/apache/pulsar/blob/46b1239a23fd2a250b009df1e9b4a65341d299a7/.github/workflows/ci-documentbot.yml):
   
   ```
   jobs:
 label:
   steps:
 - name: Checkout action
   uses: actions/checkout@v3
   with:
 repository: apache/pulsar-test-infra
 ref: master
   
 - name: Set up Go
   uses: actions/setup-go@v3
   with:
 go-version: 1.18
   
 - name: Labeling
   uses: ./docbot
   env:
 GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
 LABEL_WATCH_LIST: 'doc,doc-required,doc-not-needed,doc-complete'
 LABEL_MISSING: 'doc-label-missing'
   ```
   
   These steps before Labeling should be organized into the action itself, the 
desired invocation should be illustrated below:
   
   ```
   jobs:
 label:
   steps:
 - name: Labeling
   uses: pulsar/pulsar-test-infra/docbot@master
   env:
 GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
 LABEL_WATCH_LIST: 'doc,doc-required,doc-not-needed,doc-complete'
 LABEL_MISSING: 'doc-label-missing'
   ```
   
   inspired from 
https://github.com/apache/pulsar-test-infra/issues/58#issuecomment-1225889058


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Enable message deduplication for replicator by default

2022-08-24 Thread Baodi Shi
+1

Thanks,
Baodi Shi

> On Aug 24, 2022, at 20:1312, Qiang Huang  wrote:
> 
> +1
> 
> Zike Yang  于2022年8月22日周一 15:32写道:
> 
>> +1
>> 
>> Thanks,
>> Zike Yang
>> 
>> On Mon, Aug 22, 2022 at 3:16 PM mattison chao 
>> wrote:
>>> 
>>> +1
>>> 
>>> Best,
>>> Mattison
>>> 
>>> On Fri, 19 Aug 2022 at 01:40, Enrico Olivelli 
>> wrote:
>>> 
 I agree
 
 Enrico
 
 Il Gio 18 Ago 2022, 18:23 PengHui Li  ha scritto:
 
> Hi all,
> 
> When I tried to fix a problem related to replicator
> https://github.com/apache/pulsar/pull/17154
> It surprised me that the message deduplication will not work by
>> default
> with the replicator.
> I always thought it was enabled for replicators by default. Details
>> to
 see
> [0].
> 
> I think we should enable the deduplication for the replicator.
>> Otherwise,
> we will see duplicated
> messages on the remote cluster. And the producer of the replicator
>> always
> has a fixed producer
> name, this will make the message deduplication work properly.
> 
> The test introduced in https://github.com/apache/pulsar/pull/17154
>> will
> check the message
> replication ordering. Without the message deduplication enabled, the
>> test
> is flaky with received
> duplicated messages. After enabling, everything is fine.
> 
> Best,
> Penghui
> 
> [0]
>> https://github.com/apache/pulsar/pull/17154#discussion_r948736894
> 
 
>> 
> 
> 
> -- 
> BR,
> Qiang Huang



Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread tison
Hi Penghui,

Thanks for starting this discussion! +1 for holding PIPs in the codebase.

I'd like to share some experience working on open-source projects holding
proposals as code.

1. Rust RFCs: codebase https://github.com/rust-lang/rfcs
2. TiDB Proposals:
https://github.com/pingcap/tidb/tree/master/docs/design#readme

Reflect on topics in this thread:

> any reviews should happen with a PR first

The experience should be similar to reviewing on Google Docs while keeping
all conversations on GitHub and thus synced on the mailing list. You can
preview can some examples:

* https://github.com/pingcap/tidb/pull/23673
* https://github.com/rust-lang/rfcs/pull/42

Furthermore, continuous follow-up like PIP-198 can discuss implementation
details and update the proposal content one PR by another. The downside is
that even trivial typo fixes should go through a PR.

> Certainly, all the votes should happen on the mailing list.

Yes. We should have an explicit [VOTE] and [RESULT][VOTE] thread on the
mailing list while keeping a reference within the proposal content. We can
port the current PIP issue template to a proposal template as Rust RFCs to
highlight it.

> We can just add a link that points to the PIPs dir to the contribution guide
or README.

Rust RFCs hold in a separate repo so that they can use the PR number as the
proposal number to prevent duplicates. It's a bit overhead for Pulsar to
hold a separate repo. So I'd prefer to hold proposals in the same repo
under the dev/proposals folder and resolve the PR number duplicates issue
as described above. Although, it can still be a bit awkward. But at least
we can quickly notice duplicate numbers and resolve them. Using the PR
number as the proposal number keeps it unique, but the number will be
discontinuous. Especially if we're proposing in the main repo, the number
will grow quickly.

> Should we track the status of PIPs as a project?

In this direction, we can associate a proposal with a tracking issue and
manipulate the proposal status represented by their tracking issue. It
doesn't mean we keep the current issue discussion process. You can regard
it as an umbrella ticket for the proposal. See
https://github.com/pingcap/tidb/issues/26022 as an example.

In this direction, we should always ensure that there're several eyes on
the project. Otherwise, we can easily end up with an out-of-sync project.

To @Rajan @Penghui

It seems the confusion comes from we have to merge one PR first to prevent
duplicate numbers. If it's a certain concern, see also my comment above
using the PR number as the proposal, or even host a separate proposal repo
as rust-lang/rfcs does and keep the number growing normally. I can provide
experience to maintain this sort of resource.


Best,
tison.


[jira] [Closed] (PULSAR-22) flink消费pulsar时出现错误,但是仍可消费到数据

2022-08-24 Thread Jira


 [ 
https://issues.apache.org/jira/browse/PULSAR-22?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

海洋饼干 closed PULSAR-22.
--
Resolution: Cannot Reproduce

> flink消费pulsar时出现错误,但是仍可消费到数据
> 
>
> Key: PULSAR-22
> URL: https://issues.apache.org/jira/browse/PULSAR-22
> Project: Pulsar
>  Issue Type: Bug
> Environment: kafka(json) -> flink(Stream) -> custom 
> Serializer -> kop -> pulsar -> flink(stream) keyshared consum(问题所在区域)
>Reporter: 海洋饼干
>Priority: Major
>
> [Source Data Fetcher for Source: kafkaSource (4/4)#0] ERROR 
> org.apache.flink.connector.pulsar.source.reader.split.PulsarPartitionSplitReaderBase
>  - Error in polling message from pulsar consumer.
> java.util.concurrent.ExecutionException: 
> org.apache.pulsar.client.api.PulsarClientException$TransactionConflictException:
>  
> \{"errorMsg":"org.apache.pulsar.transaction.common.exception.TransactionConflictException:
>  
> [persistent://public/default/HB0002_rdwBaseCtr_aaamore5005-test-pulsar-partition-0][flink-source]
>  Transaction:(1,12) try to ack batch message:810:24 in pending ack 
> status.","reqId":4357026755663389273, 
> "remote":"tk01-bd-test-pulsar-7-139/192.168.7.139:6650", 
> "local":"/192.168.34.54:9756"}
>     at 
> java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
>     at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1908)
>     at 
> org.apache.flink.connector.pulsar.source.reader.split.PulsarUnorderedPartitionSplitReader.pollMessage(PulsarUnorderedPartitionSplitReader.java:98)
>     at 
> org.apache.flink.connector.pulsar.source.reader.split.PulsarPartitionSplitReaderBase.fetch(PulsarPartitionSplitReaderBase.java:110)
>     at 
> org.apache.flink.connector.pulsar.source.reader.split.PulsarUnorderedPartitionSplitReader.fetch(PulsarUnorderedPartitionSplitReader.java:56)
>     at 
> org.apache.flink.connector.base.source.reader.fetcher.FetchTask.run(FetchTask.java:58)
>     at 
> org.apache.flink.connector.base.source.reader.fetcher.SplitFetcher.runOnce(SplitFetcher.java:142)
>     at 
> org.apache.flink.connector.base.source.reader.fetcher.SplitFetcher.run(SplitFetcher.java:105)
>     at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:750)
> Caused by: 
> org.apache.pulsar.client.api.PulsarClientException$TransactionConflictException:
>  
> \{"errorMsg":"org.apache.pulsar.transaction.common.exception.TransactionConflictException:
>  
> [persistent://public/default/HB0002_rdwBaseCtr_aaamore5005-test-pulsar-partition-0][flink-source]
>  Transaction:(1,12) try to ack batch message:810:24 in pending ack 
> status.","reqId":4357026755663389273, 
> "remote":"tk01-bd-test-pulsar-7-139/192.168.7.139:6650", 
> "local":"/192.168.34.54:9756"}
>     at 
> org.apache.pulsar.client.impl.ClientCnx.getPulsarClientException(ClientCnx.java:1177)
>     at 
> org.apache.pulsar.client.impl.ClientCnx.handleAckResponse(ClientCnx.java:431)
>     at 
> org.apache.pulsar.common.protocol.PulsarDecoder.channelRead(PulsarDecoder.java:150)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
>     at 
> org.apache.pulsar.shade.io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(ByteToMessageDecoder.java:327)
>     at 
> org.apache.pulsar.shade.io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:299)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
>     at 
> org.apache.pulsar.shade.io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(DefaultChannelPipeline.java:1410)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
>     at 
> org.apache.pulsar.shade.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
>     at 
> org.apache.pu

Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread Christophe Bornet
+1 from me. (I've always been a supporter of having PIPs in codebase)

Le mer. 24 août 2022 à 18:04, Michael Marshall  a
écrit :

> I support moving PIPs to the code base.
>
> When we discussed this option during last year's redesign of the PIP
> process, an objection was that "merging" a PIP could be construed as
> acceptance of the PIP. As long as we are clear that we merge all
> proposed PIP PRs because it is good to have the historical record, I
> think we'll be able to prevent confusion.
>
> > Also we have old PIPs that have been abandoned or that never completed
> VOTEs
>
> This raises the question of how to handle historical PIPs. We could do
> some work to move PIPs out of the wiki and out of issues into the code
> base. The wiki is just another git repo, so any committer should be
> able to help migrate PIPs in the wiki to the apache/pulsar git repo.
> This might also be an opportunity to double check the status on old
> PIPs.
>
> Thanks,
> Michael
>
>
> On Wed, Aug 24, 2022 at 10:23 AM 丛搏  wrote:
> >
> > Hi all,
> >
> > I also think it's a good idea. Not only can it prevent the duplication
> of PIP
> > numbers, but most importantly, it is easier to annotate, in other words,
> > It can review the PIP just like the review code, it can make the problem
> > more focused, and it is easier for the viewer to annotate at any time.
> > It didn't make a huge difference, just from ISSUE to PR. We have no
> > reason not to change it, having it will greatly improve the review
> > efficiency of PIP.
> >
> > Thanks,
> > Bo
> >
> > PengHui Li  于2022年8月24日周三 10:01写道:
> > >
> > > Hi Rajan,
> > >
> > > > I didn't understand this part. You want one to create a PR before
> > > submitting a proposal? That's clearly not a good idea because if the
> PIP
> > > approach will change then the entire development effort will be wasted
> and
> > > that's the whole purpose of PIP. I guess creating PIP into an issue and
> > > discussing the issue is definitely working and it's an easier way to
> > > discuss quickly rather than discussing over email threads.
> > >
> > > Sorry, I mean create a PR to add the proposal, not the implementation.
> > > Before adding the proposal, we should discuss the motivation on the
> mailing
> > > list first.
> > >
> > > > Let's not change this practice without good discussion and agreement
> from
> > > the community.
> > >
> > > I don't think this essentially changes this part, it just provides a
> way
> > > for reviewers to
> > > review the details of the proposal more efficiently. For a quick
> > > discussion, this will not change
> > > it. Let me try to clarify the procedure after we applied this idea
> > >
> > >
> > >1. Send the email first to discuss the motivation of your proposal
> > >2. Discuss the motivation under the mailing list and request a PIP
> > >number if there are no objections
> > >3. Create a PR to add the detailed proposal
> > >4. *Send out the VOTE email with the PR link*
> > >5. *Review the proposal(The PR) on Github and vote under the mailing
> > >list.*
> > >6. Merge the PR of the proposal after the proposal get 3 (+1)
> bindings
> > >
> > > The only difference is step 4 and step 5. Currently, we share the issue
> > > link to ask for a review.
> > >
> > > Thanks,
> > > Penghui
> > >
> > > On Wed, Aug 24, 2022 at 1:22 AM Rajan Dhabalia 
> > > wrote:
> > >
> > > > Hi,
> > > >
> > > > >>> I think we can move all the PIPs to the codebase and the new
> proposal
> > > > and proposal without any reviews should happen with a PR first. So
> that we
> > > > can review and comment easily.
> > > >
> > > > I didn't understand this part. You want one to create a PR before
> > > > submitting a proposal? That's clearly not a good idea because if the
> PIP
> > > > approach will change then the entire development effort will be
> wasted and
> > > > that's the whole purpose of PIP. I guess creating PIP into an issue
> and
> > > > discussing the issue is definitely working and it's an easier way to
> > > > discuss quickly rather than discussing over email threads.
> > > >
> > > > Let's not change this practice without good discussion and agreement
> from
> > > > the community.
> > > >
> > > > Thanks,
> > > > Rajan
> > > >
> > > > On Thu, Aug 18, 2022 at 8:27 AM PengHui Li 
> wrote:
> > > >
> > > > > Hi all,
> > > > >
> > > > > Currently, the new proposal will be added to the issue list and
> then
> > > > shared
> > > > > link in the email
> > > > > to request the proposal review. It's really hard to review a long
> > > > proposal
> > > > > if you want to comment
> > > > > in detail.
> > > > >
> > > > > Here is an example:
> > > > >
> https://github.com/apache/pulsar/issues/16763#issuecomment-1219606491
> > > > > This seems very unintuitive.
> > > > >
> > > > > I think we can move all the PIPs to the codebase and the new
> proposal and
> > > > > proposal without
> > > > > any reviews should happen with a PR first. So that we can review
> and
> > > > > comment eas

[GitHub] [pulsar-site] tisonkun commented on pull request #170: fix version replace

2022-08-24 Thread GitBox


tisonkun commented on PR #170:
URL: https://github.com/apache/pulsar-site/pull/170#issuecomment-1226029231

   Cool!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] tisonkun opened a new issue, #175: Remove all content before sync

2022-08-24 Thread GitBox


tisonkun opened a new issue, #175:
URL: https://github.com/apache/pulsar-site/issues/175

   @urfreespace @Anonymitaet Otherwise, we can suffer from some files deleted 
from upstream but still remain in this repo.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] tisonkun closed pull request #174: Sync with main repo rename develop-binary-protocol

2022-08-24 Thread GitBox


tisonkun closed pull request #174: Sync with main repo rename 
develop-binary-protocol
URL: https://github.com/apache/pulsar-site/pull/174


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] tisonkun commented on pull request #174: Sync with main repo rename develop-binary-protocol

2022-08-24 Thread GitBox


tisonkun commented on PR #174:
URL: https://github.com/apache/pulsar-site/pull/174#issuecomment-1225988594

   Closed. It seems I can use the following snippet to redirect. Should submit 
a pr on the main repo:
   
   ```mdx
   ---
   id: develop-binary-protocol
   title: Pulsar binary protocol specification2
   sidebar_label: "Binary protocol2"
   ---
   
   import {Redirect} from '@docusaurus/router';
   
   
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread Michael Marshall
+1 I support merging the temporary solution.

> Just one point, please make sure the change will not introduce too much
> heap memory overhead.

If we see that there is extra pressure, I wonder if we can dynamically
determine when to deduplicate requests. A broker only needs this
feature when a topic has multiple subscriptions. I haven't double
checked if this information is easily accessible to the cache.

One of the interesting benefits of this optimization is that it will
decrease direct memory pressure. When we do not deduplicate reads, the
same entry will be duplicated in the bookie's netty direct memory
pool. With this design, an entry will be materialized once with many
different ByteBuffer's objects pointing to it.

Thanks,
Michael

On Wed, Aug 24, 2022 at 11:00 AM PengHui Li  wrote:
>
> +1 for we can have a short-term solution first.
>
> Just one point, please make sure the change will not introduce too much
> heap memory overhead.
>
> Thanks,
> Penghui
>
> On Wed, Aug 24, 2022 at 11:54 PM Enrico Olivelli 
> wrote:
>
> > Hello folks,
> > These days I am working on a use case in which I have many
> > subscriptions on the same topic (~100).
> > If you get to a situation with a big backlog, the consumers are no
> > more able to leverage the broker cache then the broker starts to read
> > from BookKeeper.
> >
> > In 2.11 we have a first mechanism of catch-up reads cache for
> > backlogged consumers (when you have a few backlogged cursors we start
> > putting in the cache the entries that you read).
> >
> > That is not enough to deal with my case because if you have many
> > consumers (one per partition) that connect all together, for instance
> > after a major problem in the application, the cache is empty and the
> > broker starts to overwhelm the bookies with reads.
> >
> > As a short term solution I have created this patch
> > https://github.com/apache/pulsar/pull/17241
> >
> > The patch basically saves reads from BK by preventing concurrent reads
> > for the same entries (actually "ranges of entries").
> > If the broker starts a read for a range and then comes another read
> > for the same range before the read completes, then we skip sending the
> > new read and we use the result of the BK request.
> >
> > This simple patch is really effective, in some tests it reduces a lot
> > (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
> > You may say that it is unlikely that the broker sends concurrent
> > requests for exactly the same range of entries but empirically I have
> > observed that it happens quite often, at least in the scenario I drew
> > above.
> >
> > My proposal is to commit my patch as a short term solution.
> > I have already talked with PengHui and a better solution would be to
> > introduce explicitly a "Catch Up reads cache", but that is a mid term
> > implementation.
> >
> > WDYT ?
> >
> >
> > Enrico
> >


[GitHub] [pulsar-site] tisonkun commented on pull request #174: Sync with main repo rename develop-binary-protocol

2022-08-24 Thread GitBox


tisonkun commented on PR #174:
URL: https://github.com/apache/pulsar-site/pull/174#issuecomment-1225936104

   Or we can change main repo files `develop-binary-protocol` to redirect to 
`developing-binary-protocol `


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] tisonkun opened a new pull request, #174: Sync with main repo rename develop-binary-protocol

2022-08-24 Thread GitBox


tisonkun opened a new pull request, #174:
URL: https://github.com/apache/pulsar-site/pull/174

   This closes #172.
   
   cc @urfreespace perhaps we should set up a redirect from 
develop-binary-protocol to developing-binary-protocol. Although, I don't know 
how to achieve it.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread Michael Marshall
I support moving PIPs to the code base.

When we discussed this option during last year's redesign of the PIP
process, an objection was that "merging" a PIP could be construed as
acceptance of the PIP. As long as we are clear that we merge all
proposed PIP PRs because it is good to have the historical record, I
think we'll be able to prevent confusion.

> Also we have old PIPs that have been abandoned or that never completed VOTEs

This raises the question of how to handle historical PIPs. We could do
some work to move PIPs out of the wiki and out of issues into the code
base. The wiki is just another git repo, so any committer should be
able to help migrate PIPs in the wiki to the apache/pulsar git repo.
This might also be an opportunity to double check the status on old
PIPs.

Thanks,
Michael


On Wed, Aug 24, 2022 at 10:23 AM 丛搏  wrote:
>
> Hi all,
>
> I also think it's a good idea. Not only can it prevent the duplication of PIP
> numbers, but most importantly, it is easier to annotate, in other words,
> It can review the PIP just like the review code, it can make the problem
> more focused, and it is easier for the viewer to annotate at any time.
> It didn't make a huge difference, just from ISSUE to PR. We have no
> reason not to change it, having it will greatly improve the review
> efficiency of PIP.
>
> Thanks,
> Bo
>
> PengHui Li  于2022年8月24日周三 10:01写道:
> >
> > Hi Rajan,
> >
> > > I didn't understand this part. You want one to create a PR before
> > submitting a proposal? That's clearly not a good idea because if the PIP
> > approach will change then the entire development effort will be wasted and
> > that's the whole purpose of PIP. I guess creating PIP into an issue and
> > discussing the issue is definitely working and it's an easier way to
> > discuss quickly rather than discussing over email threads.
> >
> > Sorry, I mean create a PR to add the proposal, not the implementation.
> > Before adding the proposal, we should discuss the motivation on the mailing
> > list first.
> >
> > > Let's not change this practice without good discussion and agreement from
> > the community.
> >
> > I don't think this essentially changes this part, it just provides a way
> > for reviewers to
> > review the details of the proposal more efficiently. For a quick
> > discussion, this will not change
> > it. Let me try to clarify the procedure after we applied this idea
> >
> >
> >1. Send the email first to discuss the motivation of your proposal
> >2. Discuss the motivation under the mailing list and request a PIP
> >number if there are no objections
> >3. Create a PR to add the detailed proposal
> >4. *Send out the VOTE email with the PR link*
> >5. *Review the proposal(The PR) on Github and vote under the mailing
> >list.*
> >6. Merge the PR of the proposal after the proposal get 3 (+1) bindings
> >
> > The only difference is step 4 and step 5. Currently, we share the issue
> > link to ask for a review.
> >
> > Thanks,
> > Penghui
> >
> > On Wed, Aug 24, 2022 at 1:22 AM Rajan Dhabalia 
> > wrote:
> >
> > > Hi,
> > >
> > > >>> I think we can move all the PIPs to the codebase and the new proposal
> > > and proposal without any reviews should happen with a PR first. So that we
> > > can review and comment easily.
> > >
> > > I didn't understand this part. You want one to create a PR before
> > > submitting a proposal? That's clearly not a good idea because if the PIP
> > > approach will change then the entire development effort will be wasted and
> > > that's the whole purpose of PIP. I guess creating PIP into an issue and
> > > discussing the issue is definitely working and it's an easier way to
> > > discuss quickly rather than discussing over email threads.
> > >
> > > Let's not change this practice without good discussion and agreement from
> > > the community.
> > >
> > > Thanks,
> > > Rajan
> > >
> > > On Thu, Aug 18, 2022 at 8:27 AM PengHui Li  wrote:
> > >
> > > > Hi all,
> > > >
> > > > Currently, the new proposal will be added to the issue list and then
> > > shared
> > > > link in the email
> > > > to request the proposal review. It's really hard to review a long
> > > proposal
> > > > if you want to comment
> > > > in detail.
> > > >
> > > > Here is an example:
> > > > https://github.com/apache/pulsar/issues/16763#issuecomment-1219606491
> > > > This seems very unintuitive.
> > > >
> > > > I think we can move all the PIPs to the codebase and the new proposal 
> > > > and
> > > > proposal without
> > > > any reviews should happen with a PR first. So that we can review and
> > > > comment easily.
> > > > Certainly, all the votes should happen on the mailing list. And we can
> > > also
> > > > discuss the
> > > > proposal on the mailing list.
> > > >
> > > > Following this way, we don't need to sync the PIPs from the issue to the
> > > > wiki page.
> > > > We can just add a link that points to the PIPs dir to the contribution
> > > > guide or README.
> > > >
> > > > We have 

Re: [DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread PengHui Li
+1 for we can have a short-term solution first.

Just one point, please make sure the change will not introduce too much
heap memory overhead.

Thanks,
Penghui

On Wed, Aug 24, 2022 at 11:54 PM Enrico Olivelli 
wrote:

> Hello folks,
> These days I am working on a use case in which I have many
> subscriptions on the same topic (~100).
> If you get to a situation with a big backlog, the consumers are no
> more able to leverage the broker cache then the broker starts to read
> from BookKeeper.
>
> In 2.11 we have a first mechanism of catch-up reads cache for
> backlogged consumers (when you have a few backlogged cursors we start
> putting in the cache the entries that you read).
>
> That is not enough to deal with my case because if you have many
> consumers (one per partition) that connect all together, for instance
> after a major problem in the application, the cache is empty and the
> broker starts to overwhelm the bookies with reads.
>
> As a short term solution I have created this patch
> https://github.com/apache/pulsar/pull/17241
>
> The patch basically saves reads from BK by preventing concurrent reads
> for the same entries (actually "ranges of entries").
> If the broker starts a read for a range and then comes another read
> for the same range before the read completes, then we skip sending the
> new read and we use the result of the BK request.
>
> This simple patch is really effective, in some tests it reduces a lot
> (from 100K reads/s to 250 reads/s) the concurrent reads to BK.
> You may say that it is unlikely that the broker sends concurrent
> requests for exactly the same range of entries but empirically I have
> observed that it happens quite often, at least in the scenario I drew
> above.
>
> My proposal is to commit my patch as a short term solution.
> I have already talked with PengHui and a better solution would be to
> introduce explicitly a "Catch Up reads cache", but that is a mid term
> implementation.
>
> WDYT ?
>
>
> Enrico
>


[DISCUSS] Catch up reads with many subscriptions - saving reads from BK

2022-08-24 Thread Enrico Olivelli
Hello folks,
These days I am working on a use case in which I have many
subscriptions on the same topic (~100).
If you get to a situation with a big backlog, the consumers are no
more able to leverage the broker cache then the broker starts to read
from BookKeeper.

In 2.11 we have a first mechanism of catch-up reads cache for
backlogged consumers (when you have a few backlogged cursors we start
putting in the cache the entries that you read).

That is not enough to deal with my case because if you have many
consumers (one per partition) that connect all together, for instance
after a major problem in the application, the cache is empty and the
broker starts to overwhelm the bookies with reads.

As a short term solution I have created this patch
https://github.com/apache/pulsar/pull/17241

The patch basically saves reads from BK by preventing concurrent reads
for the same entries (actually "ranges of entries").
If the broker starts a read for a range and then comes another read
for the same range before the read completes, then we skip sending the
new read and we use the result of the BK request.

This simple patch is really effective, in some tests it reduces a lot
(from 100K reads/s to 250 reads/s) the concurrent reads to BK.
You may say that it is unlikely that the broker sends concurrent
requests for exactly the same range of entries but empirically I have
observed that it happens quite often, at least in the scenario I drew
above.

My proposal is to commit my patch as a short term solution.
I have already talked with PengHui and a better solution would be to
introduce explicitly a "Catch Up reads cache", but that is a mid term
implementation.

WDYT ?


Enrico


[GitHub] [pulsar-manager] jakiuncle opened a new issue, #483: I want use casdoor to login,when i change code, happen some problem

2022-08-24 Thread GitBox


jakiuncle opened a new issue, #483:
URL: https://github.com/apache/pulsar-manager/issues/483

   
![image](https://user-images.githubusercontent.com/88994283/186461092-4ef76ec1-a3ba-4d7b-a177-bf877737cb2e.png)
   why it jump to
   
![image](https://user-images.githubusercontent.com/88994283/186461225-1637218f-4c1a-40eb-a37b-029bf9b38938.png)
   How can i aviod it.
   Thank you.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] maxsxu commented on issue #58: Use Docker container action for docbot

2022-08-24 Thread GitBox


maxsxu commented on issue #58:
URL: 
https://github.com/apache/pulsar-test-infra/issues/58#issuecomment-1225889058

   
   > Although, we can still try to let the docbot action to be used as barely 
`apache/pulsar-test-infra/docbot@main`.
   
   Yes, one thing we can be sure of now is to simplify the current 
ci-documentbot.yaml workflow file like this 
https://github.com/open-github/pulsar/blob/master/.github/workflows/ci-documentbot.yml.
  which is both applicable for whether composite or docker.
   
   This will send PR in both pulsar-test-infra and pulsar repo.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] tisonkun closed issue #58: Use Docker container action for docbot

2022-08-24 Thread GitBox


tisonkun closed issue #58: Use Docker container action for docbot
URL: https://github.com/apache/pulsar-test-infra/issues/58


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-test-infra] tisonkun commented on issue #58: Use Docker container action for docbot

2022-08-24 Thread GitBox


tisonkun commented on issue #58:
URL: 
https://github.com/apache/pulsar-test-infra/issues/58#issuecomment-1225879730

   Cool! Then I'll close this issue as won't do.
   
   Although, we can still try to let the docbot action to be used as barely 
`apache/pulsar-test-infra/docbot@main`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread 丛搏
Hi all,

I also think it's a good idea. Not only can it prevent the duplication of PIP
numbers, but most importantly, it is easier to annotate, in other words,
It can review the PIP just like the review code, it can make the problem
more focused, and it is easier for the viewer to annotate at any time.
It didn't make a huge difference, just from ISSUE to PR. We have no
reason not to change it, having it will greatly improve the review
efficiency of PIP.

Thanks,
Bo

PengHui Li  于2022年8月24日周三 10:01写道:
>
> Hi Rajan,
>
> > I didn't understand this part. You want one to create a PR before
> submitting a proposal? That's clearly not a good idea because if the PIP
> approach will change then the entire development effort will be wasted and
> that's the whole purpose of PIP. I guess creating PIP into an issue and
> discussing the issue is definitely working and it's an easier way to
> discuss quickly rather than discussing over email threads.
>
> Sorry, I mean create a PR to add the proposal, not the implementation.
> Before adding the proposal, we should discuss the motivation on the mailing
> list first.
>
> > Let's not change this practice without good discussion and agreement from
> the community.
>
> I don't think this essentially changes this part, it just provides a way
> for reviewers to
> review the details of the proposal more efficiently. For a quick
> discussion, this will not change
> it. Let me try to clarify the procedure after we applied this idea
>
>
>1. Send the email first to discuss the motivation of your proposal
>2. Discuss the motivation under the mailing list and request a PIP
>number if there are no objections
>3. Create a PR to add the detailed proposal
>4. *Send out the VOTE email with the PR link*
>5. *Review the proposal(The PR) on Github and vote under the mailing
>list.*
>6. Merge the PR of the proposal after the proposal get 3 (+1) bindings
>
> The only difference is step 4 and step 5. Currently, we share the issue
> link to ask for a review.
>
> Thanks,
> Penghui
>
> On Wed, Aug 24, 2022 at 1:22 AM Rajan Dhabalia 
> wrote:
>
> > Hi,
> >
> > >>> I think we can move all the PIPs to the codebase and the new proposal
> > and proposal without any reviews should happen with a PR first. So that we
> > can review and comment easily.
> >
> > I didn't understand this part. You want one to create a PR before
> > submitting a proposal? That's clearly not a good idea because if the PIP
> > approach will change then the entire development effort will be wasted and
> > that's the whole purpose of PIP. I guess creating PIP into an issue and
> > discussing the issue is definitely working and it's an easier way to
> > discuss quickly rather than discussing over email threads.
> >
> > Let's not change this practice without good discussion and agreement from
> > the community.
> >
> > Thanks,
> > Rajan
> >
> > On Thu, Aug 18, 2022 at 8:27 AM PengHui Li  wrote:
> >
> > > Hi all,
> > >
> > > Currently, the new proposal will be added to the issue list and then
> > shared
> > > link in the email
> > > to request the proposal review. It's really hard to review a long
> > proposal
> > > if you want to comment
> > > in detail.
> > >
> > > Here is an example:
> > > https://github.com/apache/pulsar/issues/16763#issuecomment-1219606491
> > > This seems very unintuitive.
> > >
> > > I think we can move all the PIPs to the codebase and the new proposal and
> > > proposal without
> > > any reviews should happen with a PR first. So that we can review and
> > > comment easily.
> > > Certainly, all the votes should happen on the mailing list. And we can
> > also
> > > discuss the
> > > proposal on the mailing list.
> > >
> > > Following this way, we don't need to sync the PIPs from the issue to the
> > > wiki page.
> > > We can just add a link that points to the PIPs dir to the contribution
> > > guide or README.
> > >
> > > We have another pain point about the duplicated PIP number. We can
> > maintain
> > > a file, a list of
> > > all the proposal contains the approved, in-review, drafting. Before
> > > creating a proposal, we should
> > > have a discussion first on the mailing list, just get feedback on the
> > > motivation. If there are no objections,
> > > the proposal owner can add a line to the file with the PIP number
> > through a
> > > PR, like PIP-123: xxx (Under Discussion).
> > > So that we can prevent the duplicated PIP number(which will conflict if
> > > someone merged first).
> > > After the PR is merged, we can send out a new PR to add the proposal.
> > >
> > > Thanks,
> > > Penghui
> > >
> >


[GitHub] [pulsar-test-infra] maxsxu commented on issue #58: Use Docker container action for docbot

2022-08-24 Thread GitBox


maxsxu commented on issue #58:
URL: 
https://github.com/apache/pulsar-test-infra/issues/58#issuecomment-1225869732

   @tisonkun  You have raised a very valuable question. I do curious about the 
real performance between composite action and docker action. So I did a quick 
test and the results show that composite action takes only about half the 
execution time of docker action.
   
   - 
[Comparison](https://github.com/open-github/pulsar/actions/workflows/ci-documentbot.yml)
 as shown below:
   
   > https://user-images.githubusercontent.com/15963141/186450802-7b95a180-6ee5-4f19-8653-697846bbc1de.png";>

   - [Details of composite 
action](https://github.com/open-github/pulsar/runs/7995320623?check_suite_focus=true):
   
   > https://user-images.githubusercontent.com/15963141/186451495-04591dfd-e813-4896-9465-de000309847b.png";>
   
   - [Details of docker 
action:](https://github.com/open-github/pulsar/runs/7997522614?check_suite_focus=true)
   
   > https://user-images.githubusercontent.com/15963141/186451851-c94ea119-7076-41b7-a129-003a8dc0468d.png";>
   
   For short operations like labeling, we need to run our action quickly. So 
execution speed is the primary requirement.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [VOTE] PIP-193 : Sink preprocessing Function

2022-08-24 Thread Christophe Bornet
Thanks to you all,

Closing this vote with 8 +1s (4 bindings and 4 non-bindings) and no -1s.
We'll adjust the naming during the implementation.

Thanks,

Christophe

Le mar. 23 août 2022 à 07:04, Michael Marshall  a
écrit :

> +1 (binding)
>
> I have some thoughts on naming, too.
>
> My preference is to call it a “mapper function”, though I think
> “decorator function” is a good name too. My concern with the
> preprocessing (and the correlated future postprocessing) name is that
> it's not clear to me what the "processing" step is. Converting a
> message to another message seems like the processing step for the
> source/sink.
>
> Either way, these are minor details. The feature looks good to me.
>
> Thanks,
> Michael
>
>
>
>
> On Tue, Aug 16, 2022 at 12:55 PM Lari Hotari  wrote:
> >
> > +1 (binding)
> >
> > Good proposal. My only comment is about the naming. Instead of calling
> it a "preprocessing function" or "extra function", I'd propose calling it
> "decorator function".
> >
> > -Lari
> >
> > On 2022/07/28 10:39:35 Christophe Bornet wrote:
> > > Hi, Pulsar community,
> > >
> > > I'd like to start a vote on PIP-193 : Sink preprocessing Function
> > >
> > > You can find the proposal at
> https://github.com/apache/pulsar/issues/16739 and
> > > the discussion thread at
> > > https://lists.apache.org/thread/qn59jwn47w9ngxpkvq3kswbl1y882jth.
> > >
> > > The vote will stay open for at least 48 hours.
> > >
> > > Best regards.
> > >
> > > Christophe Bornet
> > >
>


Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread PengHui Li
+1 (binding)

Best,
Penghui

On Wed, Aug 24, 2022 at 9:39 PM Dave Fisher  wrote:

> +1 (binding)
>
> Best,
> Dave
>
> Sent from my iPhone
>
> > On Aug 24, 2022, at 12:43 AM, Yunze Xu 
> wrote:
> >
> > +1 (non binding)
> >
> > Thanks,
> > Yunze
> >
> >
> >
> >
> >> 2022年8月24日 15:38,Nicolò Boschi  写道:
> >>
> >> +1 (non binding)
> >> Nicolò Boschi
> >>
> >>
> >>> Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
> >>> eolive...@gmail.com> ha scritto:
> >>>
> >>> Hello,
> >>> this is the official thread VOTE for PIP-201 Extensions mechanism for
> >>> Pulsar Admin CLI tools
> >>>
> >>> This is the PIP link https://github.com/apache/pulsar/issues/17155
> >>> This is the discussion:
> >>> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
> >>>
> >>> I am still working on the PR, that is the subject of the VOTE.
> >>>
> >>> Best regards
> >>> Enrico Olivelli
> >>>
> >
>
>


Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Dave Fisher
+1 (binding)

Best,
Dave

Sent from my iPhone

> On Aug 24, 2022, at 12:43 AM, Yunze Xu  wrote:
> 
> +1 (non binding)
> 
> Thanks,
> Yunze
> 
> 
> 
> 
>> 2022年8月24日 15:38,Nicolò Boschi  写道:
>> 
>> +1 (non binding)
>> Nicolò Boschi
>> 
>> 
>>> Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
>>> eolive...@gmail.com> ha scritto:
>>> 
>>> Hello,
>>> this is the official thread VOTE for PIP-201 Extensions mechanism for
>>> Pulsar Admin CLI tools
>>> 
>>> This is the PIP link https://github.com/apache/pulsar/issues/17155
>>> This is the discussion:
>>> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
>>> 
>>> I am still working on the PR, that is the subject of the VOTE.
>>> 
>>> Best regards
>>> Enrico Olivelli
>>> 
> 



[GitHub] [pulsar] abcMH1966 added a comment to the discussion: I encountered a port error when using pulsar to integrate etcd

2022-08-24 Thread GitBox


GitHub user abcMH1966 added a comment to the discussion: I encountered a port 
error when using pulsar to integrate etcd

It doesn't work. Occur the same error!

![image](https://user-images.githubusercontent.com/59676485/186428980-8648bb8f-fdfb-42d3-91c7-7a63e76e3406.png)

My JDK version is 18.0.2.1

GitHub link: 
https://github.com/apache/pulsar/discussions/17240#discussioncomment-3465717


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



Re: [DISCUSS] Use Github project to track the multi clients development

2022-08-24 Thread Enrico Olivelli
I think that the main problem is that in our community we have plenty
of Java developers and there are a few committers that review
the patches on non-Java client.
The same problem is also for other projects like the Helm Chart.
Also, the Java client is used internally by the broker so it is
usually more used and it needs more work

That said I agree that with the proposal in this thread

Enrico

Il giorno mer 24 ago 2022 alle ore 14:32 Qiang Huang
 ha scritto:
>
> It makes sense to me. I'm thinking, if an issue is encountered in a Java
> client, it is possible to be in other clients. If so, the Java client board
> is also needed.
>
> mattison chao  于2022年8月22日周一 16:00写道:
>
> > +1
> >
> > Best,
> > Mattison
> >
> > On Mon, 22 Aug 2022 at 15:52, Baodi Shi 
> > wrote:
> >
> > > Nice idea!
> > >
> > > Thanks,
> > > Baodi Shi
> > >
> > > > On Aug 22, 2022, at 12:4250, Zike Yang  wrote:
> > > >
> > > > Hi, all
> > > >
> > > > Currently, many features or bug fixes of some clients like the C++
> > > > client and Python client have not caught up with the Java client. For
> > > > better tracking of these features or bug fixes, I have created a
> > > > Github project called `Apache Pulsar / Multi Clients`[0] under the
> > > > pulsar main repo. It can be removed if we don't like it this way.
> > > >
> > > > There are six columns in this Github project:
> > > > * Backlog: Low priority issues or issues(or cards) that have not been
> > > > initially investigated.
> > > > * Todo: Newly added issues and issues suitable for contributors to pick
> > > first.
> > > > * In progress: Work in progress issues or PRs.
> > > > * Review in progress: The PRs or issues that are waiting for review.
> > > > * Reviewer approved: The PRs or issues that are waiting to merge.
> > > > * Done: Merged PRs and Solved issues.
> > > >
> > > > This not only helps to track the status of other clients' features and
> > > > bug fixes but also helps contributors who want to participate in other
> > > > language client feature development to pick the task.
> > > >
> > > > This Project is currently only for clients maintained by the main
> > > > pulsar repository, such as C++ and Python clients.
> > > >
> > > > I have created a PR [1] to add C++ and Python client issues and PRs to
> > > > the Multi Clients project.
> > > >
> > > > Please let me know what you think. Thanks!
> > > >
> > > > [0] https://github.com/apache/pulsar/projects/12
> > > > [1] https://github.com/apache/pulsar/pull/17201
> > > >
> > > > Best,
> > > > Zike Yang
> > >
> > >
> >
>
>
> --
> BR,
> Qiang Huang


Re: [DISCUSS] Apache Pulsar 2.10.2 release

2022-08-24 Thread Enrico Olivelli
+1


Enrico

Il giorno mer 24 ago 2022 alle ore 14:49 Qiang Huang
 ha scritto:
>
> +1
>
> PengHui Li  于2022年8月23日周二 23:42写道:
>
> > +1
> >
> > And, I have added the release/2.10.2 label, Lan Liang.
> >
> > Thanks,
> > Penghui
> >
> > On Tue, Aug 23, 2022 at 11:02 PM Lan Liang 
> > wrote:
> >
> > > Hi,Haiting:
> > >
> > >Thanks for your work of release. Could you help to release PR-16309
> > > [0]  to 2.10.2, It resolved pulsar can not working with etcd cluster.
> > >
> > >Some users are wondering about this part [1].
> > >
> > > [0] https://github.com/apache/pulsar/pull/16309
> > > [1] https://github.com/apache/pulsar/discussions/17240
> > >
> > >
> > >
> > >
> > > Best Regards,
> > > Lan Liang
> > >  Replied Message 
> > > | From | Haiting Jiang |
> > > | Date | 8/23/2022 22:10 |
> > > | To |  |
> > > | Subject | [DISCUSS] Apache Pulsar 2.10.2 release |
> > > Hello, Pulsar community:
> > >
> > > I'd like to propose to release Apache Pulsar 2.10.2
> > >
> > > Currently, we have 189 commits [0] labeled with `release/2.10.2`,
> > > And 39 of them are not cherry-picked yet [1].
> > >
> > > And there are 39 open PRs [2], I will follow them to make sure that
> > > the important fixes could be contained in 2.10.2.
> > >
> > > If you have any important fixes or any questions,
> > > please reply to this email, we will evaluate whether to
> > > include it in 2.10.2
> > >
> > > [0]
> > >
> > https://github.com/apache/pulsar/pulls?q=is%3Amerged+is%3Apr+label%3Arelease%2F2.10.2+
> > > [1]
> > >
> > https://github.com/apache/pulsar/pulls?q=is%3Amerged+is%3Apr+label%3Arelease%2F2.10.2+-label%3Acherry-picked%2Fbranch-2.10+
> > > [2]
> > >
> > https://github.com/apache/pulsar/pulls?q=is%3Aopen+is%3Apr+label%3Arelease%2F2.10.2+
> > >
> > > Best Regards
> > > Haiting Jiang
> > >
> >
>
>
> --
> BR,
> Qiang Huang


[GitHub] [pulsar] abcMH1966 added a comment to the discussion: I encountered a port error when using pulsar to integrate etcd

2022-08-24 Thread GitBox


GitHub user abcMH1966 added a comment to the discussion: I encountered a port 
error when using pulsar to integrate etcd

> [This PR](https://github.com/apache/pulsar/pull/16309) had resolve pulsar 
> working with etcd cluster and seems like have not any release version of 
> pulsar include it. and i have build a version of pulsar from master, could 
> you try it? `docker pull ghcr.io/pulsar-sigs/pulsar:2022-08-23` or `docker 
> pull lypgcs/pulsar:2022-08-23`

**something goes wrong:**
Error: LinkageError occurred while loading main class 
org.apache.pulsar.PulsarClusterMetadataSetup
java.lang.UnsupportedClassVersionError: 
org/apache/pulsar/PulsarClusterMetadataSetup has been compiled by a more recent 
version of the Java Runtime (class file version 61.0), this version of the Java 
Runtime only recognizes class file versions up to 55.0

Can you tell me what's your jdk version?

GitHub link: 
https://github.com/apache/pulsar/discussions/17240#discussioncomment-3465565


This is an automatically sent email for dev@pulsar.apache.org.
To unsubscribe, please send an email to: dev-unsubscr...@pulsar.apache.org



Re: [DISCUSS] Deprecate KeyValue schema factory methods with Class parameters

2022-08-24 Thread Qiang Huang
Nice catch. LGTM.

Yunze Xu  于2022年8月24日周三 16:29写道:

> Hi folks,
>
> Recently I'm looking into the KeyValue schema and found **FOUR**
> static methods in `Schema` to create a `KeyValueSchema` object:
>
> 1. KeyValue(Class key, Class value);
> 2. KeyValue(Class key, Class value, SchemaType type);
> 3. KeyValue(Schema key, Schema value);
> 4. KeyValue(Schema key, Schema value, KeyValueEncodingType
> keyValueEncodingType);
>
> IMO, having too many overloads is not user-friendly. I turned to the
> official website and found overload 4 is used. The overload 3 is just
> a simple version of overload 4 whose encoding type is `INLINE`, but
> it's not clear. I opened a PR
> https://github.com/apache/pulsar/pull/17256 to make it more clear.
>
> However, for overload 1 and 2, I can only find two references in unit
> tests `testAllowNullAvroSchemaCreate` and
> `testAllowNullJsonSchemaCreate`. From the very simple JavaDocs, it
> looks like they are only available for JSON and AVRO schemas.
>
> Then I found the original PR to introduce these overloads:
> https://github.com/apache/pulsar/pull/2885
>
> Code has been changed a lot since that. It looks other codes are not
> much related to these two overloads now. IMO, we should not encourage
> users to use these two overloads, so I suggest marking them as
> deprecated and might remove them in future releases.
>
>
> Thanks,
> Yunze
>
>
>
>
>

-- 
BR,
Qiang Huang


Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Qiang Huang
+1 (non binding)

Yunze Xu  于2022年8月24日周三 15:44写道:

> +1 (non binding)
>
> Thanks,
> Yunze
>
>
>
>
> > 2022年8月24日 15:38,Nicolò Boschi  写道:
> >
> > +1 (non binding)
> > Nicolò Boschi
> >
> >
> > Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
> > eolive...@gmail.com> ha scritto:
> >
> >> Hello,
> >> this is the official thread VOTE for PIP-201 Extensions mechanism for
> >> Pulsar Admin CLI tools
> >>
> >> This is the PIP link https://github.com/apache/pulsar/issues/17155
> >> This is the discussion:
> >> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
> >>
> >> I am still working on the PR, that is the subject of the VOTE.
> >>
> >> Best regards
> >> Enrico Olivelli
> >>
>
>

-- 
BR,
Qiang Huang


Re: [VOTE] [PIP-169] Support split bundle by flow or qps

2022-08-24 Thread Qiang Huang
+1(non-binding)

Enrico Olivelli  于2022年8月24日周三 15:04写道:

> +1 (binding)
>
>
> Enrico
>
> Il giorno mer 24 ago 2022 alle ore 07:13 Aloys Zhang
>  ha scritto:
> >
> > +1
> >
> > Haiting Jiang  于2022年8月24日周三 12:35写道:
> >
> > > +1 (non)
> > >
> > > Thanks,
> > > Haiting
> > >
> > > On Wed, Aug 24, 2022 at 10:19 AM guo jiwei 
> wrote:
> > >
> > > > +1 (binding)
> > > >
> > > >
> > > > Regards
> > > > Jiwei Guo (Tboy)
> > > >
> > > >
> > > > On Wed, Aug 24, 2022 at 1:01 AM Heesung Sohn
> > > >  wrote:
> > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > Heesung
> > > > >
> > > > > On Tue, Aug 23, 2022 at 10:00 AM Anon Hxy 
> wrote:
> > > > > >
> > > > > > +1 (non-binding)
> > > > > >
> > > > > > Thanks,
> > > > > > Xiaoyu Hou
> > > > > >
> > > > > > lordcheng10 <1572139...@qq.com.invalid> 于2022年8月23日周二 15:10写道:
> > > > > >
> > > > > > > Hi Pulsar Community, I would like to start a VOTE on "Support
> split
> > > > > bundle
> > > > > > > by flow or qps."(PIP-169)
> > > > > > > Here is the design detail:
> > > > > https://github.com/apache/pulsar/issues/16782
> > > > > > >
> > > > > > >
> > > > > > > and the discussion thread:
> > > > > > >
> https://lists.apache.org/thread/cshyt10fwcjjxs93g8yf0svgwcgnshmg
> > > > > > >
> > > > > > >
> > > > > > > Thanks,
> > > > > > > lordcheng10
> > > > >
> > > >
> > >
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Minor breaking change to 2 method signatures in RangeCache class

2022-08-24 Thread Qiang Huang
Great work! It can tune the range cache better.

Michael Marshall  于2022年8月24日周三 11:11写道:

> Hi All,
>
> I'd like to add some more metrics to the broker entry cache. In order
> to do so, I needed to update RangeCache#clear and
> RangeCache#evictLEntriesBeforeTimestamp so they return Pair Long> instead of long. This is technically a breaking change, but
> since it is an internal broker class, it would only break broker
> extensions, if they used the return value for these methods.
>
> Please let me know if you think the changes will be a problem.
>
> Here is the PR: https://github.com/apache/pulsar/pull/17248.
>
> Thanks,
> Michael
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Apache Pulsar 2.10.2 release

2022-08-24 Thread Qiang Huang
+1

PengHui Li  于2022年8月23日周二 23:42写道:

> +1
>
> And, I have added the release/2.10.2 label, Lan Liang.
>
> Thanks,
> Penghui
>
> On Tue, Aug 23, 2022 at 11:02 PM Lan Liang 
> wrote:
>
> > Hi,Haiting:
> >
> >Thanks for your work of release. Could you help to release PR-16309
> > [0]  to 2.10.2, It resolved pulsar can not working with etcd cluster.
> >
> >Some users are wondering about this part [1].
> >
> > [0] https://github.com/apache/pulsar/pull/16309
> > [1] https://github.com/apache/pulsar/discussions/17240
> >
> >
> >
> >
> > Best Regards,
> > Lan Liang
> >  Replied Message 
> > | From | Haiting Jiang |
> > | Date | 8/23/2022 22:10 |
> > | To |  |
> > | Subject | [DISCUSS] Apache Pulsar 2.10.2 release |
> > Hello, Pulsar community:
> >
> > I'd like to propose to release Apache Pulsar 2.10.2
> >
> > Currently, we have 189 commits [0] labeled with `release/2.10.2`,
> > And 39 of them are not cherry-picked yet [1].
> >
> > And there are 39 open PRs [2], I will follow them to make sure that
> > the important fixes could be contained in 2.10.2.
> >
> > If you have any important fixes or any questions,
> > please reply to this email, we will evaluate whether to
> > include it in 2.10.2
> >
> > [0]
> >
> https://github.com/apache/pulsar/pulls?q=is%3Amerged+is%3Apr+label%3Arelease%2F2.10.2+
> > [1]
> >
> https://github.com/apache/pulsar/pulls?q=is%3Amerged+is%3Apr+label%3Arelease%2F2.10.2+-label%3Acherry-picked%2Fbranch-2.10+
> > [2]
> >
> https://github.com/apache/pulsar/pulls?q=is%3Aopen+is%3Apr+label%3Arelease%2F2.10.2+
> >
> > Best Regards
> > Haiting Jiang
> >
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Make `Pulsar CI / CI - Unit - Brokers - Flaky` test group to be required

2022-08-24 Thread Qiang Huang
Failure flaky tests reduce the passion of contributors. I suggest making
stable tests as required.
I have 3 points.
1. Find the existing failure flaky tests and fix them. I believe that there
is already a project to record them.
2. Move the failure flaky tests out of the flaky test group.
3. Add them into a new group and fix. And make it required if they are
stable.

PengHui Li  于2022年8月22日周一 17:27写道:

> I agree with moving the tests out of the flaky test group.
> I just checking some new PRs
>
> https://github.com/apache/pulsar/pull/17195
> https://github.com/apache/pulsar/pull/17201
> https://github.com/apache/pulsar/pull/17193
> https://github.com/apache/pulsar/pull/17204
>
> The `Pulsar CI / CI - Unit - Brokers - Flaky` looks more stable than other
> groups such as
>
> `Pulsar CI / CI - System - Pulsar Connectors - Thread`
> `Pulsar CI / CI - Unit - Brokers - Broker Group 2`
> `Pulsar CI / CI - Unit - Brokers - Broker Group 1`
>
> It looks like we can change the test group for now and make it required.
> To a new test group, or move them to `Pulsar CI / CI - Unit - Brokers - * `
>
> Thanks,
> Penghui
>
> On Mon, Aug 22, 2022 at 4:33 PM tison  wrote:
>
> > While agree to require more tests to pass, here are my two coins:
> >
> > 1. This group is named "flaky tests" so I regard it as flaky tests
> > literally. NOT require these tests to pass could be by design. Besides,
> > IIRC some developers keep investigating tests in the flaky test group,
> try
> > to make them stable, and move out of the flaky test group. This seems the
> > desired approach to resolve flaky tests.
> >
> > 2. Instead of barely "require broker-flaky" test group, do you have a
> list
> > of tests that fail frequently? Otherwise, we just go back to the
> situation
> > where we want to exclude it from the required status - it's quite
> unstable.
> >
> > Best,
> > tison.
> >
> >
> > mattison chao  于2022年8月22日周一 15:56写道:
> >
> > > Hi all
> > >
> > > Recently, some tests in the `broker-flaky` test group always failed,
> but
> > > since it doesn't block CI, no one cared for a long time.
> > >
> > > This behaviour causes some test scenarios to go unchecked and risk
> > > introducing some regressions, and I think we need to make this test
> group
> > > set required.
> > >
> > > e.g. https://github.com/apache/pulsar/pull/17163
> > >
> > > WDYT?
> > >
> > > Best,
> > > Mattison
> > >
> >
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Extended auth operation

2022-08-24 Thread Qiang Huang
Nice catch! +1

Zixuan Liu  于2022年8月22日周一 16:49写道:

> Hi all,
>
> I noticed we added the TenantOperation, NamespaceOperation, and
> PolicyOperation in https://github.com/apache/pulsar/pull/6428, which
> provide the "real" authz abilities for each resource. This PR only adds
> check permissions method in authorization provider, no grant, revoke, and
> get action, so I want to add this feature.
>
> Best,
> Zixuan
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Use Github project to track the multi clients development

2022-08-24 Thread Qiang Huang
It makes sense to me. I'm thinking, if an issue is encountered in a Java
client, it is possible to be in other clients. If so, the Java client board
is also needed.

mattison chao  于2022年8月22日周一 16:00写道:

> +1
>
> Best,
> Mattison
>
> On Mon, 22 Aug 2022 at 15:52, Baodi Shi 
> wrote:
>
> > Nice idea!
> >
> > Thanks,
> > Baodi Shi
> >
> > > On Aug 22, 2022, at 12:4250, Zike Yang  wrote:
> > >
> > > Hi, all
> > >
> > > Currently, many features or bug fixes of some clients like the C++
> > > client and Python client have not caught up with the Java client. For
> > > better tracking of these features or bug fixes, I have created a
> > > Github project called `Apache Pulsar / Multi Clients`[0] under the
> > > pulsar main repo. It can be removed if we don't like it this way.
> > >
> > > There are six columns in this Github project:
> > > * Backlog: Low priority issues or issues(or cards) that have not been
> > > initially investigated.
> > > * Todo: Newly added issues and issues suitable for contributors to pick
> > first.
> > > * In progress: Work in progress issues or PRs.
> > > * Review in progress: The PRs or issues that are waiting for review.
> > > * Reviewer approved: The PRs or issues that are waiting to merge.
> > > * Done: Merged PRs and Solved issues.
> > >
> > > This not only helps to track the status of other clients' features and
> > > bug fixes but also helps contributors who want to participate in other
> > > language client feature development to pick the task.
> > >
> > > This Project is currently only for clients maintained by the main
> > > pulsar repository, such as C++ and Python clients.
> > >
> > > I have created a PR [1] to add C++ and Python client issues and PRs to
> > > the Multi Clients project.
> > >
> > > Please let me know what you think. Thanks!
> > >
> > > [0] https://github.com/apache/pulsar/projects/12
> > > [1] https://github.com/apache/pulsar/pull/17201
> > >
> > > Best,
> > > Zike Yang
> >
> >
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Enable message deduplication for replicator by default

2022-08-24 Thread Qiang Huang
+1

Zike Yang  于2022年8月22日周一 15:32写道:

> +1
>
> Thanks,
> Zike Yang
>
> On Mon, Aug 22, 2022 at 3:16 PM mattison chao 
> wrote:
> >
> > +1
> >
> > Best,
> > Mattison
> >
> > On Fri, 19 Aug 2022 at 01:40, Enrico Olivelli 
> wrote:
> >
> > > I agree
> > >
> > > Enrico
> > >
> > > Il Gio 18 Ago 2022, 18:23 PengHui Li  ha scritto:
> > >
> > > > Hi all,
> > > >
> > > > When I tried to fix a problem related to replicator
> > > > https://github.com/apache/pulsar/pull/17154
> > > > It surprised me that the message deduplication will not work by
> default
> > > > with the replicator.
> > > > I always thought it was enabled for replicators by default. Details
> to
> > > see
> > > > [0].
> > > >
> > > > I think we should enable the deduplication for the replicator.
> Otherwise,
> > > > we will see duplicated
> > > > messages on the remote cluster. And the producer of the replicator
> always
> > > > has a fixed producer
> > > > name, this will make the message deduplication work properly.
> > > >
> > > > The test introduced in https://github.com/apache/pulsar/pull/17154
> will
> > > > check the message
> > > > replication ordering. Without the message deduplication enabled, the
> test
> > > > is flaky with received
> > > > duplicated messages. After enabling, everything is fine.
> > > >
> > > > Best,
> > > > Penghui
> > > >
> > > > [0]
> https://github.com/apache/pulsar/pull/17154#discussion_r948736894
> > > >
> > >
>


-- 
BR,
Qiang Huang


Re: [DISCUSS] Enable non-mandatory updating PR branches

2022-08-24 Thread Qiang Huang
It works now. It helps.

Zixuan Liu  于2022年8月22日周一 10:38写道:

> +1
>
> Best,
> Zixuan
>
> On 2022/08/18 13:01:58 tison wrote:
> > Hello,
> >
> > The short version
> > =
> >
> > Vote if you agree on enabling the non-mandatory updating PR branches
> > button, i.e., the "Always suggest updating pull request branches" GitHub
> > settings.
> >
> > The full version
> > 
> >
> > Pulsar is under rapid development and numerous fixes are pushed to master
> > every time. Since we are still suffering from quite a few flaky tests,
> > merge master and retest is a hotspot to verify the patch once more.
> >
> > However, we should pull the nightly master locally, check out the PR
> > branch, perform the merge and push to remote. It's a bit awkward
> especially
> > when a developer works on multiple branches.
> >
> > GitHub provides a button "Always suggest updating pull request branches"
> > with the description "Whenever there are new changes available in the
> base
> > branch, present an “update branch” option in the pull request."[1]
> >
> > It can simplify the workflow with one button click.
> >
> > To clarify, this is different from the branch protection rule "Require
> > branches to be up to date before merging" - it's non-mandatory and just
> > provides the "update branch" button. It means we don't force every PR to
> > catch up with the latest master before merged, which can cause
> exextremely
> > high unnecessary traffic. Since we already allow PR authors to retrigger
> > tests with the pulsorbot (or even contributors can push an empty commit),
> > providing such a button does no harm.
> >
> > I post this thread here to collect feedback, especially from the PMC
> > members. Previously I asked the INFRA team to turn on this option for
> > Apache Kvrocks (Incubating)[2] and I believe the INFRA team would be
> happy
> > to see an explicit community consensus.
> >
> > Best,
> > tison.
> >
> > [1]
> >
> https://github.blog/changelog/2022-02-03-more-ways-to-keep-your-pull-request-branch-up-to-date/
> > [2] https://issues.apache.org/jira/browse/INFRA-23432
> >



-- 
BR,
Qiang Huang


Re: Enable "Update branch" Github button in pull requests

2022-08-24 Thread Qiang Huang
+1. It can help a lot.

Zixuan Liu  于2022年8月22日周一 10:38写道:

> +1
>
> Best,
> Zixuan
>
> On 2022/07/12 16:06:43 Nicolò Boschi wrote:
> > Hi all,
> >
> > I'd like to propose to enable the Github button "update branch" in the
> pull
> > requests.
> >
> > The main reason is that it helps when you want to rebase your pull to the
> > current master.
> > For instance today a fix for a failing job in the CI has been committed
> and
> > you were forced to close/reopen the pull or manual rebase your branch.
> With
> > this button is much easier and it will retrigger the CI as well.
> >
> > This is the guide to enable it:
> >
> https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/keeping-your-pull-request-in-sync-with-the-base-branch
> >
> > This is an example where I activated the button in my own Pulsar fork:
> > https://github.com/nicoloboschi/pulsar/pull/4
> >
> > Thanks,
> > Nicolò Boschi
> >



-- 
BR,
Qiang Huang


[GitHub] [pulsar-site] urfreespace merged pull request #169: Move config docs to /reference and improve docs gen scripts

2022-08-24 Thread GitBox


urfreespace merged PR #169:
URL: https://github.com/apache/pulsar-site/pull/169


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] Anonymitaet commented on issue #173: Improve color shades for dark mode

2022-08-24 Thread GitBox


Anonymitaet commented on issue #173:
URL: https://github.com/apache/pulsar-site/issues/173#issuecomment-1225476599

   Hi designers, @gancunjie @DaveDuggins, is it a good practice to set all the 
texts in the dark color background to `white`?
   
   https://user-images.githubusercontent.com/50226895/186384398-a6b3c13f-b49b-457a-9895-64740cfe3c48.png";>
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [Discussion] PIP 78: Replace brodocs with docsify?

2022-08-24 Thread Yu
Hi Dave, thanks for your great reminder! We'll add those rules.

Yu, signormercurio, urfreespace

On Tue, Aug 23, 2022 at 10:33 PM Dave Fisher  wrote:

> If page urls change then we should also add http redirect rules to help
> users who bookmarked these docs.
>
> Sent from my iPhone
>
> > On Aug 23, 2022, at 5:53 AM, Yu  wrote:
> >
> > Hi team,
> >
> > Now is 3 working days after the initial discussion
> >
> > We'll start replacing brodocs with docsify for docs since there is no
> > objection or technical suggestions.
> >
> > Yu, signormercurio, urfreespace
> >
> >> On Fri, Aug 19, 2022 at 5:09 PM Yu  wrote:
> >>
> >> Thanks Dave!
> >>
> >>> How is this search organized on the server and how would this integrate
> >> with algolia?
> >>
> >> Search is performed at the front end, and it does not need to integrate
> >> with Algolia.
> >>
> >>> Google Analytics will be disallowed on ASF project sites very soon due
> >> to Privacy concerns with tracking data. I’ve warned about this several
> >> times. Please eliminate GA. If we need to track then we have options.
> >>
> >> Listing GA here just shows docsify's capabilities.
> >> As we discussed before [1], GA will no longer process new data in
> standard
> >> properties beginning July 1, 2023.
> >> Matteo is being considered to replace it.
> >>
> >> [1] https://github.com/apache/pulsar/issues/15664
> >>
> >> Yu and signormercurio
> >>
> >>
> >>
> >>
> >>
> >>
> >>> On Thu, Aug 18, 2022 at 9:46 PM Dave Fisher 
> wrote:
> >>>
> >>>
> >>>
> >>> Sent from my iPhone
> >>>
>  On Aug 18, 2022, at 4:55 AM, Yu  wrote:
> 
>  Hi team,
> 
>  To reduce manual work, we’re constantly implementing PIP 78: Generate
> >>> Docs
>  from Code Automatically [1].
> 
>  Docs here refer to the following:
>  - Connector configurations [2]
>  - Client configurations [3]
>  - Pulsar CLI tools [4]
>  - Pulsar configurations [5]
>  - pulsar-admin [6]
> 
>  # Question
> 
>  Currently, some of the docs above [7] are generated using brodocs [8].
>  We’re wondering if it makes sense to replace brodocs with docsify?
> 
>  # docsify benefits
> 
>  ## For users
>  - Enjoy new features (eg. search, copy to clipboard, language
> >>> highlighting,
>  tabs, Disqus, etc.)
> >>>
> >>> How is this search organized on the server and how would this integrate
> >>> with algolia?
> >>>
>  - More clear and organized layout
> 
>  ## For doc maintainers
>  - Simple and lightweight
>  - No statically built HTML files. Instead, it smartly loads and parses
> >>> .md
>  files and displays them as a website.
>  - Allow feature customization
>  - Able to work with Google Analytics to track data
> >>>
> >>> Google Analytics will be disallowed on ASF project sites very soon due
> to
> >>> Privacy concerns with tracking data. I’ve warned about this several
> times.
> >>> Please eliminate GA. If we need to track then we have options.
> >>>
> >>> Also we never discuss GA anywhere within the PMC or developer
> community.
> >>> What value does it have?
> >>>
> >>> Dave
> >>>
> 
>  # Demo
> 
>  We’ve created a demo here [9].
>  This demo is only for demonstration purposes, so we just add a few
>  improvements. It can be iterated later.
> 
>  
> 
>  If docsify is better, we’ll use it for all the docs above to provide a
>  consistent content experience.
> 
>  Feel free to check and comment, thank you!
> 
>  [1] https://lists.apache.org/thread/638q6kmq81z2qjmyhgkqzp00sllh4x0w
>  [2] https://pulsar.apache.org/docs/next/io-connectors (Configurations
> >>> for
>  each connector)
>  [3] https://pulsar.apache.org/docs/next/client-libraries-java
>  (Configuration for each client. Here takes Java client as an example,
> it
>  refers to configurations of client, producer, consumer, and reader)
>  [4] https://pulsar.apache.org/docs/next/reference-cli-tools
>  [5] https://pulsar.apache.org/docs/next/reference-configuration
>  [6] https://pulsar.apache.org/tools/pulsar-admin/
>  [7] https://pulsar.apache.org/tools/
>  [8] https://github.com/Birdrock/brodocs
>  [9] https://pulsar.apache.org/tools/pulsar-config/2.11.0-SNAPSHOT/#/
> 
>  Yu, signormercurio, urfreespace
> >>>
> >>>
>
>


Re: [DISCUSS] Move PIPs to the codebase?

2022-08-24 Thread Yu
Hi Penghui, Thanks for raising this up!

> we should have a discussion first on the mailing list, just get feedback
on the
> motivation. If there are no objections,
> the proposal owner can add a line to the file with the PIP number through
> a PR, like PIP-123: xxx (Under Discussion).
> So that we can prevent the duplicated PIP number

This is a good idea to ensure no NO. is re-used or jumped over.

As you explained, we can act like this:

1. Send the PIP idea to the community without using a NO.

2. After the idea is approved, add "PIP 123 (under discussion)" to the "PIP
metadata" file.

3. ...

Yu


On Wed, Aug 24, 2022 at 10:01 AM PengHui Li  wrote:

> Hi Rajan,
>
> > I didn't understand this part. You want one to create a PR before
> submitting a proposal? That's clearly not a good idea because if the PIP
> approach will change then the entire development effort will be wasted and
> that's the whole purpose of PIP. I guess creating PIP into an issue and
> discussing the issue is definitely working and it's an easier way to
> discuss quickly rather than discussing over email threads.
>
> Sorry, I mean create a PR to add the proposal, not the implementation.
> Before adding the proposal, we should discuss the motivation on the mailing
> list first.
>
> > Let's not change this practice without good discussion and agreement from
> the community.
>
> I don't think this essentially changes this part, it just provides a way
> for reviewers to
> review the details of the proposal more efficiently. For a quick
> discussion, this will not change
> it. Let me try to clarify the procedure after we applied this idea
>
>
>1. Send the email first to discuss the motivation of your proposal
>2. Discuss the motivation under the mailing list and request a PIP
>number if there are no objections
>3. Create a PR to add the detailed proposal
>4. *Send out the VOTE email with the PR link*
>5. *Review the proposal(The PR) on Github and vote under the mailing
>list.*
>6. Merge the PR of the proposal after the proposal get 3 (+1) bindings
>
> The only difference is step 4 and step 5. Currently, we share the issue
> link to ask for a review.
>
> Thanks,
> Penghui
>
> On Wed, Aug 24, 2022 at 1:22 AM Rajan Dhabalia 
> wrote:
>
> > Hi,
> >
> > >>> I think we can move all the PIPs to the codebase and the new proposal
> > and proposal without any reviews should happen with a PR first. So that
> we
> > can review and comment easily.
> >
> > I didn't understand this part. You want one to create a PR before
> > submitting a proposal? That's clearly not a good idea because if the PIP
> > approach will change then the entire development effort will be wasted
> and
> > that's the whole purpose of PIP. I guess creating PIP into an issue and
> > discussing the issue is definitely working and it's an easier way to
> > discuss quickly rather than discussing over email threads.
> >
> > Let's not change this practice without good discussion and agreement from
> > the community.
> >
> > Thanks,
> > Rajan
> >
> > On Thu, Aug 18, 2022 at 8:27 AM PengHui Li  wrote:
> >
> > > Hi all,
> > >
> > > Currently, the new proposal will be added to the issue list and then
> > shared
> > > link in the email
> > > to request the proposal review. It's really hard to review a long
> > proposal
> > > if you want to comment
> > > in detail.
> > >
> > > Here is an example:
> > > https://github.com/apache/pulsar/issues/16763#issuecomment-1219606491
> > > This seems very unintuitive.
> > >
> > > I think we can move all the PIPs to the codebase and the new proposal
> and
> > > proposal without
> > > any reviews should happen with a PR first. So that we can review and
> > > comment easily.
> > > Certainly, all the votes should happen on the mailing list. And we can
> > also
> > > discuss the
> > > proposal on the mailing list.
> > >
> > > Following this way, we don't need to sync the PIPs from the issue to
> the
> > > wiki page.
> > > We can just add a link that points to the PIPs dir to the contribution
> > > guide or README.
> > >
> > > We have another pain point about the duplicated PIP number. We can
> > maintain
> > > a file, a list of
> > > all the proposal contains the approved, in-review, drafting. Before
> > > creating a proposal, we should
> > > have a discussion first on the mailing list, just get feedback on the
> > > motivation. If there are no objections,
> > > the proposal owner can add a line to the file with the PIP number
> > through a
> > > PR, like PIP-123: xxx (Under Discussion).
> > > So that we can prevent the duplicated PIP number(which will conflict if
> > > someone merged first).
> > > After the PR is merged, we can send out a new PR to add the proposal.
> > >
> > > Thanks,
> > > Penghui
> > >
> >
>


Re: [DISCUSS] PIP-200 Package Pulsar Trino distro and config in a dedicated folder

2022-08-24 Thread tison
Hi Enrico and Qiang,

The issue description was updated. Picked here:

> upgrade and downgrade doc

This change should not affect those who use Pulsar with the entry point
script, but it changes the layout of the release artifact.

I'm going to write a release note about this change and also post it on the
Pulsar SQL overview page as a caveat. Draft here:

# Caveat

If you're upgrading Pulsar SQL from 2.11 or early, you should copy the
related configs from `conf/presto` to `trino/conf`, and `lib/presto` to
`trino`. If you're downgrading Pulsar SQL to 2.11 or early from 2.12, do
verse visa.

> Pulsar Helm Chart

>From what I understand, Pulsar Helm Chart is a wrapper of pulsar-all docker
image. I don't find any reference to presto/trino/sql effectively in the
codebase, so I think it's currently transparent for its users and the
caveat under Pulsar SQL overview page should be enough.

Best,
tison.


tison  于2022年8月17日周三 19:40写道:

> Thanks for your feedback!
>
> 1. According to the upgrade and downgrade doc, I think the minimum
> requirements are a release note. Describe the layout change and how users
> should move the folder. I'll elaborate on the issue and notify you here.
> 2. It seems Pulsar Helm Chart support 2.9.3 now. I'll investigate how it
> can be relevant in days. If someone who maintains the Chart can provide
> some input, it will help!
> 3. "There are 3 issues". It's a description about the
> background/motivation, while we handle the first issue in this PIP.
> Although, subtasks can be divided into packaging changes, possible doc
> changes, and possible Chart changes.
>
> Best,
> tison.
>
>
> Qiang Huang  于2022年8月17日周三 19:25写道:
>
>> Looks good. I have two points:
>> 1. It is necessary to supplement the upgrade and downgrade documentation
>> in
>> Pulsar.
>> 2. There are 3 issues mentioned in the PIP, should we split it into 3
>> small
>> issues?
>>
>> Enrico Olivelli  于2022年8月17日周三 17:30写道:
>>
>> > I generally agree with the PIP
>> >
>> > Can you please explain the interactions with the Pulsar Helm chart ?
>> > also we have to draw a migration path, because users that will upgrade
>> > Pulsar will have to move the configuration files in another location
>> >
>> > Enrico
>> >
>> > Il giorno mer 17 ago 2022 alle ore 11:15 tison 
>> > ha scritto:
>> > >
>> > > Hello,
>> > >
>> > > This is a PIP to package the Pulsar Trino distro and config in a
>> > dedicated
>> > > folder.
>> > >
>> > > Link: https://github.com/apache/pulsar/issues/17137
>> > > Prototype: https://github.com/apache/pulsar/pull/17062
>> > >
>> > > Below you can find the proposal (I will amend the GH issue while we
>> > discuss
>> > > it).
>> > >
>> > > Best,
>> > > tison.
>> > >
>> > > Motivation
>> > > 
>> > >
>> > > After https://github.com/apache/pulsar/pull/16683 merged, we upgrade
>> > > PrestoSQL dependency in Pulsar SQL to the first several Trino
>> version. To
>> > > handle the name change cases and gradually refactor Pulsar SQL as a
>> > > self-contained module so that we can move it into a standalone
>> > repository,
>> > > I find that there're three major issues to resolve.
>> > >
>> > > 1. Configs of Pulsar SQL go under the `conf/` folder and mix with
>> other
>> > > Pulsar configs.
>> > > 2. Pulsar Docker images (base and all) bundle Pulsar SQL.
>> > > 3. Integration tests of Pulsar SQL are tightly coupled with the main
>> repo
>> > > (test infra).
>> > >
>> > > This proposal is aimed at resolving the first issue to package Pulsar
>> > Trino
>> > > distro and config in a dedicated folder; that is, to make it
>> > self-contained.
>> > >
>> > > Goal
>> > > 
>> > >
>> > > I have already prepared a draft to perform the changes as
>> > > https://github.com/apache/pulsar/pull/17062. Generally, we move the
>> > config
>> > > files under `PRESTO_HOME` and correspondingly update scripts.
>> > >
>> > > In this way, all Trino distro artifacts are under the same home path,
>> so
>> > > that we can later move it out as a whole.
>> > >
>> > > This change should not affect those who use Pulsar with the entry
>> point
>> > > script, but it changes the layout of the release artifact, so I'd
>> prefer
>> > to
>> > > perform a PIP process.
>> > >
>> > > Implementation
>> > > 
>> > >
>> > > It's straightforward to inline in the "Goal" section.
>> > >
>> > > However, the name of the folder (`presto` or `trino`) and the level of
>> > the
>> > > folder (`lib/presto/` or `trino/`) is open to discussion. I think both
>> > are
>> > > fine and will try `trino/` first.
>> > >
>> > > To minimize unnecessary changes, I tend to keep the modules name
>> > > `pulsar-presto-xxx` as is.
>> > >
>> > > Alternatives
>> > > =
>> > >
>> > > I don't make a completed proposal to resolve all three issues listed
>> > above.
>> > > Because I'm still unfamiliar with the latter two topics yet and I'd
>> > prefer
>> > > to implement these improvements one by one since they're naturally
>> > > independent. If I try to make a completed 

[DISCUSS] Deprecate KeyValue schema factory methods with Class parameters

2022-08-24 Thread Yunze Xu
Hi folks,

Recently I'm looking into the KeyValue schema and found **FOUR**
static methods in `Schema` to create a `KeyValueSchema` object:

1. KeyValue(Class key, Class value);
2. KeyValue(Class key, Class value, SchemaType type);
3. KeyValue(Schema key, Schema value);
4. KeyValue(Schema key, Schema value, KeyValueEncodingType 
keyValueEncodingType);

IMO, having too many overloads is not user-friendly. I turned to the
official website and found overload 4 is used. The overload 3 is just
a simple version of overload 4 whose encoding type is `INLINE`, but
it's not clear. I opened a PR
https://github.com/apache/pulsar/pull/17256 to make it more clear.

However, for overload 1 and 2, I can only find two references in unit
tests `testAllowNullAvroSchemaCreate` and
`testAllowNullJsonSchemaCreate`. From the very simple JavaDocs, it
looks like they are only available for JSON and AVRO schemas.

Then I found the original PR to introduce these overloads:
https://github.com/apache/pulsar/pull/2885

Code has been changed a lot since that. It looks other codes are not
much related to these two overloads now. IMO, we should not encourage
users to use these two overloads, so I suggest marking them as
deprecated and might remove them in future releases.


Thanks,
Yunze






[GitHub] [pulsar-site] tisonkun commented on issue #173: Improve color shades for dark mode

2022-08-24 Thread GitBox


tisonkun commented on issue #173:
URL: https://github.com/apache/pulsar-site/issues/173#issuecomment-1225347135

   @Anonymitaet for this specific issue, I think we need UI experts to help 
with selecting colors series.
   
   The related code is 
https://github.com/apache/pulsar-site/blob/e1785ad30a9bd02047ccd31e85e9c89fa1c37c6a/site2/website-next/src/css/custom.css
 while is hosted primary in this repo instead of synced from `pulsar`.
   
   @urfreespace suggests I report site-related topics here and content-related 
topics in the main repo. Although, I agree on the "High visibility". I'm OK to 
just disable the issue tab in this repo and report all issues in `pulsar` - 
[SkyWalking](https://github.com/apache/skywalking) follows this pattern.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Yunze Xu
+1 (non binding)

Thanks,
Yunze




> 2022年8月24日 15:38,Nicolò Boschi  写道:
> 
> +1 (non binding)
> Nicolò Boschi
> 
> 
> Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
> eolive...@gmail.com> ha scritto:
> 
>> Hello,
>> this is the official thread VOTE for PIP-201 Extensions mechanism for
>> Pulsar Admin CLI tools
>> 
>> This is the PIP link https://github.com/apache/pulsar/issues/17155
>> This is the discussion:
>> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
>> 
>> I am still working on the PR, that is the subject of the VOTE.
>> 
>> Best regards
>> Enrico Olivelli
>> 



[GitHub] [pulsar-site] Anonymitaet commented on issue #173: Improve color shades for dark mode

2022-08-24 Thread GitBox


Anonymitaet commented on issue #173:
URL: https://github.com/apache/pulsar-site/issues/173#issuecomment-1225320929

   @tisonkun thanks for raising this up!
   
   Even though the Pulsar website is built in the `pulsar-site` repo and many 
website-related data are hosted here, I still suggest reporting website-related 
issues in the `pulsar` repo. 
   
   Reasons:
   
   - High visibility, which means the issue can get more attention since the 
majority of ppl go to `pulsar`. Only a few ppl know `pulsar-site`.
   
   - From past to now, we nearly track all website-related issues in `pulsar`, 
it's convenient for us to triage and check them all in one place.
   
   Thoughts?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Nicolò Boschi
+1 (non binding)
Nicolò Boschi


Il giorno mer 24 ago 2022 alle ore 09:11 Enrico Olivelli <
eolive...@gmail.com> ha scritto:

> Hello,
> this is the official thread VOTE for PIP-201 Extensions mechanism for
> Pulsar Admin CLI tools
>
> This is the PIP link https://github.com/apache/pulsar/issues/17155
> This is the discussion:
> https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo
>
> I am still working on the PR, that is the subject of the VOTE.
>
> Best regards
> Enrico Olivelli
>


[GitHub] [pulsar-site] Anonymitaet merged pull request #171: Update the website banner

2022-08-24 Thread GitBox


Anonymitaet merged PR #171:
URL: https://github.com/apache/pulsar-site/pull/171


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [pulsar-site] tisonkun opened a new issue, #173: Improve color shades for dark mode

2022-08-24 Thread GitBox


tisonkun opened a new issue, #173:
URL: https://github.com/apache/pulsar-site/issues/173

   Currently, in dark mode, the contents on several pages are hard to read.
   
   For example, 
https://pulsar.apache.org/docs/next/tiered-storage-filesystem#configure-filesystem-offloader-driver.
   
   https://user-images.githubusercontent.com/18818196/186356215-8b6e3d0f-0ded-4db4-9be7-d63c518f54bc.png";>
   
   cc @Anonymitaet @urfreespace 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@pulsar.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[VOTE] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Enrico Olivelli
Hello,
this is the official thread VOTE for PIP-201 Extensions mechanism for
Pulsar Admin CLI tools

This is the PIP link https://github.com/apache/pulsar/issues/17155
This is the discussion:
https://lists.apache.org/thread/287ft8twc11cp4s1y4qkcx5nmh451cyo

I am still working on the PR, that is the subject of the VOTE.

Best regards
Enrico Olivelli


Re: [DISCUSS] [PIP-201] Extensions mechanism for Pulsar Admin CLI tools

2022-08-24 Thread Enrico Olivelli
Yunze,
I have updated the PIP with the code of a sample command.

Thank you everyone for your comments, I will start a VOTE.

Enrico

Il giorno mar 23 ago 2022 alle ore 05:26 Michael Marshall
 ha scritto:
>
> This proposal looks good to me. I think it will be a helpful addition
> to the pulsar admin tool. I like that the design avoids an unnecessary
> dependency on JCommander.
>
> Thanks,
> Michael
>
>
> On Mon, Aug 22, 2022 at 1:22 PM Enrico Olivelli  wrote:
> >
> > Yunze,
> >
> >
> >
> > Il Lun 22 Ago 2022, 20:20 Enrico Olivelli  ha scritto:
> >
> > >
> > >
> > > Il Lun 22 Ago 2022, 17:07 Yunze Xu  ha
> > > scritto:
> > >
> > >> I will take a look. But I think we should also add a trivial example
> > >> (or a test) in the Apache repo
> > >
> > >
> > I will add integration tests for sure.
> > Currently the PR is only a prototype without tests.
> >
> > Enrico
> >
> > , e.g. just print some messages for an
> > >> extended command.
> > >
> > >
> > > Sure. I will update the PIP
> > >
> > > And the JavaDocs of these interfaces should be
> > >> complete and more clear.
> > >>
> > >
> > >
> > > I will leave that to the implementation PR
> > >
> > > Enrico
> > >
> > >
> > >> Thanks,
> > >> Yunze
> > >>
> > >>
> > >>
> > >>
> > >> > 2022年8月22日 18:37,Enrico Olivelli  写道:
> > >> >
> > >> > Yunze,
> > >> >
> > >> > Il giorno lun 22 ago 2022 alle ore 08:06 Yunze Xu
> > >> >  ha scritto:
> > >> >>
> > >> >> The motivation and goal LGTM, but the API changes look very simple and
> > >> >> hard to use. Do we have to implement all these interfaces for an admin
> > >> >> extension? If yes, could you show an example in the proposal as a
> > >> >> guidance?
> > >> >>
> > >> >> For example, if I just want to implement a simple command:
> > >> >>
> > >> >> ```bash
> > >> >> ./bin/pulsar-admin kafka create-topic  --partitions
> > >> 
> > >> >> ```
> > >> >>
> > >> >> How should I implement these interfaces?
> > >> >
> > >> > This is a example for the implementation that I am going to do for JMS
> > >> >
> > >> https://github.com/datastax/pulsar-jms/pull/53/files#diff-9afaac9c7dc4b3d674e0623cd3d76348b01537c6095e9b5b8e804f59a481cceeR31
> > >> >
> > >> > it is only a mock command at the moment, but it is good to showcase the
> > >> feature
> > >> >
> > >> > Enrico
> > >> >
> > >> >
> > >> >>
> > >> >> Thanks,
> > >> >> Yunze
> > >> >>
> > >> >>
> > >> >>
> > >> >>
> > >> >>> 2022年8月18日 16:23,Enrico Olivelli  写道:
> > >> >>>
> > >> >>> Hello,
> > >> >>> I have drafted a PIP around this proposal.
> > >> >>>
> > >> >>> PIP link: https://github.com/apache/pulsar/issues/17155
> > >> >>>
> > >> >>> I am preparing an official PR, I already have a working prototype.
> > >> >>>
> > >> >>> Copy of the contents of the GH issue is attached for discussion here
> > >> >>> on the Mailing list.
> > >> >>>
> > >> >>> Motivation
> > >> >>>
> > >> >>> There are many projects that are in the Pulsar ecosystem like 
> > >> >>> Protocol
> > >> >>> Handlers (Kafka, MQTT, RabbitMQ) and libraries (JMS…) that need
> > >> >>> additional tools for operating Pulsar following specific conventions
> > >> >>> adopted by each project and to handle custom domain objects (like JMS
> > >> >>> queues, Kafka Consumer Groups...).
> > >> >>>
> > >> >>> Some examples:
> > >> >>>
> > >> >>> Kafka: tools for inspecting internal systems, SchemaRegistry,
> > >> >>> Transaction Manager, Consumers Groups
> > >> >>> JMS: tools to handling Subscriptions and Selectors
> > >> >>> RabbitMQ: tools to handle Pulsar topics and subscription following 
> > >> >>> the
> > >> >>> convention
> > >> >>>
> > >> >>> This is very important as it is hard to follow the conventions of 
> > >> >>> each
> > >> >>> project using pulsar-admin and the administrator may inadvertently
> > >> >>> break the system.
> > >> >>>
> > >> >>> This feature will enhance the UX of the Pulsar Admin CLI tools for 
> > >> >>> the
> > >> >>> benefit of the whole ecosystem and users.
> > >> >>>
> > >> >>> Goal
> > >> >>>
> > >> >>> As we do for many other components in Pulsar, we need a way to 
> > >> >>> enhance
> > >> >>> the CLI tools, pulsar-admin and pulsar-shell, with additional 
> > >> >>> commands
> > >> >>> that are specific to the additional features.
> > >> >>>
> > >> >>> The proposal here is to add an extension mechanism to the 
> > >> >>> pulsar-admin
> > >> >>> (and so pulsar-shell) tool.
> > >> >>> Following the usual conventions for extensions the extension will be
> > >> >>> bundled in a .nar file that may contain additional third party
> > >> >>> libraries.
> > >> >>>
> > >> >>> The extension will be able to provide new top level commands
> > >> >>>
> > >> >>> pulsar-admin my-command-group my-command arguments…
> > >> >>>
> > >> >>> The extension will be able to access the PulsarAdmin API provided by
> > >> >>> the environment.
> > >> >>>
> > >> >>> The extension must not depend directly on the JCommander library but
> > >> >>> we will provide an API to declare the parameters and the other
> > >> >>> met

Re: [VOTE] [PIP-169] Support split bundle by flow or qps

2022-08-24 Thread Enrico Olivelli
+1 (binding)


Enrico

Il giorno mer 24 ago 2022 alle ore 07:13 Aloys Zhang
 ha scritto:
>
> +1
>
> Haiting Jiang  于2022年8月24日周三 12:35写道:
>
> > +1 (non)
> >
> > Thanks,
> > Haiting
> >
> > On Wed, Aug 24, 2022 at 10:19 AM guo jiwei  wrote:
> >
> > > +1 (binding)
> > >
> > >
> > > Regards
> > > Jiwei Guo (Tboy)
> > >
> > >
> > > On Wed, Aug 24, 2022 at 1:01 AM Heesung Sohn
> > >  wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > Heesung
> > > >
> > > > On Tue, Aug 23, 2022 at 10:00 AM Anon Hxy  wrote:
> > > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > Thanks,
> > > > > Xiaoyu Hou
> > > > >
> > > > > lordcheng10 <1572139...@qq.com.invalid> 于2022年8月23日周二 15:10写道:
> > > > >
> > > > > > Hi Pulsar Community, I would like to start a VOTE on "Support split
> > > > bundle
> > > > > > by flow or qps."(PIP-169)
> > > > > > Here is the design detail:
> > > > https://github.com/apache/pulsar/issues/16782
> > > > > >
> > > > > >
> > > > > > and the discussion thread:
> > > > > > https://lists.apache.org/thread/cshyt10fwcjjxs93g8yf0svgwcgnshmg
> > > > > >
> > > > > >
> > > > > > Thanks,
> > > > > > lordcheng10
> > > >
> > >
> >