Re: Chunjun Proposal

2022-02-26 Thread Sheng Wu
Hi LuNing

I can see several similar concerns from IPMC members and committers of ASF TLPs.
This seems to have become a serious concern about this new proposal. We should
resolve these concerns (if possible) first.

In my understanding, from the ASF incubator perspective, we prefer `Do
NOT confuse users`, because we have already accepted projects that do
nearly the same thing.
We always encourage people to join together and build a more
powerful project and community, rather than building several similar
projects.

I would recommend that you talk with some IPMC members or
Apache members who have enough technical background in the Flink CDC,
Camel, and SeaTunnel projects.
They could give you a more objective review, and discuss
with you privately whether this newly proposed project is suitable for
the ASF incubator.


Sheng Wu 吴晟
Twitter, wusheng1108

Re: Chunjun Proposal

2022-02-26 Thread Lidong Dai
Hi,

If ChunJun also plans to support Spark or other runtimes, it will be highly
similar to Apache SeaTunnel. Have you considered contributing to Apache
SeaTunnel?




Best Regards



---
Apache DolphinScheduler PMC Chair
Lidong Dai
lidong...@apache.org
Linkedin: https://www.linkedin.com/in/dailidong
Twitter: @WorkflowEasy <https://twitter.com/WorkflowEasy>

---


Re: Chunjun Proposal

2022-02-25 Thread Sheng Wu
Hi

> Until now ChunJun is the only tool that can capture all tables data change of 
> the whole database

Given the above feature description, I am also curious whether this
would be better as an extension of Camel that feeds data changes into a
pipeline.
Could you share why you think `Establishing a new project and
rebuilding everything from scratch` is better? Have you considered
integrating with Camel?

Sheng Wu 吴晟
Twitter, wusheng1108

Re: Chunjun Proposal

2022-02-25 Thread LuNing Wang
Hi,

Thanks a lot, Lidong Dai 代立冬.

> I am curious that Chunjun(was Flinkx) is built on the Flink CDC, so what is 
> its innovation?

ChunJun is not built on Flink CDC. When we wrote the ChunJun
CDC code, Flink CDC did not exist. We independently created a new
data structure that differs from Flink CDC's. Every CDC record in
this data structure carries all of its metadata.

So far, ChunJun is the only tool that can capture data changes from
all tables of a whole database by configuring a simple file rather than
writing Java code. Flink CDC can capture one table using SQL,
and all tables using Java code.

We will also implement schema evolution capture, which is also based on a
new data structure. We have already implemented it in the
reader/source part, and we will implement the writer/sink part next.

Best,
LuNing Wang 王鲁宁

Re: Chunjun Proposal

2022-02-24 Thread Jean-Baptiste Onofré
Hi

Thanks for the details.

Another question: how do you compare Chunjun with Camel, NiFi, Gobblin, and
others?
What kind of community do you already have?

Thanks
Regards
JB

Re: Chunjun Proposal

2022-02-24 Thread LuNing Wang
Hi,

Thanks a lot, Jean-Baptiste Onofré.

>  how the users implement the ETL pipeline ?

Users implement the ETL pipeline by configuring a JSON file or SQL.
The following is one example:
https://github.com/DTStack/chunjun/blob/master/flinkx-examples/json/doris/mysql_doris.json


> Is it using a kind of SDK/DSL or do you plan an UI ?

It uses a JSON file or a SQL file. We planned to implement a UI
a year ago, but the current core developers have no time to implement
one and lack web frontend skills.


> Is it very coupled to Flink or the runtime is "pluggable" ? Do you plan to 
> have some kind of optional storage, similar to Apache Druid.

Currently, ChunJun is coupled to Flink. We plan to support Spark or
other runtimes, but nobody in the ChunJun community is experienced with
Apache Spark. We need to bring in more contributors.

Best,
LuNing Wang 王鲁宁

Re: Chunjun Proposal

2022-02-24 Thread Daniel Widdis
Hi, LuNing.

I am not an IPMC member, just an interested open source enthusiast looking for a
project to contribute to.

I was excited about contributing to another project that recently joined the
incubator, but as someone who only speaks English, I have found it challenging
when a large number of issues are written in Chinese without enough translation
for me to help.

Reading your proposal, I thought this might be another opportunity for me to help,
but I just visited your GitHub site and found that most of the current open
issues are not understandable to me. While I am enthusiastic and want to
help, I do not see how I can do so.

I am concerned about your ability to attract project members who do not
speak your language.

Re: Chunjun Proposal

2022-02-24 Thread Willem Jiang
Hi LuNing,

It's good to see that the Chunjun project is interested in the ASF incubation process.
Please go through the discussion and try to answer the questions about
the proposal.
It will help you address the concerns and polish the proposal.

Willem Jiang

Twitter: willemjiang
Weibo: 姜宁willem

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Chunjun Proposal

2022-02-23 Thread Sheng Wu
> May I use this email to continue to communicate with the Apache community
> in this thread?

It is fine to continue on this thread.

Sheng Wu 吴晟
Twitter, wusheng1108

LuNing Wang wrote on Thu, Feb 24, 2022 at 11:16:
>
> Hi,
>
> I am LuNing Wang, who sent the Chunjun proposal using 'apa...@dtstack.com',
> and I'm one of the maintainers of the Chunjun project.
> As Apache is a community of peers, I will use this email address to reply to all
> questions and issues in this thread, after I have read The Apache Incubator
> Cookbook.
>
> May I use this email address to continue to communicate with the Apache community
> in this thread?
>
> Best,
> LuNing Wang 王鲁宁
>
>
> Apache wrote on Thu, Feb 24, 2022 at 09:57:
>
> >
> >
> > --
> > From: Calvin Kirs
> > Sent: Wednesday, Feb 23, 2022, 00:07
> > To: general
> > Subject: Re: Chunjun Proposal
> >
> > Hi,
> >
> > I second Tison, and I'm glad to see your proposal.
> > You must have put a lot of effort into drafting it,
> > but it needs to be clear what you expect and what you know about Apache[1],
> > and whether we are on the same page.
> >
> >
> > You can see the following information:
> > The Apache Incubator Cookbook[2]
> > The Apache Way[3]
> >
> > I'd be happy to help you if you need it.
> >
> > [1]https://www.apache.org/
> > [2]https://incubator.apache.org/cookbook/
> > [3]https://www.apache.org/theapacheway/index.html
> >
> > tison wrote on Tue, Feb 22, 2022 at 23:17:
> >
> > > Hi,
> > >
> > > I have two questions here:
> > >
> > > 1. How should I address you in this thread? Apache is a community of peers. I
> > > can't imagine I'm talking to the whole Chunjun community, or to "Apache", unless
> > > that is your name.
> > > 2. What are your expectations for going into the incubator? Almost the whole
> > > proposal is about the current state of Chunjun, and the only
> > > statement about your expectations is:
> > >
> > > > we seek to further prosper the community with the aid of Apache
> > >
> > > Could you elaborate a bit on the motivation here? What help are you seeking?
> > >
> > > Also, I second Sheng's comment that your descriptions of contributors and
> > > initial committers are confusing. If your community continuously
> > > promotes contributors, why is the initial committer list so
> > > limited?
> > >
> > > Best,
> > > tison.
> > >
> > >
> > > Sheng Wu wrote on Tue, Feb 22, 2022 at 20:54:
> > >
> > > > I think this description is incorrect.
> > > >
> > > > > Our initial committers will submit iCLA(s), SGA, and CCLA(s).
> > > >
> > > > Committers are individuals, who should submit only an ICLA; their
> > > > employers are recommended to submit a CCLA; and the owner of the project
> > > > should sign the SGA.
> > > > ___
> > > >
> > > > Also, I noticed a conflict in your description.
> > > > On one hand, you mentioned `The initial committers are employees of
> > > > DTStack.` with only 5 initial committers, and on the other hand, you
> > > > gave a very long vendor list and core contributors list.
> > > > So, which description is accurate? If you have those contributors,
> > > > why weren't all of them invited as PPMC members? Do you have any public
> > > > discussion about this decision?
> > > > For example, GitHub ID (demotto) is #8 in the contributor list and is also
> > > > listed in the core contributor list, but I can't find it in the initial
> > > > committer list.
> > > >
> > > >
> > > > Sheng Wu 吴晟
> > > > Twitter, wusheng1108
> > > >
> > > > Lidong Dai  于2022年2月22日周二 20:41写道:
> > > > >
> > > > > I am curious that Chunjun(was Flinkx) is built on the Flink CDC, so
> > > what
> > > > is
> > > > > its innovation?
> > > > >
> > > > > BTW, you shouldn't use the mail(apa...@dtstack.com) as your user
> > > > account,
> > > > > apache is a registered trademark of ASF
> > > > >
> > > > >
> > > > >
> > > > > Best Regards
> > > > >
> > > > >
> > > > >
> > > > > ---
> > > > > Apache DolphinSched

Re: Chunjun Proposal

2022-02-23 Thread LuNing Wang
Hi,

I am LuNing Wang, who sent the Chunjun proposal using 'apa...@dtstack.com',
and I'm one of the maintainers of the Chunjun project.
As Apache is a community of peers, I will use this email address to reply to
all questions and issues in this thread, after I have read the Apache Incubator
Cookbook.

May I use this email address to continue communicating with the Apache
community in this thread?

Best,
LuNing Wang 王鲁宁


Apache wrote on Thu, Feb 24, 2022 at 09:57:

>
>
> --
> From: Calvin Kirs
> Sent: Wednesday, February 23, 2022, 00:07
> To: general
> Subject: Re: Chunjun Proposal
>
> Hi,
>
> I second Tison, and I'm glad to see your proposal.
> You must have put a lot of effort into drafting it,
> but it needs to be clear what you expect and what you know about Apache [1].
> Are we on the same page?
>
>
> You can see the following information:
> The Apache Incubator Cookbook[2]
> The Apache Way[3]
>
> I'd be happy to help you if you need it.
>
> [1]https://www.apache.org/
> [2]https://incubator.apache.org/cookbook/
> [3]https://www.apache.org/theapacheway/index.html
>

Re: Chunjun Proposal

2022-02-22 Thread Calvin Kirs
Hi,

I second Tison, and I'm glad to see your proposal.
You must have put a lot of effort into drafting it,
but it needs to be clear what you expect and what you know about Apache [1].
Are we on the same page?


You can see the following information:
The Apache Incubator Cookbook[2]
The Apache Way[3]

I'd be happy to help you if you need it.

[1]https://www.apache.org/
[2]https://incubator.apache.org/cookbook/
[3]https://www.apache.org/theapacheway/index.html

tison wrote on Tue, Feb 22, 2022 at 23:17:

> Hi,
>
> I have two questions here:
>
> 1. How should I address you in this thread? Apache is a community of peers. I
> can't imagine I'm talking to the whole Chunjun community or to "Apache", unless
> that is your name.
> 2. What are your expectations for entering the incubator? Almost the whole
> proposal is about the current state of Chunjun, and the only
> statement about your expectations is:
>
> > we seek to further prosper the community with the aid of Apache
>
> Could you elaborate a bit on the motivation here? What help are you seeking?
>
> Also, I second Sheng's comment that your description of contributors and
> initial committers is confusing. If your community continuously
> promotes contributors, why is the initial committer list so
> limited?
>
> Best,
> tison.

Re: Chunjun Proposal

2022-02-22 Thread tison
Hi,

I have two questions here:

1. How should I address you in this thread? Apache is a community of peers. I
can't imagine I'm talking to the whole Chunjun community or to "Apache", unless
that is your name.
2. What are your expectations for entering the incubator? Almost the whole
proposal is about the current state of Chunjun, and the only
statement about your expectations is:

> we seek to further prosper the community with the aid of Apache

Could you elaborate a bit on the motivation here? What help are you seeking?

Also, I second Sheng's comment that your description of contributors and
initial committers is confusing. If your community continuously
promotes contributors, why is the initial committer list so
limited?

Best,
tison.


Sheng Wu wrote on Tue, Feb 22, 2022 at 20:54:

> I think this description is incorrect.
>
> > Our initial committers will submit iCLA(s), SGA, and CCLA(s).
>
> Committers are individuals, who should only submit an ICLA; their
> employers are advised to submit a CCLA, and the owner of the project
> should sign the SGA.
> ___
>
> Also, I noticed a conflict in your description.
> On the one hand, you mentioned `The initial committers are employees of
> DTStack.` with only 5 initial committers; on the other hand, you
> gave a very long vendor list and core contributors list.
> So, which is the accurate description? If you have those contributors,
> why weren't all of them invited as PPMC members? Do you have any public
> discussion about this decision?
> For example, GitHub ID (demotto) is #8 in the contributor list and is also
> listed in the core contributor list, but I can't find it in the initial
> committer list.
>
>
> Sheng Wu 吴晟
> Twitter, wusheng1108

Re: Chunjun Proposal

2022-02-22 Thread Sheng Wu
I think this description is incorrect.

> Our initial committers will submit iCLA(s), SGA, and CCLA(s).

Committers are individuals, who should only submit an ICLA; their
employers are advised to submit a CCLA, and the owner of the project
should sign the SGA.
___

Also, I noticed a conflict in your description.
On the one hand, you mentioned `The initial committers are employees of
DTStack.` with only 5 initial committers; on the other hand, you
gave a very long vendor list and core contributors list.
So, which is the accurate description? If you have those contributors,
why weren't all of them invited as PPMC members? Do you have any public
discussion about this decision?
For example, GitHub ID (demotto) is #8 in the contributor list and is also
listed in the core contributor list, but I can't find it in the initial
committer list.


Sheng Wu 吴晟
Twitter, wusheng1108

Lidong Dai wrote on Tue, Feb 22, 2022 at 20:41:
>
> I am curious: since Chunjun (formerly FlinkX) is built on Flink CDC, what is
> its innovation?
>
> BTW, you shouldn't use the email address (apa...@dtstack.com) as your user
> account; Apache is a registered trademark of the ASF.
>
>
>
> Best Regards
>
>
>
> ---
> Apache DolphinScheduler PMC Chair
> Lidong Dai
> lidong...@apache.org
> Linkedin: https://www.linkedin.com/in/dailidong
> Twitter: @WorkflowEasy <https://twitter.com/WorkflowEasy>
>
> ---

Re: Chunjun Proposal

2022-02-22 Thread Lidong Dai
I am curious: since Chunjun (formerly FlinkX) is built on Flink CDC, what is
its innovation?

BTW, you shouldn't use the email address (apa...@dtstack.com) as your user
account; Apache is a registered trademark of the ASF.



Best Regards



---
Apache DolphinScheduler PMC Chair
Lidong Dai
lidong...@apache.org
Linkedin: https://www.linkedin.com/in/dailidong
Twitter: @WorkflowEasy <https://twitter.com/WorkflowEasy>

---


On Tue, Feb 22, 2022 at 3:10 PM Jean-Baptiste Onofré 
wrote:

> By the way, if you need a mentor, you can ping me.
>
> Regards
> JB

Re: Chunjun Proposal

2022-02-21 Thread Jean-Baptiste Onofré
By the way, if you need a mentor, you can ping me.

Regards
JB

On Tue, Feb 22, 2022 at 8:08 AM Jean-Baptiste Onofré  wrote:
>
> Hi,
>
> It looks interesting. Quick questions about the current state of the
> project/roadmap: how do users implement the ETL pipeline? Is it
> using a kind of SDK/DSL, or do you plan a UI?
> Is it tightly coupled to Flink, or is the runtime "pluggable"? Do you
> plan to have some kind of optional storage, similar to Apache Druid,
> ...?
>
> Regards
> JB

Re: Chunjun Proposal

2022-02-21 Thread Jean-Baptiste Onofré
Hi,

It looks interesting. Quick questions about the current state of the
project/roadmap: how do users implement the ETL pipeline? Is it
using a kind of SDK/DSL, or do you plan a UI?
Is it tightly coupled to Flink, or is the runtime "pluggable"? Do you
plan to have some kind of optional storage, similar to Apache Druid,
...?

Regards
JB

On Tue, Feb 22, 2022 at 7:41 AM Apache  wrote:
>
>
> Dear Apache Incubator Community,
> We propose to contribute Chunjun as an Apache Incubator project.
> We are still looking for possible Champion and Mentors if anyone would like 
> to volunteer. Thanks a lot.
> Best Regards,
> Real-time computing engine team of DTStack.
>
> #Chunjun Proposal
>
> ##Abstract
> Chunjun is a distributed ETL tool and distributed data integration tool. 
> Currently Chunjun is based on Apache Flink. It was initially known as FlinkX 
> and renamed Chunjun on February 22, 2022.
> - Chunjun codebase: https://github.com/DTStack/chunjun
>
> ##Proposal
> We propose to contribute the Chunjun codebase to the Apache Software 
> Foundation with the intent of forming a productive, meritocratic and open 
> community around Chunjun’s continued development, according to the 'Apache 
> Way'. Chunjun's source code is already under the Apache License, Version 2.0.
>
> ##Background
> We developed Chunjun at DTStack in 2017, when we needed a low-code,
> high-performance data integration tool. It has been an
> open-source project on GitHub since April 2018. Chunjun has been running
> continuously in DTStack's production environment. Chunjun has also been
> widely used by companies in China, including DTStack (https://www.dtstack.com/),
> Qihu360 (https://www.360.cn/), Iflytek (https://www.iflytek.com/), XPeng
> Motors (https://en.xiaopeng.com/), WeBank (https://www.webank.com/),
> Asiainfo (https://asiainfo.com/), Guazi (https://www.guazi.com/), Hello Inc
> (https://www.hello-inc.com/), etc. Today, Chunjun has a strong community
> in China.
>
> ##Rationale
> Chunjun's high performance is based on Apache Flink, and Chunjun can
> integrate data from different data sources. Users only need to configure a
> JSON file to complete data reading, transformation, and writing. Users can
> implement new reader/writer plugins to meet their requirements. Chunjun has
> implemented plugins that capture data changes from MySQL and restore the data
> in Apache Doris.
> Chunjun has the following features:
> - real-time and offline data integration across different data sources.
> - change data capture (CDC) to merge and restore data.
> - resume from breakpoint.
> - capture and collect dirty data.
> - limit the data transfer rate.
> - throughput metrics.
> - capture and restore schema evolution. (TODO)
>
> ##Current Status
> ###Meritocracy
> Since Chunjun was open-sourced, many enterprises have adopted Chunjun to
> build up their data integration systems. In return, we have received many
> issue reports and enhancements from them. The codebase is now
> mainly managed by the development team inside DTStack, which is also
> responsible for building the internal data integration system.
> ###Community
> Chunjun has been building a community of contributors and users around this
> framework for the last five years. We organized one meetup in 2020.
> Currently, we communicate through GitHub issues and a Chinese DingTalk group.
> There are about 3000 people in this group. We believe that we can get a
> lot of help from the Apache Flink community too. We will organize another
> meetup in 2022.
> ###Core Developers
> (In alphabetical order)
> Chao Xu (https://github.com/zoudaokoulife)
> Gongjiang Tang, (https://github.com/kyo-tom)
> Huai Yang, (https://github.com/yanghuaiGit)
> Jiangbo Li, (https://github.com/lijiangbo)
> Luning Wong, (https://github.com/deadwind4)
> Luo Li, (https://github.com/kanata163)
> Sishu Yang, (https://github.com/yangsishu)
> Tianzhu Wen, (https://github.com/WTZ468071157)
> Weiliang Hao, (https://github.com/xiuzhu9527)
> Wenqiang Liu, (https://github.com/meng1222)
> Xing Liu, (https://github.com/simenliuxing)
> Yang Lan, (https://github.com/HiLany)
> Yanquan Lv, (https://github.com/lvyanquan)
> Yifan Hu, (https://github.com/demotto)
> Zaiyue Yu, (https://github.com/tonybobam)
> Zhangwan Zhao, (https://github.com/jiemotongxue)
> Zhiqiang Li, (https://github.com/ChestnutQiang)
> Almost all of them work in the real-time computing engine team of DTStack; only
> Yifan Hu works for CaoCao Tech. Most of them are Apache Flink contributors.
>
> ##Known Risks
> ###Project Name
> The name of the project is Chunjun. Chunjun comes from the Mandarin Chinese
> Pinyin "Chun Jun", and it is one of the top ten famous swords in
> C

Re: Chunjun Proposal

2022-02-21 Thread Atri Sharma
I am happy to sign up to be the champion

On Tue, 22 Feb 2022, 12:11 Apache,  wrote:

>
> Dear Apache Incubator Community,
> We propose to contribute Chunjun as an Apache Incubator project.
> We are still looking for possible Champion and Mentors if anyone would
> like to volunteer. Thanks a lot.
> Best Regards,
> Real-time computing engine team of DTStack.
>
> #Chunjun Proposal
>
> ##Abstract
> Chunjun is a distributed ETL tool and distributed data integration tool.
> Currently Chunjun is based on Apache Flink. It was initially known as
> FlinkX and renamed Chunjun on February 22, 2022.
> - Chunjun codebase: https://github.com/DTStack/chunjun
>
> ##Proposal
> We propose to contribute the Chunjun codebase to the Apache Software
> Foundation with the intent of forming a productive, meritocratic and open
> community around Chunjun’s continued development, according to the 'Apache
> Way'. Chunjun's source code is already licensed under the Apache License,
> Version 2.0.
>
> ##Background
> We developed Chunjun at DTStack in 2017, when we needed a low-code,
> high-performance data integration tool. It has been an open-source project
> on GitHub since April 2018. Chunjun has been running continuously in
> DTStack's production environment. It is also widely used by companies in
> China, including DTStack (https://www.dtstack.com/), Qihu360
> (https://www.360.cn/), Iflytek (https://www.iflytek.com/), XPeng Motors
> (https://en.xiaopeng.com/), WeBank (https://www.webank.com/), Asiainfo
> (https://asiainfo.com/), Guazi (https://www.guazi.com/), and Hello Inc
> (https://www.hello-inc.com/). Today, Chunjun has a strong community in
> China.
>
> ##Rationale
> Chunjun's high performance comes from Apache Flink, and Chunjun can
> integrate data from different data sources. Users only need to configure a
> JSON file to complete data reading, transformation, and writing. Users can
> implement new reader/writer plugins to meet their requirements. Chunjun
> has implemented plugins that capture data changes from MySQL and restore
> the data in Apache Doris.
> Chunjun has the following features:
> - Real-time and offline integration of data from different data sources.
> - Change data capture (CDC) to merge and restore data.
> - Resuming from a breakpoint.
> - Capturing and collecting dirty data.
> - Limiting the data transfer rate.
> - Throughput metrics.
> - Capturing and restoring schema evolution. (TODO)
>
> ##Current Status
> ###Meritocracy
> Since Chunjun was open-sourced, many enterprises have adopted it to build
> their data integration systems. In return, we have received many issue
> reports and enhancements from them. The codebase is now mainly managed by
> the development team inside DTStack, which is also responsible for
> building DTStack's internal data integration system.
> ###Community
> Chunjun has been building a community of contributors and users around
> this framework for the last five years. We organized one meetup in 2020.
> Currently, we communicate through GitHub issues and a Chinese DingTalk
> group of about 3000 people. We also believe that we can get a lot of help
> from the Apache Flink community. We will organize another meetup in 2022.
> ###Core Developers
> (In alphabetical order)
> Chao Xu (https://github.com/zoudaokoulife)
> Gongjiang Tang, (https://github.com/kyo-tom)
> Huai Yang, (https://github.com/yanghuaiGit)
> Jiangbo Li, (https://github.com/lijiangbo)
> Luning Wong, (https://github.com/deadwind4)
> Luo Li, (https://github.com/kanata163)
> Sishu Yang, (https://github.com/yangsishu)
> Tianzhu Wen, (https://github.com/WTZ468071157)
> Weiliang Hao, (https://github.com/xiuzhu9527)
> Wenqiang Liu, (https://github.com/meng1222)
> Xing Liu, (https://github.com/simenliuxing)
> Yang Lan, (https://github.com/HiLany)
> Yanquan Lv, (https://github.com/lvyanquan)
> Yifan Hu, (https://github.com/demotto)
> Zaiyue Yu, (https://github.com/tonybobam)
> Zhangwan Zhao, (https://github.com/jiemotongxue)
> Zhiqiang Li, (https://github.com/ChestnutQiang)
> Almost all of them work in the real-time computing engine team of DTStack;
> only Yifan Hu works for CaoCao Tech. Most of them are Apache Flink
> contributors.
>
> ##Known Risks
> ###Project Name
> The name of the project is Chunjun. It comes from the Mandarin Chinese
> Pinyin "Chun Jun", the name of one of the ten most famous swords in China.
> ###Orphaned products
> With more than 20 contributors and thousands of forks and stars, Chunjun
> is actively supported, and we seek to grow the community further with the
> aid of Apache. As a consequence, Chunjun is unlikely to become an orphaned
> project.
> ###Inexperience with Open Source
> Many of the C

Chunjun Proposal

2022-02-21 Thread Apache

Dear Apache Incubator Community, 
We propose to contribute Chunjun as an Apache Incubator project.
We are still looking for a Champion and Mentors, if anyone would like to
volunteer. Thanks a lot.
Best Regards, 
Real-time computing engine team of DTStack.

#Chunjun Proposal

##Abstract
Chunjun is a distributed ETL tool and distributed data integration tool. 
Currently Chunjun is based on Apache Flink. It was initially known as FlinkX 
and renamed Chunjun on February 22, 2022.
- Chunjun codebase: https://github.com/DTStack/chunjun

##Proposal
We propose to contribute the Chunjun codebase to the Apache Software Foundation
with the intent of forming a productive, meritocratic and open community around
Chunjun’s continued development, according to the 'Apache Way'. Chunjun's
source code is already licensed under the Apache License, Version 2.0.

##Background
We developed Chunjun at DTStack in 2017, when we needed a low-code,
high-performance data integration tool. It has been an open-source project on
GitHub since April 2018. Chunjun has been running continuously in DTStack's
production environment. It is also widely used by companies in China,
including DTStack (https://www.dtstack.com/), Qihu360 (https://www.360.cn/),
Iflytek (https://www.iflytek.com/), XPeng Motors (https://en.xiaopeng.com/),
WeBank (https://www.webank.com/), Asiainfo (https://asiainfo.com/), Guazi
(https://www.guazi.com/), and Hello Inc (https://www.hello-inc.com/). Today,
Chunjun has a strong community in China.

##Rationale
Chunjun's high performance comes from Apache Flink, and Chunjun can integrate
data from different data sources. Users only need to configure a JSON file to
complete data reading, transformation, and writing. Users can implement new
reader/writer plugins to meet their requirements. Chunjun has implemented
plugins that capture data changes from MySQL and restore the data in Apache
Doris.
Chunjun has the following features:
- Real-time and offline integration of data from different data sources.
- Change data capture (CDC) to merge and restore data.
- Resuming from a breakpoint.
- Capturing and collecting dirty data.
- Limiting the data transfer rate.
- Throughput metrics.
- Capturing and restoring schema evolution. (TODO)
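
The JSON-driven reader/transformer/writer flow described above can be
illustrated with a minimal Python sketch. Everything in it is an assumption
made for illustration: the config keys, the READERS and WRITERS registries,
and run_job are hypothetical and are not Chunjun's actual job format or API.
It only shows the idea of plugins selected by name from a JSON file, dirty
records collected instead of failing the job, and a restart offset for
resuming from a breakpoint.

```python
import json


def read_memory(parameter):
    # Hypothetical reader plugin: yields rows embedded in the config itself.
    return iter(parameter["rows"])


def write_memory(sink, record):
    # Hypothetical writer plugin: appends each record to an in-memory list.
    sink.append(record)


# Plugin registries keyed by the name used in the JSON job file (illustrative).
READERS = {"memory": read_memory}
WRITERS = {"memory": write_memory}


def run_job(config_text, start_offset=0):
    """Run a reader -> transformer -> writer job described by a JSON string."""
    job = json.loads(config_text)
    rows = READERS[job["reader"]["name"]](job["reader"]["parameter"])
    write = WRITERS[job["writer"]["name"]]
    field = job["transformer"]["cast_to_int"]

    written, dirty = [], []
    offset = start_offset
    for index, row in enumerate(rows):
        if index < start_offset:
            continue  # skip rows handled before the restart
        try:
            record = dict(row)
            record[field] = int(row[field])  # the only transformation here
        except (KeyError, TypeError, ValueError):
            dirty.append(row)  # collect the bad row instead of aborting
        else:
            write(written, record)
        offset = index + 1  # position to resume from after a failure
    return {"written": written, "dirty": dirty, "offset": offset}


CONFIG = json.dumps({
    "reader": {"name": "memory", "parameter": {"rows": [
        {"id": "1", "name": "a"},
        {"id": "x", "name": "b"},
        {"id": "3", "name": "c"},
    ]}},
    "transformer": {"cast_to_int": "id"},
    "writer": {"name": "memory"},
})

result = run_job(CONFIG)
resumed = run_job(CONFIG, start_offset=2)
```

Here result holds two written records and one dirty record, while resumed
processes only the last row. A real job would read from and write to external
systems and persist the offset, which this sketch does not attempt.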

##Current Status
###Meritocracy
Since Chunjun was open-sourced, many enterprises have adopted it to build
their data integration systems. In return, we have received many issue reports
and enhancements from them. The codebase is now mainly managed by the
development team inside DTStack, which is also responsible for building
DTStack's internal data integration system.
###Community
Chunjun has been building a community of contributors and users around this
framework for the last five years. We organized one meetup in 2020. Currently,
we communicate through GitHub issues and a Chinese DingTalk group of about
3000 people. We also believe that we can get a lot of help from the Apache
Flink community. We will organize another meetup in 2022.
###Core Developers
(In alphabetical order) 
Chao Xu (https://github.com/zoudaokoulife)
Gongjiang Tang, (https://github.com/kyo-tom)
Huai Yang, (https://github.com/yanghuaiGit)
Jiangbo Li, (https://github.com/lijiangbo)
Luning Wong, (https://github.com/deadwind4)
Luo Li, (https://github.com/kanata163)
Sishu Yang, (https://github.com/yangsishu) 
Tianzhu Wen, (https://github.com/WTZ468071157)
Weiliang Hao, (https://github.com/xiuzhu9527)
Wenqiang Liu, (https://github.com/meng1222)
Xing Liu, (https://github.com/simenliuxing)
Yang Lan, (https://github.com/HiLany)
Yanquan Lv, (https://github.com/lvyanquan)
Yifan Hu, (https://github.com/demotto)
Zaiyue Yu, (https://github.com/tonybobam)
Zhangwan Zhao, (https://github.com/jiemotongxue)
Zhiqiang Li, (https://github.com/ChestnutQiang) 
Almost all of them work in the real-time computing engine team of DTStack;
only Yifan Hu works for CaoCao Tech. Most of them are Apache Flink
contributors.

##Known Risks
###Project Name
The name of the project is Chunjun. It comes from the Mandarin Chinese Pinyin
"Chun Jun", the name of one of the ten most famous swords in China.
###Orphaned products
With more than 20 contributors and thousands of forks and stars, Chunjun is
actively supported, and we seek to grow the community further with the aid of
Apache. As a consequence, Chunjun is unlikely to become an orphaned project.
###Inexperience with Open Source
Many of the Chunjun committers have experience working on open source projects. 
They are also active contributors to other Apache projects.
###Homogenous Developers
Most of the core developers are from DTStack, but Chunjun has also received
some bug fixes and enhancements from developers who do not work at DTStack.
###Reliance on Salaried Developers
Currently, most of the core developers are paid by DTStack to work on the
Chunjun project. We look forward to attracting more people outside DTStack to
join this project.
###Rel