Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level Project

2024-03-31 Thread Guowei Ma
Congratulations!
Best,
Guowei


On Mon, Apr 1, 2024 at 11:15 AM Feng Jin  wrote:

> Congratulations!
>
> Best,
> Feng Jin
>
> On Mon, Apr 1, 2024 at 10:51 AM weijie guo 
> wrote:
>
>> Congratulations!
>>
>> Best regards,
>>
>> Weijie
>>
>>
>> Hang Ruan  于2024年4月1日周一 09:49写道:
>>
>> > Congratulations!
>> >
>> > Best,
>> > Hang
>> >
>> > Lincoln Lee  于2024年3月31日周日 00:10写道:
>> >
>> > > Congratulations!
>> > >
>> > > Best,
>> > > Lincoln Lee
>> > >
>> > >
>> > > Jark Wu  于2024年3月30日周六 22:13写道:
>> > >
>> > > > Congratulations!
>> > > >
>> > > > Best,
>> > > > Jark
>> > > >
>> > > > On Fri, 29 Mar 2024 at 12:08, Yun Tang  wrote:
>> > > >
>> > > > > Congratulations to all Paimon guys!
>> > > > >
>> > > > > Glad to see a Flink sub-project has been graduated to an Apache
>> > > top-level
>> > > > > project.
>> > > > >
>> > > > > Best
>> > > > > Yun Tang
>> > > > >
>> > > > > 
>> > > > > From: Hangxiang Yu 
>> > > > > Sent: Friday, March 29, 2024 10:32
>> > > > > To: dev@flink.apache.org 
>> > > > > Subject: Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top
>> Level
>> > > > Project
>> > > > >
>> > > > > Congratulations!
>> > > > >
>> > > > > On Fri, Mar 29, 2024 at 10:27 AM Benchao Li > >
>> > > > wrote:
>> > > > >
>> > > > > > Congratulations!
>> > > > > >
>> > > > > > Zakelly Lan  于2024年3月29日周五 10:25写道:
>> > > > > > >
>> > > > > > > Congratulations!
>> > > > > > >
>> > > > > > >
>> > > > > > > Best,
>> > > > > > > Zakelly
>> > > > > > >
>> > > > > > > On Thu, Mar 28, 2024 at 10:13 PM Jing Ge
>> > > > > > > >
>> > > > > > wrote:
>> > > > > > >
>> > > > > > > > Congrats!
>> > > > > > > >
>> > > > > > > > Best regards,
>> > > > > > > > Jing
>> > > > > > > >
>> > > > > > > > On Thu, Mar 28, 2024 at 1:27 PM Feifan Wang <
>> > zoltar9...@163.com>
>> > > > > > wrote:
>> > > > > > > >
>> > > > > > > > > Congratulations!——
>> > > > > > > > >
>> > > > > > > > > Best regards,
>> > > > > > > > >
>> > > > > > > > > Feifan Wang
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > > > At 2024-03-28 20:02:43, "Yanfei Lei" > >
>> > > > wrote:
>> > > > > > > > > >Congratulations!
>> > > > > > > > > >
>> > > > > > > > > >Best,
>> > > > > > > > > >Yanfei
>> > > > > > > > > >
>> > > > > > > > > >Zhanghao Chen  于2024年3月28日周四
>> > > > 19:59写道:
>> > > > > > > > > >>
>> > > > > > > > > >> Congratulations!
>> > > > > > > > > >>
>> > > > > > > > > >> Best,
>> > > > > > > > > >> Zhanghao Chen
>> > > > > > > > > >> 
>> > > > > > > > > >> From: Yu Li 
>> > > > > > > > > >> Sent: Thursday, March 28, 2024 15:55
>> > > > > > > > > >> To: d...@paimon.apache.org 
>> > > > > > > > > >> Cc: dev ; user <
>> > u...@flink.apache.org
>> > > >
>> > > > > > > > > >> Subject: Re: [ANNOUNCE] Apache Paimon is graduated to
>> Top
>> > > > Level
>> > > > > > > > Project
>> > > > > > > > > >>
>> > > > > > > > > >> CC the Flink user and dev mailing list.
>> > > > > > > > > >>
>> > > > > > > > > >> Paimon originated within the Flink community, initially
>> > > known
>> > > > as
>> > > > > > Flink
>> > > > > > > > > >> Table Store, and all our incubating mentors are
>> members of
>> > > the
>> > > > > > Flink
>> > > > > > > > > >> Project Management Committee. I am confident that the
>> > bonds
>> > > of
>> > > > > > > > > >> enduring friendship and close collaboration will
>> continue
>> > to
>> > > > > > unite the
>> > > > > > > > > >> two communities.
>> > > > > > > > > >>
>> > > > > > > > > >> And congratulations all!
>> > > > > > > > > >>
>> > > > > > > > > >> Best Regards,
>> > > > > > > > > >> Yu
>> > > > > > > > > >>
>> > > > > > > > > >> On Wed, 27 Mar 2024 at 20:35, Guojun Li <
>> > > > > gjli.schna...@gmail.com>
>> > > > > > > > > wrote:
>> > > > > > > > > >> >
>> > > > > > > > > >> > Congratulations!
>> > > > > > > > > >> >
>> > > > > > > > > >> > Best,
>> > > > > > > > > >> > Guojun
>> > > > > > > > > >> >
>> > > > > > > > > >> > On Wed, Mar 27, 2024 at 5:24 PM wulin <
>> > > ouyangwu...@163.com>
>> > > > > > wrote:
>> > > > > > > > > >> >
>> > > > > > > > > >> > > Congratulations~
>> > > > > > > > > >> > >
>> > > > > > > > > >> > > > 2024年3月27日 15:54,王刚 > > .INVALID>
>> > > > 写道:
>> > > > > > > > > >> > > >
>> > > > > > > > > >> > > > Congratulations~
>> > > > > > > > > >> > > >
>> > > > > > > > > >> > > >> 2024年3月26日 10:25,Jingsong Li <
>> > jingsongl...@gmail.com
>> > > >
>> > > > > 写道:
>> > > > > > > > > >> > > >>
>> > > > > > > > > >> > > >> Hi Paimon community,
>> > > > > > > > > >> > > >>
>> > > > > > > > > >> > > >> I’m glad to announce that the ASF board has
>> > approved
>> > > a
>> > > > > > > > > resolution to
>> > > > > > > > > >> > > >> graduate Paimon into a full Top Level Project.
>> > Thanks
>> > > > to
>> > > > > > > > > everyone for
>> > > > > > > > > >> > > >> your help to get to this point.
>> > > > > > > > > >> > > >>
>> > > > > > > > > >> > > >> I just created an issue to track the things we
>> need
>> > > to
>> > 

[jira] [Created] (FLINK-34981) FLIP-426: Grouping Remote State Access

2024-03-31 Thread Jinzhong Li (Jira)
Jinzhong Li created FLINK-34981:
---

 Summary: FLIP-426: Grouping Remote State Access
 Key: FLINK-34981
 URL: https://issues.apache.org/jira/browse/FLINK-34981
 Project: Flink
  Issue Type: New Feature
  Components: Runtime / State Backends
Reporter: Jinzhong Li
 Fix For: 2.0.0


This is a sub-FLIP for the disaggregated state management and its related work, 
please read the [FLIP-423|https://cwiki.apache.org/confluence/x/R4p3EQ] first 
to know the whole story.

I/O speed and latency are critical for overall data throughput, particularly in 
jobs that manage large states. Implementing multiple asynchronous I/O 
operations is a proven strategy to enhance throughput by increasing parallelism 
of I/O execution. However, simply expanding I/O parallelism can quickly hit a 
ceiling due to finite I/O bandwidth. Additionally, when it comes to remote 
storage access, the time taken for RPC round trips significantly outweighs the 
impact of I/O size on individual I/O performance. So a promising optimization 
is to merge adjacent I/O requests into a single operation and fetch multiple 
keys with one I/O call. This approach requires a pre-prepared batch of keys for 
the query and the identification of I/O operations that can be combined. In 
this FLIP, we focus on the implementation details for batching state requests 
and processing them in batches.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34980) Translate overview document into Chinese

2024-03-31 Thread Caican Cai (Jira)
Caican Cai created FLINK-34980:
--

 Summary: Translate overview document into Chinese
 Key: FLINK-34980
 URL: https://issues.apache.org/jira/browse/FLINK-34980
 Project: Flink
  Issue Type: Sub-task
Reporter: Caican Cai


Translate overview document into Chinese



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34978) Introduce Asynchronous State APIs

2024-03-31 Thread Zakelly Lan (Jira)
Zakelly Lan created FLINK-34978:
---

 Summary: Introduce Asynchronous State APIs
 Key: FLINK-34978
 URL: https://issues.apache.org/jira/browse/FLINK-34978
 Project: Flink
  Issue Type: Sub-task
Reporter: Zakelly Lan
Assignee: Zakelly Lan
 Fix For: 1.20.0






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34979) Implement State Future and related utilities

2024-03-31 Thread Zakelly Lan (Jira)
Zakelly Lan created FLINK-34979:
---

 Summary: Implement State Future and related utilities
 Key: FLINK-34979
 URL: https://issues.apache.org/jira/browse/FLINK-34979
 Project: Flink
  Issue Type: Sub-task
Reporter: Zakelly Lan
Assignee: Zakelly Lan






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34977) FLIP-433: State Access on DataStream API V2

2024-03-31 Thread Weijie Guo (Jira)
Weijie Guo created FLINK-34977:
--

 Summary: FLIP-433: State Access on DataStream API V2
 Key: FLINK-34977
 URL: https://issues.apache.org/jira/browse/FLINK-34977
 Project: Flink
  Issue Type: Sub-task
  Components: API / DataStream, API / State Processor
Affects Versions: 1.20.0
Reporter: Weijie Guo
Assignee: Weijie Guo






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34976) LD_PRELOAD environment may not be effective after su to flink user

2024-03-31 Thread xiaogang zhou (Jira)
xiaogang zhou created FLINK-34976:
-

 Summary: LD_PRELOAD environment may not be effective after su to 
flink user
 Key: FLINK-34976
 URL: https://issues.apache.org/jira/browse/FLINK-34976
 Project: Flink
  Issue Type: New Feature
  Components: flink-docker
Affects Versions: 1.19.0
Reporter: xiaogang zhou






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34975) FLIP-427: ForSt - Disaggregated state Store

2024-03-31 Thread Hangxiang Yu (Jira)
Hangxiang Yu created FLINK-34975:


 Summary: FLIP-427: ForSt - Disaggregated state Store
 Key: FLINK-34975
 URL: https://issues.apache.org/jira/browse/FLINK-34975
 Project: Flink
  Issue Type: New Feature
  Components: Runtime / State Backends
Reporter: Hangxiang Yu
Assignee: Hangxiang Yu


This is a sub-FLIP for the disaggregated state management and its related work, 
please read the [FLIP-423|https://cwiki.apache.org/confluence/x/R4p3EQ] first 
to know the whole story.

As described in FLIP-423, there are some tough issues about embedded state 
backend on local file system, respecially when dealing with extremely large 
state:
 # {*}Constraints of local disk space complicate the prediction of storage 
requirements, potentially leading to job failures{*}: Especially in cloud 
native deployment mode, pre-allocated local disks typically face strict 
capacity constraints, making it challenging to forecast the size requirements 
of job states. Over-provisioning disk space results in unnecessary resource 
overhead, while under-provisioning risks job failure due to insufficient space.
 # *The tight coupling of compute and storage resources leads to 
underutilization and increased waste:* Jobs can generally be categorized as 
either CPU-intensive or IO-intensive. In a coupled architecture, CPU-intensive 
jobs leave a significant portion of storage resources underutilized, whereas 
IO-intensive jobs result in idle computing resources.

By considering remote storage as the primary storage, all working states are 
maintained on the remote file system, which brings several advantages:
 # *Remote storages e.g. S3/HDFS typically offer elastic scalability, 
theoretically providing unlimited space.*
 # *The allocation of remote storage resources can be optimized by reducing 
them for CPU-intensive jobs and augmenting them for IO-intensive jobs, thus 
enhancing overall resource utilization.*
 # *This architecture facilitates a highly efficient and lightweight process 
for checkpointing, recovery, and rescaling through fast copy or simple move.*

This FLIP aims to realize disaggregated state for our new key-value store named 
*ForSt* which evloves from RocksDB and supports remote file system. This makes 
Flink get rid of the disadvantages by coupled state architecture and embrace 
the scalable as well as flexible cloud-native storage.

Please see [FLIP-427 |https://cwiki.apache.org/confluence/x/T4p3EQ]for more 
details.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34974) FLIP-424: Asynchronous State APIs

2024-03-31 Thread Zakelly Lan (Jira)
Zakelly Lan created FLINK-34974:
---

 Summary: FLIP-424: Asynchronous State APIs
 Key: FLINK-34974
 URL: https://issues.apache.org/jira/browse/FLINK-34974
 Project: Flink
  Issue Type: New Feature
  Components: Runtime / State Backends
Reporter: Zakelly Lan
Assignee: Zakelly Lan
 Fix For: 2.0.0


This is a sub-FLIP for the disaggregated state management and its related work, 
please read the [FLIP-423|https://cwiki.apache.org/confluence/x/R4p3EQ] first 
to know the whole story.

To maximize I/O capacity utilization and enhance the use of pre-allocated 
computational resources, this FLIP proposes the introduction of asynchronous 
state APIs. These APIs permit state access to be executed in threads separate 
from the task thread, returning the result when available. Consequently, the 
task thread can process another element while awaiting multiple pending state 
results. This enables concurrent processing of multiple records, ensuring that 
the latency of individual I/O operations no longer has a direct impact on job 
performance. This approach is particularly advantageous in scenarios where I/O 
bandwidth is underutilized and I/O latency is the limiting factor. The 
Disaggregated Storage Architecture, as discussed in FLIP-423, is a prime 
example of a scenario characterized by abundant and easily scalable I/O 
bandwidth coupled with higher I/O latency. The asynchronous state APIs hold 
great promise for significantly enhancing Flink's performance when dealing with 
disaggregated state.




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (FLINK-34973) FLIP-423: Disaggregated State Storage and Management

2024-03-31 Thread Zakelly Lan (Jira)
Zakelly Lan created FLINK-34973:
---

 Summary: FLIP-423: Disaggregated State Storage and Management
 Key: FLINK-34973
 URL: https://issues.apache.org/jira/browse/FLINK-34973
 Project: Flink
  Issue Type: New Feature
Reporter: Zakelly Lan
 Fix For: 2.0.0


The past decade has witnessed a dramatic shift in Flink's deployment mode, 
workload patterns, and hardware improvements. We've moved from the map-reduce 
era where workers are computation-storage tightly coupled nodes to a 
cloud-native world where containerized deployments on Kubernetes become 
standard. To enable Flink's Cloud-Native future, we introduce Disaggregated 
State Storage and Management that uses DFS as primary storage in Flink 2.0, as 
promised in the Flink 2.0 Roadmap.

Detailed design and story: 
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=293046855

Also sub-FLIPs:
- Asynchronous State APIs 
([FLIP-424|https://cwiki.apache.org/confluence/x/SYp3EQ]): Introduce new APIs 
for asynchronous state access. 
- Asynchronous Execution Model 
([FLIP-425|https://cwiki.apache.org/confluence/x/S4p3EQ]): Implement a 
non-blocking execution model leveraging the asynchronous APIs introduced in 
FLIP-424. 
- Grouping Remote State Access 
([FLIP-426|https://cwiki.apache.org/confluence/x/TYp3EQ]): Enable retrieval of 
remote state data in batches to avoid unnecessary round-trip costs for remote 
access. 
- Disaggregated State Store 
([FLIP-427|https://cwiki.apache.org/confluence/x/T4p3EQ]): Introduce the 
initial version of the ForSt disaggregated state store.
- Fault Tolerance/Rescale Integration 
([FLIP-428|https://cwiki.apache.org/confluence/x/UYp3EQ]): Integrate 
checkpointing mechanisms with the disaggregated state store for fault tolerance 
and fast rescaling.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[RESULT][VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Jinzhong Li
I'm happy to announce that FLIP-428: Fault Tolerance/Rescale Integration
for Disaggregated State [1] has been accepted with 9 approving votes (5
binding) [2]:



   - Yuan Mei (binding)
   - Feifan Wang (non-binding)
   - Rui Fan (binding)
   - yue ma (no-binding)
   - Yanfei Lei (binding)
   - Jeyhun Karimov (no-binding)
   - gongzhongqiang (no-binding)
   - Hangxiang Yu (binding)
   - Yun Tang (binding)


There are no disapproving votes. Thanks to everyone who participated in the
discussion and voting.

[1] https://cwiki.apache.org/confluence/x/UYp3EQ

[2] https://lists.apache.org/thread/j7015yymmdy9zp9vr3gg7qpmzjw0o7y2


Best,

Jinzhong Li


[RESULT][VOTE] FLIP-424: Asynchronous State APIs

2024-03-31 Thread Zakelly Lan
Hi devs,

I'm happy to announce that FLIP-424: Asynchronous State APIs [1] has been
accepted with 11 approving votes, 7 of which are binding [2]:

- Xuannan Su
- Yuan Mei (binding)
- Piotr Nowojski (binding)
- Jing Ge (binding)
- Feifan Wang
- Rui Fan (binding)
- Yunfeng Zhou
- Yuepeng Pan
- Xintong Song (binding)
- Roman Khachatryan (binding)
- Yun Tang (binding)

[1] https://cwiki.apache.org/confluence/x/SYp3EQ
[2] https://lists.apache.org/thread/tplyf17n3409l605bo9promf4o8tvl2j


Best,
Zakelly


Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Yun Tang
+1 (binding)

Best
Yun Tang

From: Hangxiang Yu 
Sent: Monday, April 1, 2024 10:21
To: dev@flink.apache.org 
Subject: Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for 
Disaggregated State

+1 (binding)

On Mon, Apr 1, 2024 at 9:24 AM gongzhongqiang 
wrote:

> +1(non-binding)
>
> Best,
>
> Zhongqiang Gong
>
> Jinzhong Li  于2024年3月27日周三 19:31写道:
>
> > Hi devs,
> >
> >
> > I'd like to start a vote on the FLIP-428: Fault Tolerance/Rescale
> > Integration for Disaggregated State [1]. The discussion thread is here
> [2].
> >
> >
> > The vote will be open for at least 72 hours unless there is an objection
> or
> > insufficient votes.
> >
> > [1] https://cwiki.apache.org/confluence/x/UYp3EQ
> >
> > [2] https://lists.apache.org/thread/vr8f91p715ct4lop6b3nr0fh4z5p312b
> >
> > Best,
> >
> > Jinzhong
> >
>


--
Best,
Hangxiang.


Re: [VOTE] FLIP-424: Asynchronous State APIs

2024-03-31 Thread Yun Tang
+1 (binding)

Best
Yun Tang

From: Zakelly Lan 
Sent: Monday, April 1, 2024 10:08
To: dev@flink.apache.org 
Subject: Re: [VOTE] FLIP-424: Asynchronous State APIs

Thank you all for the votes! I'm closing this thread and the result will be
posted in a separate mail.


Best,
Zakelly

On Sat, Mar 30, 2024 at 4:43 PM Roman Khachatryan  wrote:

> +1 (binding)
>
> Regards,
> Roman
>
>
> On Fri, Mar 29, 2024 at 7:01 AM Xintong Song 
> wrote:
>
> > +1 (binding)
> >
> > Best,
> >
> > Xintong
> >
> >
> >
> > On Fri, Mar 29, 2024 at 12:51 PM Yuepeng Pan 
> > wrote:
> >
> > > +1(non-binding)
> > >
> > > Best,
> > > Yuepeng Pan
> > >
> > >
> > > On 2024/03/29 03:03:53 Yunfeng Zhou wrote:
> > > > +1 (non-binding)
> > > >
> > > > Best,
> > > > Yunfeng
> > > >
> > > > On Wed, Mar 27, 2024 at 6:23 PM Zakelly Lan 
> > > wrote:
> > > > >
> > > > > Hi devs,
> > > > >
> > > > > I'd like to start a vote on the FLIP-424: Asynchronous State APIs
> > [1].
> > > The
> > > > > discussion thread is here [2].
> > > > >
> > > > > The vote will be open for at least 72 hours unless there is an
> > > objection or
> > > > > insufficient votes.
> > > > >
> > > > > [1] https://cwiki.apache.org/confluence/x/SYp3EQ
> > > > > [2]
> https://lists.apache.org/thread/nmd9qd0k8l94ygcfgllxms49wmtz1864
> > > > >
> > > > >
> > > > > Best,
> > > > > Zakelly
> > > >
> > >
> >
>


[RESULT][VOTE] FLIP-426: Grouping Remote State Access

2024-03-31 Thread Jinzhong Li
I'm happy to announce that FLIP-426: Grouping Remote State Access [1] has
been accepted with 8 approving votes (4 binding) [2]:

   - Yuan Mei (binding)
   - Feifan Wang (non-binding)
   - Rui Fan (binding)
   - gongzhongqiang (no-binding)
   - Yuepeng Pan (no-binding)
   - yue ma (no-binding)
   - Yanfei Lei (binding)
   - Hangxiang Yu (binding)

There are no disapproving votes. Thanks to everyone who participated
in the discussion and voting.

[1] https://cwiki.apache.org/confluence/x/TYp3EQ

[2] https://lists.apache.org/thread/0yvspwhsf0vyvqkkd7snzg33k95v3jbc


Best,

Jinzhong Li


Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level Project

2024-03-31 Thread Feng Jin
Congratulations!

Best,
Feng Jin

On Mon, Apr 1, 2024 at 10:51 AM weijie guo 
wrote:

> Congratulations!
>
> Best regards,
>
> Weijie
>
>
> Hang Ruan  于2024年4月1日周一 09:49写道:
>
> > Congratulations!
> >
> > Best,
> > Hang
> >
> > Lincoln Lee  于2024年3月31日周日 00:10写道:
> >
> > > Congratulations!
> > >
> > > Best,
> > > Lincoln Lee
> > >
> > >
> > > Jark Wu  于2024年3月30日周六 22:13写道:
> > >
> > > > Congratulations!
> > > >
> > > > Best,
> > > > Jark
> > > >
> > > > On Fri, 29 Mar 2024 at 12:08, Yun Tang  wrote:
> > > >
> > > > > Congratulations to all Paimon guys!
> > > > >
> > > > > Glad to see a Flink sub-project has been graduated to an Apache
> > > top-level
> > > > > project.
> > > > >
> > > > > Best
> > > > > Yun Tang
> > > > >
> > > > > 
> > > > > From: Hangxiang Yu 
> > > > > Sent: Friday, March 29, 2024 10:32
> > > > > To: dev@flink.apache.org 
> > > > > Subject: Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level
> > > > Project
> > > > >
> > > > > Congratulations!
> > > > >
> > > > > On Fri, Mar 29, 2024 at 10:27 AM Benchao Li 
> > > > wrote:
> > > > >
> > > > > > Congratulations!
> > > > > >
> > > > > > Zakelly Lan  于2024年3月29日周五 10:25写道:
> > > > > > >
> > > > > > > Congratulations!
> > > > > > >
> > > > > > >
> > > > > > > Best,
> > > > > > > Zakelly
> > > > > > >
> > > > > > > On Thu, Mar 28, 2024 at 10:13 PM Jing Ge
> > >  > > > >
> > > > > > wrote:
> > > > > > >
> > > > > > > > Congrats!
> > > > > > > >
> > > > > > > > Best regards,
> > > > > > > > Jing
> > > > > > > >
> > > > > > > > On Thu, Mar 28, 2024 at 1:27 PM Feifan Wang <
> > zoltar9...@163.com>
> > > > > > wrote:
> > > > > > > >
> > > > > > > > > Congratulations!——
> > > > > > > > >
> > > > > > > > > Best regards,
> > > > > > > > >
> > > > > > > > > Feifan Wang
> > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > At 2024-03-28 20:02:43, "Yanfei Lei" 
> > > > wrote:
> > > > > > > > > >Congratulations!
> > > > > > > > > >
> > > > > > > > > >Best,
> > > > > > > > > >Yanfei
> > > > > > > > > >
> > > > > > > > > >Zhanghao Chen  于2024年3月28日周四
> > > > 19:59写道:
> > > > > > > > > >>
> > > > > > > > > >> Congratulations!
> > > > > > > > > >>
> > > > > > > > > >> Best,
> > > > > > > > > >> Zhanghao Chen
> > > > > > > > > >> 
> > > > > > > > > >> From: Yu Li 
> > > > > > > > > >> Sent: Thursday, March 28, 2024 15:55
> > > > > > > > > >> To: d...@paimon.apache.org 
> > > > > > > > > >> Cc: dev ; user <
> > u...@flink.apache.org
> > > >
> > > > > > > > > >> Subject: Re: [ANNOUNCE] Apache Paimon is graduated to
> Top
> > > > Level
> > > > > > > > Project
> > > > > > > > > >>
> > > > > > > > > >> CC the Flink user and dev mailing list.
> > > > > > > > > >>
> > > > > > > > > >> Paimon originated within the Flink community, initially
> > > known
> > > > as
> > > > > > Flink
> > > > > > > > > >> Table Store, and all our incubating mentors are members
> of
> > > the
> > > > > > Flink
> > > > > > > > > >> Project Management Committee. I am confident that the
> > bonds
> > > of
> > > > > > > > > >> enduring friendship and close collaboration will
> continue
> > to
> > > > > > unite the
> > > > > > > > > >> two communities.
> > > > > > > > > >>
> > > > > > > > > >> And congratulations all!
> > > > > > > > > >>
> > > > > > > > > >> Best Regards,
> > > > > > > > > >> Yu
> > > > > > > > > >>
> > > > > > > > > >> On Wed, 27 Mar 2024 at 20:35, Guojun Li <
> > > > > gjli.schna...@gmail.com>
> > > > > > > > > wrote:
> > > > > > > > > >> >
> > > > > > > > > >> > Congratulations!
> > > > > > > > > >> >
> > > > > > > > > >> > Best,
> > > > > > > > > >> > Guojun
> > > > > > > > > >> >
> > > > > > > > > >> > On Wed, Mar 27, 2024 at 5:24 PM wulin <
> > > ouyangwu...@163.com>
> > > > > > wrote:
> > > > > > > > > >> >
> > > > > > > > > >> > > Congratulations~
> > > > > > > > > >> > >
> > > > > > > > > >> > > > 2024年3月27日 15:54,王刚  > .INVALID>
> > > > 写道:
> > > > > > > > > >> > > >
> > > > > > > > > >> > > > Congratulations~
> > > > > > > > > >> > > >
> > > > > > > > > >> > > >> 2024年3月26日 10:25,Jingsong Li <
> > jingsongl...@gmail.com
> > > >
> > > > > 写道:
> > > > > > > > > >> > > >>
> > > > > > > > > >> > > >> Hi Paimon community,
> > > > > > > > > >> > > >>
> > > > > > > > > >> > > >> I’m glad to announce that the ASF board has
> > approved
> > > a
> > > > > > > > > resolution to
> > > > > > > > > >> > > >> graduate Paimon into a full Top Level Project.
> > Thanks
> > > > to
> > > > > > > > > everyone for
> > > > > > > > > >> > > >> your help to get to this point.
> > > > > > > > > >> > > >>
> > > > > > > > > >> > > >> I just created an issue to track the things we
> need
> > > to
> > > > > > modify
> > > > > > > > > [2],
> > > > > > > > > >> > > >> please comment on it if you feel that something
> is
> > > > > > missing. You
> > > > > > > > > can
> > > > > > > > > >> > > >> refer to apache documentation [1] too.
> > > > > > > > > >> > > >>
> > > > > > > > > 

Re: [DISCUSS] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Jinzhong Li
Hi Yun,

Thanks for your advice. I have added the topic of remote working files
cleanup to the current FLIP.

Best,
Jinzhong

On Sat, Mar 30, 2024 at 10:44 AM Yun Tang  wrote:

> Hi Jinzhong,
>
> Yes, I know the cleanup mechanism for the remote working directory is the
> same as the current Rocksdb state-backend. However, the impact of the
> residual files in the remote working directory is different compared with
> residual files in the local directory, especially Flink just try the best
> to clean up during stateBackend#dispose.
>
> I agree that we could leave the optimization in the future FLIP, however,
> I think we should mention this topic in the current FLIP to make the
> overall design more complete and sophisticated.
>
>
> Best
> Yun Tang
> 
> From: Jinzhong Li 
> Sent: Thursday, March 28, 2024 12:45
> To: dev@flink.apache.org 
> Subject: Re: [DISCUSS] FLIP-428: Fault Tolerance/Rescale Integration for
> Disaggregated State
>
> Hi Feifan,
>
> Sorry for the misunderstanding. As Hangxiang explained, the basic cleanup
> mechanism for remote working directory is the same as rocksdb-statebackend,
> that is, when TM exits, forst-statebackend will delete the entire working
> dir. Regarding orphaned files cleanup in the case of TM crash, we will
> address it in the future FLIP.
>
> Best,
> Jinzhong
>
> On Thu, Mar 28, 2024 at 12:35 PM Hangxiang Yu  wrote:
>
> > Hi, Yun and Feifan.
> >
> > Thanks for your reply.
> >
> > About the cleanup of working dir, as mentioned in FLIP-427, "The life
> cycle
> > of working dir is managed as before local strategy.".
> > Since the current working dir and checkpoint dir are separate, The life
> > cycle including creating and cleanup of working dir could be aligned with
> > before easily.
> >
> > On Thu, Mar 28, 2024 at 12:07 PM Feifan Wang  wrote:
> >
> > > And I think the cleanup of working dir should be discussion in
> > FLIP-427[1]
> > > ( this mail list [2]) ?
> > >
> > >
> > > [1] https://cwiki.apache.org/confluence/x/T4p3EQ
> > > [2] https://lists.apache.org/thread/vktfzqvb7t4rltg7fdlsyd9sfdmrc4ft
> > >
> > > ——
> > >
> > > Best regards,
> > >
> > > Feifan Wang
> > >
> > >
> > >
> > >
> > > At 2024-03-28 11:56:22, "Feifan Wang"  wrote:
> > > >Hi Jinzhong :
> > > >
> > > >
> > > >> I suggest that we could postpone this topic for now and consider it
> > > comprehensively combined with the TM ownership file management in the
> > > future FLIP.
> > > >
> > > >
> > > >Sorry I still think we should consider the cleanup of the working dir
> in
> > > this FLIP, although we may come up with a better solution in a
> subsequent
> > > flip, I think it is important to maintain the integrity of the current
> > > changes. Otherwise we may suffer from wasted DFS space for some time.
> > > >Perhaps we only need a simple cleanup strategy at this stage, such as
> > > proactive cleanup when TM exits. While this may fail in the case of a
> TM
> > > crash, it already alleviates the problem.
> > > >
> > > >
> > > >
> > > >
> > > >——
> > > >
> > > >Best regards,
> > > >
> > > >Feifan Wang
> > > >
> > > >
> > > >
> > > >
> > > >At 2024-03-28 11:15:11, "Jinzhong Li" 
> wrote:
> > > >>Hi Yun,
> > > >>
> > > >>Thanks for your reply.
> > > >>
> > > >>> 1. Why must we have another 'subTask-checkpoint-sub-dir'
> > > >>> under the shared directory? if we don't consider making
> > > >>> TM ownership in this FLIP, this design seems unnecessary.
> > > >>
> > > >> Good catch! We will not change the directory layout of shared
> > directory
> > > in
> > > >>this FLIP. I have already removed this part from this FLIP. I think
> we
> > > >>could revisit this topic in a future FLIP about TM ownership.
> > > >>
> > > >>> 2. This FLIP forgets to mention the cleanup of the remote
> > > >>> working directory in case of the taskmanager crushes,
> > > >>> even though this is an open problem, we can still leave
> > > >>> some space for future optimization.
> > > >>
> > > >>Considering that we have plans to merge TM working dir and checkpoint
> > dir
> > > >>into one directory, I suggest that we could postpone this topic for
> now
> > > and
> > > >>consider it comprehensively combined with the TM ownership file
> > > management
> > > >>in the future FLIP.
> > > >>
> > > >>Best,
> > > >>Jinzhong
> > > >>
> > > >>
> > > >>
> > > >>On Wed, Mar 27, 2024 at 11:49 PM Yun Tang  wrote:
> > > >>
> > > >>> Hi Jinzhong,
> > > >>>
> > > >>> The overall design looks good.
> > > >>>
> > > >>> I have two minor questions:
> > > >>>
> > > >>> 1. Why must we have another 'subTask-checkpoint-sub-dir' under the
> > > shared
> > > >>> directory? if we don't consider making TM ownership in this FLIP,
> > this
> > > >>> design seems unnecessary.
> > > >>> 2. This FLIP forgets to mention the cleanup of the remote working
> > > >>> directory in case of the taskmanager crushes, even though this is
> an
> > > open
> > > >>> problem, we can still leave some space for future 

Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level Project

2024-03-31 Thread weijie guo
Congratulations!

Best regards,

Weijie


Hang Ruan  于2024年4月1日周一 09:49写道:

> Congratulations!
>
> Best,
> Hang
>
> Lincoln Lee  于2024年3月31日周日 00:10写道:
>
> > Congratulations!
> >
> > Best,
> > Lincoln Lee
> >
> >
> > Jark Wu  于2024年3月30日周六 22:13写道:
> >
> > > Congratulations!
> > >
> > > Best,
> > > Jark
> > >
> > > On Fri, 29 Mar 2024 at 12:08, Yun Tang  wrote:
> > >
> > > > Congratulations to all Paimon guys!
> > > >
> > > > Glad to see a Flink sub-project has been graduated to an Apache
> > top-level
> > > > project.
> > > >
> > > > Best
> > > > Yun Tang
> > > >
> > > > 
> > > > From: Hangxiang Yu 
> > > > Sent: Friday, March 29, 2024 10:32
> > > > To: dev@flink.apache.org 
> > > > Subject: Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level
> > > Project
> > > >
> > > > Congratulations!
> > > >
> > > > On Fri, Mar 29, 2024 at 10:27 AM Benchao Li 
> > > wrote:
> > > >
> > > > > Congratulations!
> > > > >
> > > > > Zakelly Lan  于2024年3月29日周五 10:25写道:
> > > > > >
> > > > > > Congratulations!
> > > > > >
> > > > > >
> > > > > > Best,
> > > > > > Zakelly
> > > > > >
> > > > > > On Thu, Mar 28, 2024 at 10:13 PM Jing Ge
> >  > > >
> > > > > wrote:
> > > > > >
> > > > > > > Congrats!
> > > > > > >
> > > > > > > Best regards,
> > > > > > > Jing
> > > > > > >
> > > > > > > On Thu, Mar 28, 2024 at 1:27 PM Feifan Wang <
> zoltar9...@163.com>
> > > > > wrote:
> > > > > > >
> > > > > > > > Congratulations!——
> > > > > > > >
> > > > > > > > Best regards,
> > > > > > > >
> > > > > > > > Feifan Wang
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > At 2024-03-28 20:02:43, "Yanfei Lei" 
> > > wrote:
> > > > > > > > >Congratulations!
> > > > > > > > >
> > > > > > > > >Best,
> > > > > > > > >Yanfei
> > > > > > > > >
> > > > > > > > >Zhanghao Chen  于2024年3月28日周四
> > > 19:59写道:
> > > > > > > > >>
> > > > > > > > >> Congratulations!
> > > > > > > > >>
> > > > > > > > >> Best,
> > > > > > > > >> Zhanghao Chen
> > > > > > > > >> 
> > > > > > > > >> From: Yu Li 
> > > > > > > > >> Sent: Thursday, March 28, 2024 15:55
> > > > > > > > >> To: d...@paimon.apache.org 
> > > > > > > > >> Cc: dev ; user <
> u...@flink.apache.org
> > >
> > > > > > > > >> Subject: Re: [ANNOUNCE] Apache Paimon is graduated to Top
> > > Level
> > > > > > > Project
> > > > > > > > >>
> > > > > > > > >> CC the Flink user and dev mailing list.
> > > > > > > > >>
> > > > > > > > >> Paimon originated within the Flink community, initially
> > known
> > > as
> > > > > Flink
> > > > > > > > >> Table Store, and all our incubating mentors are members of
> > the
> > > > > Flink
> > > > > > > > >> Project Management Committee. I am confident that the
> bonds
> > of
> > > > > > > > >> enduring friendship and close collaboration will continue
> to
> > > > > unite the
> > > > > > > > >> two communities.
> > > > > > > > >>
> > > > > > > > >> And congratulations all!
> > > > > > > > >>
> > > > > > > > >> Best Regards,
> > > > > > > > >> Yu
> > > > > > > > >>
> > > > > > > > >> On Wed, 27 Mar 2024 at 20:35, Guojun Li <
> > > > gjli.schna...@gmail.com>
> > > > > > > > wrote:
> > > > > > > > >> >
> > > > > > > > >> > Congratulations!
> > > > > > > > >> >
> > > > > > > > >> > Best,
> > > > > > > > >> > Guojun
> > > > > > > > >> >
> > > > > > > > >> > On Wed, Mar 27, 2024 at 5:24 PM wulin <
> > ouyangwu...@163.com>
> > > > > wrote:
> > > > > > > > >> >
> > > > > > > > >> > > Congratulations~
> > > > > > > > >> > >
> > > > > > > > >> > > > 2024年3月27日 15:54,王刚  .INVALID>
> > > 写道:
> > > > > > > > >> > > >
> > > > > > > > >> > > > Congratulations~
> > > > > > > > >> > > >
> > > > > > > > >> > > >> 2024年3月26日 10:25,Jingsong Li <
> jingsongl...@gmail.com
> > >
> > > > 写道:
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> Hi Paimon community,
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> I’m glad to announce that the ASF board has
> approved
> > a
> > > > > > > > resolution to
> > > > > > > > >> > > >> graduate Paimon into a full Top Level Project.
> Thanks
> > > to
> > > > > > > > everyone for
> > > > > > > > >> > > >> your help to get to this point.
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> I just created an issue to track the things we need
> > to
> > > > > modify
> > > > > > > > [2],
> > > > > > > > >> > > >> please comment on it if you feel that something is
> > > > > missing. You
> > > > > > > > can
> > > > > > > > >> > > >> refer to apache documentation [1] too.
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> And, we already completed the GitHub repo migration
> > > [3],
> > > > > please
> > > > > > > > update
> > > > > > > > >> > > >> your local git repo to track the new repo [4].
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> You can run the following command to complete the
> > > remote
> > > > > repo
> > > > > > > > tracking
> > > > > > > > >> > > >> migration.
> > > > > > > > >> > > >>
> > > > > > > > >> > > >> git remote set-url 

Re: [VOTE] FLIP-426: Grouping Remote State Access

2024-03-31 Thread Hangxiang Yu
+1 (binding)

On Sun, Mar 31, 2024 at 9:02 PM Yanfei Lei  wrote:

> +1 (binding)
>
> Best,
> Yanfei
>
> yue ma  于2024年3月29日周五 16:10写道:
> >
> > +1 (non-binding)
> >
> > Jinzhong Li  于2024年3月27日周三 18:57写道:
> >
> > > Hi devs,
> > >
> > > I'd like to start a vote on the FLIP-426: Grouping Remote State Access
> [1].
> > > The discussion thread is here [2].
> > >
> > > The vote will be open for at least 72 hours unless there is an
> objection or
> > > insufficient votes.
> > >
> > >
> > > [1] https://cwiki.apache.org/confluence/x/TYp3EQ
> > >
> > > [2] https://lists.apache.org/thread/bt931focfl9971cwq194trmf3pkdsxrf
> > >
> > >
> > > Best,
> > >
> > > Jinzhong
> > >
> >
> >
> > --
> > Best,
> > Yue
>


-- 
Best,
Hangxiang.


[RESULT][VOTE] FLIP-427: Disaggregated state Store

2024-03-31 Thread Hangxiang Yu
I'm happy to announce that FLIP-427: Disaggregated state Store[1]
has been accepted with 7 approving votes (4 binding) [2]:

- Yuan Mei (binding)
- Feifan Wang (non-binding)
- Piotr Nowojski (binding)
- Rui Fan (binding)
- Yun Tang (binding)
- Yuepeng Pan (non-binding)
- yue ma (non-binding)

There are no disapproving votes. Thanks to everyone who participated
in the discussion and voting.

[1] https://cwiki.apache.org/confluence/x/T4p3EQ
[2] https://lists.apache.org/thread/lqpovwx3kpszgfcgl0olfhxotfo9d6nz


Best,
Hangxiang


Re: [VOTE] FLIP-427: Disaggregated state Store

2024-03-31 Thread Hangxiang Yu
Thanks all for the votes!
I'm closing the vote and the result will be posted in a separate mail.

On Fri, Mar 29, 2024 at 4:29 PM yue ma  wrote:

> +1(non-binding)
>
> Hangxiang Yu  于2024年3月27日周三 18:37写道:
>
> > Hi devs,
> >
> > Thanks all for your valuable feedback about FLIP-427: Disaggregated state
> > Store [1].
> > I'd like to start a vote on it.  The discussion thread is here [2].
> >
> > The vote will be open for at least 72 hours unless there is an objection
> or
> > insufficient votes.
> >
> > [1] https://cwiki.apache.org/confluence/x/T4p3EQ
> > [2] https://lists.apache.org/thread/vktfzqvb7t4rltg7fdlsyd9sfdmrc4ft
> >
> >
> > Best,
> > Hangxiang
> >
>
>
> --
> Best,
> Yue
>


-- 
Best,
Hangxiang.


Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Hangxiang Yu
+1 (binding)

On Mon, Apr 1, 2024 at 9:24 AM gongzhongqiang 
wrote:

> +1(non-binding)
>
> Best,
>
> Zhongqiang Gong
>
> Jinzhong Li  于2024年3月27日周三 19:31写道:
>
> > Hi devs,
> >
> >
> > I'd like to start a vote on the FLIP-428: Fault Tolerance/Rescale
> > Integration for Disaggregated State [1]. The discussion thread is here
> [2].
> >
> >
> > The vote will be open for at least 72 hours unless there is an objection
> or
> > insufficient votes.
> >
> > [1] https://cwiki.apache.org/confluence/x/UYp3EQ
> >
> > [2] https://lists.apache.org/thread/vr8f91p715ct4lop6b3nr0fh4z5p312b
> >
> > Best,
> >
> > Jinzhong
> >
>


-- 
Best,
Hangxiang.


Re: [DISCUSS] FLIP-427: Disaggregated State Store

2024-03-31 Thread Hangxiang Yu
Hi Yun.
Thanks for the great suggestion.
I just added related information into the FLIP.

On Sat, Mar 30, 2024 at 10:49 AM Yun Tang  wrote:

> Hi Feifan,
>
> I just replied in the discussion of FLIP-428. I agree that we could leave
> the clean-up optimization in the future FLIP, however, I think we should
> mention this topic explicitly in the current FLIP to make the overall
> design complete and more sophisticated.
>
> Best
> Yun Tang
> 
> From: Feifan Wang 
> Sent: Thursday, March 28, 2024 12:35
> To: dev@flink.apache.org 
> Subject: Re: [DISCUSS] FLIP-427: Disaggregated State Store
>
> Thanks for your reply, Hangxiang. I totally agree with you about the jni
> part.
>
> Hi Yun Tang, I just noticed that FLIP-427 mentions “The life cycle of
> working dir is managed as before local strategy.” IIUC, the working dir
> will be deleted after TaskManager exit. And I think that's enough for
> current stage, WDYT ?
>
> ——
>
> Best regards,
>
> Feifan Wang
>
>
>
>
> At 2024-03-28 12:18:56, "Hangxiang Yu"  wrote:
> >Hi, Feifan.
> >
> >Thanks for your reply.
> >
> >What if we only use jni to access DFS that needs to reuse Flink
> FileSystem?
> >> And all local disk access through native api. This idea is based on the
> >> understanding that jni overhead is not worth mentioning compared to DFS
> >> access latency. It might make more sense to consider avoiding jni
> overhead
> >> for faster local disks. Since local disk as secondary is already under
> >> consideration [1], maybe we can discuss in that FLIP whether to use
> native
> >> api to access local disk?
> >>
> >This is a good suggestion. It's reasonable to use native api to access
> >local disk cache since it requires lower latency compared to remote
> access.
> >I also believe that the jni overhead is relatively negligible when weighed
> >against the latency of remote I/O as mentioned in the FLIP.
> >So I think we could just go on proposal 2 and keep proposal 1 as a
> >potential future optimization, which could work better when there is a
> >higher performance requirement or some native libraries of filesystems
> have
> >significantly higher performance and resource usage compared to their java
> >libs.
> >
> >
> >On Thu, Mar 28, 2024 at 11:39 AM Feifan Wang  wrote:
> >
> >> Thanks for this valuable proposal Hangxiang !
> >>
> >>
> >> > If we need to introduce a JNI call during each filesystem call, that
> >> would be N times JNI cost compared with the current RocksDB
> state-backend's
> >> JNI cost.
> >> What if we only use jni to access DFS that needs to reuse Flink
> >> FileSystem? And all local disk access through native api. This idea is
> >> based on the understanding that jni overhead is not worth mentioning
> >> compared to DFS access latency. It might make more sense to consider
> >> avoiding jni overhead for faster local disks. Since local disk as
> secondary
> >> is already under consideration [1], maybe we can discuss in that FLIP
> >> whether to use native api to access local disk?
> >>
> >>
> >> >I'd suggest keeping `state.backend.forSt.working-dir` as it is for now.
> >> >Different disaggregated state storages may have their own semantics
> about
> >> >this configuration, e.g. life cycle, supported file systems or
> storages.
> >> I agree with considering moving this configuration up to the engine
> level
> >> until there are other disaggreated backends.
> >>
> >>
> >> [1] https://cwiki.apache.org/confluence/x/U4p3EQ
> >>
> >> ——
> >>
> >> Best regards,
> >>
> >> Feifan Wang
> >>
> >>
> >>
> >>
> >> At 2024-03-28 09:55:48, "Hangxiang Yu"  wrote:
> >> >Hi, Yun.
> >> >Thanks for the reply.
> >> >
> >> >The JNI cost you considered is right. As replied to Yue, I agreed to
> leave
> >> >space and consider proposal 1 as an optimization in the future, which
> is
> >> >also updated in the FLIP.
> >> >
> >> >The other question is that the configuration of
> >> >> `state.backend.forSt.working-dir` looks too coupled with the ForSt
> >> >> state-backend, how would it be if we introduce another disaggregated
> >> state
> >> >> storage? Thus, I think `state.backend.disaggregated.working-dir`
> might
> >> be a
> >> >> better configuration name.
> >> >
> >> >I'd suggest keeping `state.backend.forSt.working-dir` as it is for now.
> >> >Different disaggregated state storages may have their own semantics
> about
> >> >this configuration, e.g. life cycle, supported file systems or
> storages.
> >> >Maybe it's more suitable to consider it together when we introduce
> other
> >> >disaggregated state storages in the future.
> >> >
> >> >On Thu, Mar 28, 2024 at 12:02 AM Yun Tang  wrote:
> >> >
> >> >> Hi Hangxiang,
> >> >>
> >> >> The design looks good, and I also support leaving space for proposal
> 1.
> >> >>
> >> >> As you know, loading index/filter/data blocks for querying across
> levels
> >> >> would introduce high IO access within the LSM tree for old data. If
> we
> >> need
> >> >> to introduce a JNI call 

Re: [VOTE] FLIP-424: Asynchronous State APIs

2024-03-31 Thread Zakelly Lan
Thank you all for the votes! I'm closing this thread and the result will be
posted in a separate mail.


Best,
Zakelly

On Sat, Mar 30, 2024 at 4:43 PM Roman Khachatryan  wrote:

> +1 (binding)
>
> Regards,
> Roman
>
>
> On Fri, Mar 29, 2024 at 7:01 AM Xintong Song 
> wrote:
>
> > +1 (binding)
> >
> > Best,
> >
> > Xintong
> >
> >
> >
> > On Fri, Mar 29, 2024 at 12:51 PM Yuepeng Pan 
> > wrote:
> >
> > > +1(non-binding)
> > >
> > > Best,
> > > Yuepeng Pan
> > >
> > >
> > > On 2024/03/29 03:03:53 Yunfeng Zhou wrote:
> > > > +1 (non-binding)
> > > >
> > > > Best,
> > > > Yunfeng
> > > >
> > > > On Wed, Mar 27, 2024 at 6:23 PM Zakelly Lan 
> > > wrote:
> > > > >
> > > > > Hi devs,
> > > > >
> > > > > I'd like to start a vote on the FLIP-424: Asynchronous State APIs
> > [1].
> > > The
> > > > > discussion thread is here [2].
> > > > >
> > > > > The vote will be open for at least 72 hours unless there is an
> > > objection or
> > > > > insufficient votes.
> > > > >
> > > > > [1] https://cwiki.apache.org/confluence/x/SYp3EQ
> > > > > [2]
> https://lists.apache.org/thread/nmd9qd0k8l94ygcfgllxms49wmtz1864
> > > > >
> > > > >
> > > > > Best,
> > > > > Zakelly
> > > >
> > >
> >
>


Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level Project

2024-03-31 Thread Hang Ruan
Congratulations!

Best,
Hang

Lincoln Lee  于2024年3月31日周日 00:10写道:

> Congratulations!
>
> Best,
> Lincoln Lee
>
>
> Jark Wu  于2024年3月30日周六 22:13写道:
>
> > Congratulations!
> >
> > Best,
> > Jark
> >
> > On Fri, 29 Mar 2024 at 12:08, Yun Tang  wrote:
> >
> > > Congratulations to all Paimon guys!
> > >
> > > Glad to see a Flink sub-project has been graduated to an Apache
> top-level
> > > project.
> > >
> > > Best
> > > Yun Tang
> > >
> > > 
> > > From: Hangxiang Yu 
> > > Sent: Friday, March 29, 2024 10:32
> > > To: dev@flink.apache.org 
> > > Subject: Re: Re: [ANNOUNCE] Apache Paimon is graduated to Top Level
> > Project
> > >
> > > Congratulations!
> > >
> > > On Fri, Mar 29, 2024 at 10:27 AM Benchao Li 
> > wrote:
> > >
> > > > Congratulations!
> > > >
> > > > Zakelly Lan  于2024年3月29日周五 10:25写道:
> > > > >
> > > > > Congratulations!
> > > > >
> > > > >
> > > > > Best,
> > > > > Zakelly
> > > > >
> > > > > On Thu, Mar 28, 2024 at 10:13 PM Jing Ge
>  > >
> > > > wrote:
> > > > >
> > > > > > Congrats!
> > > > > >
> > > > > > Best regards,
> > > > > > Jing
> > > > > >
> > > > > > On Thu, Mar 28, 2024 at 1:27 PM Feifan Wang 
> > > > wrote:
> > > > > >
> > > > > > > Congratulations!——
> > > > > > >
> > > > > > > Best regards,
> > > > > > >
> > > > > > > Feifan Wang
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > >
> > > > > > > At 2024-03-28 20:02:43, "Yanfei Lei" 
> > wrote:
> > > > > > > >Congratulations!
> > > > > > > >
> > > > > > > >Best,
> > > > > > > >Yanfei
> > > > > > > >
> > > > > > > >Zhanghao Chen  于2024年3月28日周四
> > 19:59写道:
> > > > > > > >>
> > > > > > > >> Congratulations!
> > > > > > > >>
> > > > > > > >> Best,
> > > > > > > >> Zhanghao Chen
> > > > > > > >> 
> > > > > > > >> From: Yu Li 
> > > > > > > >> Sent: Thursday, March 28, 2024 15:55
> > > > > > > >> To: d...@paimon.apache.org 
> > > > > > > >> Cc: dev ; user  >
> > > > > > > >> Subject: Re: [ANNOUNCE] Apache Paimon is graduated to Top
> > Level
> > > > > > Project
> > > > > > > >>
> > > > > > > >> CC the Flink user and dev mailing list.
> > > > > > > >>
> > > > > > > >> Paimon originated within the Flink community, initially
> known
> > as
> > > > Flink
> > > > > > > >> Table Store, and all our incubating mentors are members of
> the
> > > > Flink
> > > > > > > >> Project Management Committee. I am confident that the bonds
> of
> > > > > > > >> enduring friendship and close collaboration will continue to
> > > > unite the
> > > > > > > >> two communities.
> > > > > > > >>
> > > > > > > >> And congratulations all!
> > > > > > > >>
> > > > > > > >> Best Regards,
> > > > > > > >> Yu
> > > > > > > >>
> > > > > > > >> On Wed, 27 Mar 2024 at 20:35, Guojun Li <
> > > gjli.schna...@gmail.com>
> > > > > > > wrote:
> > > > > > > >> >
> > > > > > > >> > Congratulations!
> > > > > > > >> >
> > > > > > > >> > Best,
> > > > > > > >> > Guojun
> > > > > > > >> >
> > > > > > > >> > On Wed, Mar 27, 2024 at 5:24 PM wulin <
> ouyangwu...@163.com>
> > > > wrote:
> > > > > > > >> >
> > > > > > > >> > > Congratulations~
> > > > > > > >> > >
> > > > > > > >> > > > 2024年3月27日 15:54,王刚 
> > 写道:
> > > > > > > >> > > >
> > > > > > > >> > > > Congratulations~
> > > > > > > >> > > >
> > > > > > > >> > > >> 2024年3月26日 10:25,Jingsong Li  >
> > > 写道:
> > > > > > > >> > > >>
> > > > > > > >> > > >> Hi Paimon community,
> > > > > > > >> > > >>
> > > > > > > >> > > >> I’m glad to announce that the ASF board has approved
> a
> > > > > > > resolution to
> > > > > > > >> > > >> graduate Paimon into a full Top Level Project. Thanks
> > to
> > > > > > > everyone for
> > > > > > > >> > > >> your help to get to this point.
> > > > > > > >> > > >>
> > > > > > > >> > > >> I just created an issue to track the things we need
> to
> > > > modify
> > > > > > > [2],
> > > > > > > >> > > >> please comment on it if you feel that something is
> > > > missing. You
> > > > > > > can
> > > > > > > >> > > >> refer to apache documentation [1] too.
> > > > > > > >> > > >>
> > > > > > > >> > > >> And, we already completed the GitHub repo migration
> > [3],
> > > > please
> > > > > > > update
> > > > > > > >> > > >> your local git repo to track the new repo [4].
> > > > > > > >> > > >>
> > > > > > > >> > > >> You can run the following command to complete the
> > remote
> > > > repo
> > > > > > > tracking
> > > > > > > >> > > >> migration.
> > > > > > > >> > > >>
> > > > > > > >> > > >> git remote set-url origin
> > > > https://github.com/apache/paimon.git
> > > > > > > >> > > >>
> > > > > > > >> > > >> If you have a different name, please change the
> > 'origin'
> > > to
> > > > > > your
> > > > > > > remote
> > > > > > > >> > > name.
> > > > > > > >> > > >>
> > > > > > > >> > > >> Please join me in celebrating!
> > > > > > > >> > > >>
> > > > > > > >> > > >> [1]
> > > > > > > >> > >
> > > > > > >
> > > > > >
> > > >
> > >
> >
> https://incubator.apache.org/guides/transferring.html#life_after_graduation
> > > > > > > 

Re: Bug report for reading Hive table as streaming source.

2024-03-31 Thread yuxia
Thanks for reporting. Could you please help create a jira about it?

Best regards,
Yuxia

- 原始邮件 -
发件人: "Xiaolong Wang" 
收件人: "dev" 
发送时间: 星期四, 2024年 3 月 28日 下午 5:11:20
主题: Re: Bug report for reading Hive table as streaming source.

I think it worth mentioning in the documentation of Hive read that it
cannot read a table that has more than 32,767 partitions.

On Thu, Mar 28, 2024 at 5:10 PM Xiaolong Wang 
wrote:

> Found out the reason:
>
> It turned out that in Flink, it uses hive’s IMetaStoreClient to fetch
> partitions using the following method:
>
>   List listPartitionNames(String db_name, String tbl_name,
> short max_parts) throws MetaException, TException;
>
> where the max_parts represents the max number of partitions it can fetch
> from the Hive metastore.
> So the max number of partitions it can fetch is Short.MAX_VALUE, which is
> 32767 .
>
> But the table has a way more partition number than the max value, thus the
> list partition operations cannot fetch all partitions, hence it cannot
> consume the recent partition.
>
> On Tue, Mar 26, 2024 at 5:00 PM Xiaolong Wang 
> wrote:
>
>> Hi,
>>
>> I found a weird bug when reading a Hive table as a streaming source.
>>
>> In summary, if the first partition is not time related, then the Hive
>> table cannot be read as a streaming source.
>>
>> e.g.
>>
>> I've a Hive table in the definition of
>>
>> ```
>> CREATE TABLE article (
>> id BIGINT,
>> edition STRING,
>> dt STRING,
>> hh STRING
>> )
>> PARTITIONED BY (edition, dt, hh)
>> USING orc;
>> ```
>> Then I try to query it as a streaming source:
>>
>> ```
>> INSERT INTO kafka_sink
>> SELECT id
>> FROM article /*+ OPTIONS('streaming-source.enable' = 'true',
>> 'streaming-source.partition-order' = 'partition-name',
>> 'streaming-source.consume-start-offset' =
>> 'edition=en_US/dt=2024-03-26/hh=00') */
>> ```
>>
>> And I see no output in the `kafka_sink`.
>>
>> Then I defined an external table pointing to the same path but has no
>> `edition` partition,
>>
>> ```
>> CREATE TABLE en_article (
>> id BIGINT,
>> edition STRING,
>> dt STRING,
>> hh STRING
>> )
>> PARTITIONED BY (edition, dt, hh)
>> LOCATION 's3://xxx/article/edition=en_US'
>> USING orc;
>> ```
>>
>> And insert with the following statement:
>>
>> ```
>> INSERT INTO kafka_sink
>> SELECT id
>> FROM en_article /*+ OPTIONS('streaming-source.enable' = 'true',
>> 'streaming-source.partition-order' = 'partition-name',
>> 'streaming-source.consume-start-offset' = 'dt=2024-03-26/hh=00') */
>> ```
>>
>> The data is sinked.
>>
>>
>


Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread gongzhongqiang
+1(non-binding)

Best,

Zhongqiang Gong

Jinzhong Li  于2024年3月27日周三 19:31写道:

> Hi devs,
>
>
> I'd like to start a vote on the FLIP-428: Fault Tolerance/Rescale
> Integration for Disaggregated State [1]. The discussion thread is here [2].
>
>
> The vote will be open for at least 72 hours unless there is an objection or
> insufficient votes.
>
> [1] https://cwiki.apache.org/confluence/x/UYp3EQ
>
> [2] https://lists.apache.org/thread/vr8f91p715ct4lop6b3nr0fh4z5p312b
>
> Best,
>
> Jinzhong
>


Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Jeyhun Karimov
+1 (non-binding)


Regards,
Jeyhun

On Wed, Mar 27, 2024 at 12:31 PM Jinzhong Li 
wrote:

> Hi devs,
>
>
> I'd like to start a vote on the FLIP-428: Fault Tolerance/Rescale
> Integration for Disaggregated State [1]. The discussion thread is here [2].
>
>
> The vote will be open for at least 72 hours unless there is an objection or
> insufficient votes.
>
> [1] https://cwiki.apache.org/confluence/x/UYp3EQ
>
> [2] https://lists.apache.org/thread/vr8f91p715ct4lop6b3nr0fh4z5p312b
>
> Best,
>
> Jinzhong
>


[jira] [Created] (FLINK-34972) MigrationTestsSnapshotGenerator does not work properly for all subclasses of SnapshotMigrationTestBase

2024-03-31 Thread lincoln lee (Jira)
lincoln lee created FLINK-34972:
---

 Summary: MigrationTestsSnapshotGenerator does not work properly 
for all subclasses of SnapshotMigrationTestBase
 Key: FLINK-34972
 URL: https://issues.apache.org/jira/browse/FLINK-34972
 Project: Flink
  Issue Type: Bug
Affects Versions: 1.19.0
Reporter: lincoln lee


This issue was found on the followup work of 1.19.0 releasing 
https://issues.apache.org/jira/browse/FLINK-34712.

{code}MigrationTestsSnapshotGenerator{code} actually didn't generate the 
corresponding testing files for the new version(1.19) without any abnormal 
logs. And only appears in all subclasses of 
{code}SnapshotMigrationTestBase{code}.

more context: https://github.com/apache/flink/pull/24517



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[RESULT][VOTE] FLIP-425: Asynchronous Execution Model

2024-03-31 Thread Yanfei Lei
I'm happy to announce that FLIP-425: Asynchronous Execution Model[1]
has been accepted with 10 approving votes (6 binding) [2]:

- Xuannan Su (non-binding)
- Yuan Mei (binding)
- Piotr Nowojski (binding)
- Feifan Wang (non-binding)
- Jing Ge (binding)
- Rui Fan (binding)
- Yunfeng Zhou (non-binding)
- Xintong Song (binding)
- Yue Ma (non-binding)
- Roman Khachatryan (binding)

There are no disapproving votes. Thanks to everyone who participated
in the discussion and voting.

[1] https://cwiki.apache.org/confluence/x/S4p3EQ
[2] https://lists.apache.org/thread/drglfqv83z5n4hybdjn0lvcc3xjkrtk0


Best regards,
Yanfei


Re: [VOTE] FLIP-428: Fault Tolerance/Rescale Integration for Disaggregated State

2024-03-31 Thread Yanfei Lei
+1 (binding)

Best,
Yanfei

yue ma  于2024年3月29日周五 17:29写道:
>
> +1(non-binding)
>
> Jinzhong Li  于2024年3月27日周三 19:31写道:
>
> > Hi devs,
> >
> >
> > I'd like to start a vote on the FLIP-428: Fault Tolerance/Rescale
> > Integration for Disaggregated State [1]. The discussion thread is here [2].
> >
> >
> > The vote will be open for at least 72 hours unless there is an objection or
> > insufficient votes.
> >
> > [1] https://cwiki.apache.org/confluence/x/UYp3EQ
> >
> > [2] https://lists.apache.org/thread/vr8f91p715ct4lop6b3nr0fh4z5p312b
> >
> > Best,
> >
> > Jinzhong
> >
>
>
> --
> Best,
> Yue


Re: [VOTE] FLIP-426: Grouping Remote State Access

2024-03-31 Thread Yanfei Lei
+1 (binding)

Best,
Yanfei

yue ma  于2024年3月29日周五 16:10写道:
>
> +1 (non-binding)
>
> Jinzhong Li  于2024年3月27日周三 18:57写道:
>
> > Hi devs,
> >
> > I'd like to start a vote on the FLIP-426: Grouping Remote State Access [1].
> > The discussion thread is here [2].
> >
> > The vote will be open for at least 72 hours unless there is an objection or
> > insufficient votes.
> >
> >
> > [1] https://cwiki.apache.org/confluence/x/TYp3EQ
> >
> > [2] https://lists.apache.org/thread/bt931focfl9971cwq194trmf3pkdsxrf
> >
> >
> > Best,
> >
> > Jinzhong
> >
>
>
> --
> Best,
> Yue


[jira] [Created] (FLINK-34971) Add the function of History Server cleaning job archives regularly

2024-03-31 Thread dengxaing (Jira)
dengxaing created FLINK-34971:
-

 Summary: Add the function of History Server cleaning job archives 
regularly
 Key: FLINK-34971
 URL: https://issues.apache.org/jira/browse/FLINK-34971
 Project: Flink
  Issue Type: New Feature
Reporter: dengxaing


Currently, flink does not provide the ability to regularly clean the archives 
of completed jobs.  I think this ability is necessary.
When flink provides this ability, users can decide whether to use this ability. 
They also can define the time for regular cleaning.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)