Re: How to figure out what's the size of ListState?

2023-05-27 Thread Amir Hossein Sharifzadeh
Thank you Yun for your reply. Yes, I did the same thing and posted my
solution here:

https://stackoverflow.com/questions/76308686/how-to-figure-out-whats-the-size-of-liststate?noredirect=1#comment134577594_76308686

Best,
Amir

On Wed, May 24, 2023 at 5:13 AM Yun Tang  wrote:

> Hi Amir,
>
> In current Flink, you have to iterate over the Iterable returned by
> ListState#get().
>
> Why does Flink lack an API to get the size of a ListState directly? Because
> Flink leverages RocksDB's merge operator [1] for high-performance
> ListState#add [2]; that is, appending to the list state does not even read
> the existing data. I think you can record the size in a separate
> ValueState when implementing your logic.
>
>
> [1] https://github.com/facebook/rocksdb/wiki/Merge-Operator
> [2]
> https://github.com/apache/flink/blob/d8c64a808484cab78c8bd7b74a287edf7d1f3b01/flink-state-backends/flink-statebackend-rocksdb/src/main/java/org/apache/flink/contrib/streaming/state/RocksDBListState.java#L131
>
> Best
> Yun Tang
> 
> From: Amir Hossein Sharifzadeh 
> Sent: Tuesday, May 23, 2023 23:51
> To: dev 
> Subject: How to figure out what's the size of ListState?
>
> Dear Flink Dev team,
>
> I have been dealing with an issue for a while that I can't figure out
> myself. I have spent quite a lot of time trying to solve it, but
> I feel stuck.
>
> I explained the problem statement and the issue here:
>
> https://stackoverflow.com/questions/76308686/how-to-figure-out-whats-the-size-of-liststate
>
> I would really appreciate any suggestions.
>
> Best,
> Amir
>
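
Yun Tang's two suggestions in this thread (iterate over the Iterable returned
by ListState#get(), or track the size in a separate ValueState) can be
sketched in plain Java as follows. This is only an illustration under stated
assumptions: FakeListState and CountedListState are hypothetical stand-ins
written for this sketch, not Flink classes. In a real job the list would be a
Flink ListState and the counter would probably live in a ValueState<Long>.

```java
import java.util.ArrayList;
import java.util.List;

public class ListStateSizeSketch {

    /** Hypothetical stand-in that models only add() and get() of a list state. */
    static final class FakeListState<T> {
        private final List<T> elements = new ArrayList<>();

        void add(T value) {
            elements.add(value);
        }

        Iterable<T> get() {
            return elements;
        }
    }

    /** Option 1: count the elements by iterating over the returned Iterable. */
    static <T> long sizeByIteration(Iterable<T> elements) {
        long n = 0;
        for (T ignored : elements) {
            n++;
        }
        return n;
    }

    /** Option 2: keep a counter next to the list and update it on every add. */
    static final class CountedListState<T> {
        private final FakeListState<T> list = new FakeListState<>();
        // Assumption: in Flink this counter would be kept in a separate ValueState.
        private long count = 0;

        void add(T value) {
            list.add(value);
            count++;
        }

        long size() {
            return count;
        }

        Iterable<T> get() {
            return list.get();
        }
    }

    public static void main(String[] args) {
        CountedListState<String> state = new CountedListState<>();
        state.add("a");
        state.add("b");
        state.add("c");
        // Both approaches report the same size; only the cost differs.
        System.out.println(sizeByIteration(state.get()));
        System.out.println(state.size());
    }
}
```

Counting by iteration reads every element, while the separate counter costs
one extra state write per append and has to be reset whenever the list is
cleared.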


[ANNOUNCE] Apache flink-connector-cassandra 3.1.0 released

2023-05-27 Thread Danny Cranmer
The Apache Flink community is very happy to announce the release of Apache
flink-connector-cassandra 3.1.0.
This connector supports Flink 1.16 and 1.17.

Apache Flink® is an open-source stream processing framework for
distributed, high-performing, always-available, and accurate data streaming
applications.

The release is available for download at:
https://flink.apache.org/downloads.html

The full release notes are available in Jira:
https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12315522&version=12353030

We would like to thank all contributors of the Apache Flink community who
made this release possible!

Regards,
Danny


[jira] [Created] (FLINK-32210) Error import Pyflink.table.descriptors due to python3.10 version mismatch

2023-05-27 Thread Alireza Omidvar (Jira)
Alireza Omidvar created FLINK-32210:
---

 Summary: Error import Pyflink.table.descriptors due to python3.10 
version mismatch
 Key: FLINK-32210
 URL: https://issues.apache.org/jira/browse/FLINK-32210
 Project: Flink
  Issue Type: Bug
  Components: API / Python, Connectors / Kafka, Connectors / MongoDB
Affects Versions: 1.13.0
Reporter: Alireza Omidvar


Following up on https://issues.apache.org/jira/browse/FLINK-32207
(FLINK-32206), I decided to install the latest 1.13 version, where the Kafka
and Json imports work. This required creating a Python 3.8 environment, and
I ran into a few issues:

 

1. CommandNotFoundError: Your shell has not been properly configured to
use 'conda activate'. To initialize your shell, run $ conda init

Currently supported shells are: - bash - fish - tcsh - xonsh - zsh -
powershell

See 'conda init --help' for more information and options. IMPORTANT: You
may need to close and restart your shell after running 'conda init'.

2. I tried to initialize the shell, but then ran into a new error:

No Pyflink module found



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


Re: [DISCUSS] FLIP-313 Add support of User Defined AsyncTableFunction

2023-05-27 Thread Aitozi
Hi Jing,
Thanks for your response. As stated in the FLIP, its purpose is to support
user-defined async table functions. As described in the Flink
documentation [1]:

> Async table functions are special functions for table sources that perform
> a lookup.

So end users cannot directly define and use async table functions today. A
use case is reported in [2].

So, in conclusion, no new interface is introduced, but we extend the
existing ability to support user-defined async table functions.

[1]:
https://nightlies.apache.org/flink/flink-docs-release-1.17/docs/dev/table/functions/udfs/
[2]: https://lists.apache.org/thread/qljwd40v5ntz6733cwcdr8s4z97b343b

Thanks.
Aitozi.


On Sat, May 27, 2023 at 06:40, Jing Ge wrote:

> Hi Aitozi,
>
> Thanks for your proposal. I am not quite sure if I understood your thoughts
> correctly. You described a special case implementation of the
> AsyncTableFunction with no public API changes. Would you please elaborate
> on your purpose in writing a FLIP, according to the FLIP documentation [1]?
> Thanks!
>
> [1]
>
> https://cwiki.apache.org/confluence/display/FLINK/Flink+Improvement+Proposals
>
> Best regards,
> Jing
>
> On Wed, May 24, 2023 at 1:07 PM Aitozi  wrote:
>
> > May I ask for some feedback  :D
> >
> > Thanks,
> > Aitozi
> >
> > On Tue, May 23, 2023 at 19:14, Aitozi wrote:
> > >
> > > I just came across a use case report from Giannis Polyzos for this usage:
> > >
> > > https://lists.apache.org/thread/qljwd40v5ntz6733cwcdr8s4z97b343b
> > >
> > > On Tue, May 23, 2023 at 17:45, Aitozi wrote:
> > > >
> > > > Hi guys,
> > > > I want to bring up a discussion about adding support for user-defined
> > > > AsyncTableFunction in Flink.
> > > > Currently, async table functions are special functions for table
> > > > sources to perform async lookups. However, it is worth supporting
> > > > user-defined async table functions, because this lets the end SQL
> > > > user perform async operations, which helps maximize system
> > > > throughput, especially in IO-bound cases.
> > > >
> > > > You can find some more detail in [1].
> > > >
> > > > Looking forward to feedback
> > > >
> > > >
> > > > [1]:
> > > > https://cwiki.apache.org/confluence/display/FLINK/%5BFLIP-313%5D+Add+support+of+User+Defined+AsyncTableFunction
> > > >
> > > > Thanks,
> > > > Aitozi.
> >
>