Re: [VOTE][SPIP] PySpark Test Framework

2023-06-24 Thread Yikun Jiang
+1 Regards, Yikun On Fri, Jun 23, 2023 at 6:17 AM L. C. Hsieh wrote: > +1 > > On Thu, Jun 22, 2023 at 3:10 PM Xinrong Meng wrote: > > > > +1 > > > > Thanks for driving that! > > > > On Wed, Jun 21, 2023 at 10:25 PM Ruifeng Zheng > wrote: > >> > >> +1 > >> > >> On Thu, Jun 22, 2023 at 1:11 

Re: [ANNOUNCE] Apache Spark 3.4.1 released

2023-06-24 Thread yangjie01
Thanks Dongjoon ~ 在 2023/6/24 10:29,“L. C. Hsieh”mailto:vii...@gmail.com>> 写入: Thanks Dongjoon! On Fri, Jun 23, 2023 at 7:10 PM Hyukjin Kwon mailto:gurwls...@apache.org>> wrote: > > Thanks! > > On Sat, Jun 24, 2023 at 11:01 AM Mridul Muralidharan > wrote: >> >> >>

Re:[ANNOUNCE] Apache Spark 3.4.1 released

2023-06-24 Thread beliefer
Thanks! Dongjoon Hyun. Congratulation too! At 2023-06-24 07:57:05, "Dongjoon Hyun" wrote: We are happy to announce the availability of Apache Spark 3.4.1! Spark 3.4.1 is a maintenance release containing stability fixes. This release is based on the branch-3.4 maintenance branch of Spark.

Re: [DISCUSS] SPIP: Python Data Source API

2023-06-24 Thread Martin Grund
Hey, I would like to express my strong support for Python Data Sources even though they might not be immediately as powerful as Scala-based data sources. One element that is easily lost in this discussion is how much faster the iteration speed is with Python compared to Scala. Due to the dynamic

Re: [DISCUSS] SPIP: Python Data Source API

2023-06-24 Thread Maciej
With such limited scope (both language availability and features) do we have any representative examples of sources that could significantly benefit from providing this API,  compared other available options, such as batch imports, direct queries from vectorized  UDFs or even interfacing