Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-31 Thread Hyukjin Kwon
Thanks all. I created a JIRA at https://issues.apache.org/jira/browse/SPARK-43907. On Mon, 29 May 2023 at 09:12, Hyukjin Kwon wrote: > Yes, some were cases like you mentioned. > But I found myself explaining that reason to a lot of people, not only > developers but users - I was asked in a

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-28 Thread Hyukjin Kwon
Yes, some were cases like you mentioned. But I found myself explaining that reason to a lot of people, not only developers but users - I was asked in a conference, email, slack, internally and externally. Then realised that maybe we're doing something wrong. This is based on my experience so I

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-26 Thread Maciej
Weren't some of these functions provided only for compatibility  and intentionally left out of the language APIs? -- Best regards, Maciej On 5/25/23 23:21, Hyukjin Kwon wrote: I don't think it'd be a release blocker .. I think we can implement them across multiple releases. On Fri, May 26,

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-25 Thread Hyukjin Kwon
I don't think it'd be a release blocker .. I think we can implement them across multiple releases. On Fri, May 26, 2023 at 1:01 AM Dongjoon Hyun wrote: > Thank you for the proposal. > > I'm wondering if we are going to consider them as release blockers or not. > > In general, I don't think

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-25 Thread Dongjoon Hyun
Thank you for the proposal. I'm wondering if we are going to consider them as release blockers or not. In general, I don't think those SQL functions should be available in all languages as release blockers. (Especially in R or new Spark Connect languages like Go and Rust). If they are not

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-24 Thread Jia Fan
+1 It is important that different APIs can be used to call the same function Ryan Berti 于2023年5月25日周四 01:48写道: > During my recent experience developing functions, I found that identifying > locations (sql + connect functions.scala + functions.py, FunctionRegistry, > + whatever is required for

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-24 Thread Ryan Berti
During my recent experience developing functions, I found that identifying locations (sql + connect functions.scala + functions.py, FunctionRegistry, + whatever is required for R) and standards for adding function signatures was not straight forward (should you use optional args or overload

Re: [DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-24 Thread Enrico Minack
+1 Functions available in SQL (more general in one API) should be available in all APIs. I am very much in favor of this. Enrico Am 24.05.23 um 09:41 schrieb Hyukjin Kwon: Hi all, I would like to discuss adding all SQL functions into Scala, Python and R API. We have SQL functions that

[DISCUSS] Add SQL functions into Scala, Python and R API

2023-05-24 Thread Hyukjin Kwon
Hi all, I would like to discuss adding all SQL functions into Scala, Python and R API. We have SQL functions that do not exist in Scala, Python and R around 175. For example, we don’t have pyspark.sql.functions.percentile but you can invoke it as a SQL function, e.g., SELECT percentile(...). The