[jira] [Commented] (SPARK-34078) Provide async variants for Dataset APIs

2021-01-19 Thread Yesheng Ma (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17268201#comment-17268201
 ] 

Yesheng Ma commented on SPARK-34078:


Thanks! I'm looking into this and will prepare a diff shortly.

> Provide async variants for Dataset APIs
> ---
>
> Key: SPARK-34078
> URL: https://issues.apache.org/jira/browse/SPARK-34078
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 3.0.1
> Reporter: Yesheng Ma
> Priority: Major
>
> Spark RDDs have async variants such as `collectAsync`, which come in handy 
> when we want to cancel a job. However, the Dataset API lacks such variants, 
> which makes it very painful to cancel a Dataset/SQL job.
>
> The proposed change is to add async variants so that we can cancel a 
> Dataset/SQL query programmatically via a future.
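
To make the idea in the description concrete, here is a minimal, Spark-free
sketch of what an async variant with a cancellable handle could look like.
Only `collectAsync` on RDDs comes from the issue text. The names
`collect_async`, `AsyncAction`, and `JobCancelled` are hypothetical and
invented for illustration, and the cooperative check between partitions is an
assumption, not a description of how Spark schedules or cancels jobs.

```python
import threading
import time


class JobCancelled(Exception):
    """Raised by result() when the action was cancelled before finishing."""


class AsyncAction:
    """Hypothetical handle returned by an async variant (not a Spark class)."""

    def __init__(self, work):
        self._cancelled = threading.Event()
        self._result = None
        self._error = None
        # Run the blocking work on a background thread so the caller gets
        # the handle back immediately.
        self._thread = threading.Thread(
            target=self._run, args=(work,), daemon=True
        )
        self._thread.start()

    def _run(self, work):
        try:
            self._result = work(self._cancelled)
        except Exception as exc:
            self._error = exc

    def cancel(self):
        """Request cancellation; the work stops at its next check."""
        self._cancelled.set()

    def result(self, timeout=None):
        """Wait for the work to finish, then return its value or re-raise."""
        self._thread.join(timeout)
        if self._thread.is_alive():
            raise TimeoutError("action still running")
        if self._error is not None:
            raise self._error
        return self._result


def collect_async(partitions):
    """Hypothetical async counterpart of a blocking collect()."""

    def work(cancelled):
        rows = []
        for part in partitions:
            # Assumption: cancellation is checked between partitions.
            if cancelled.is_set():
                raise JobCancelled("cancelled between partitions")
            time.sleep(0.01)  # stand-in for per-partition computation
            rows.extend(part)
        return rows

    return AsyncAction(work)


if __name__ == "__main__":
    slow = collect_async([[i] for i in range(1000)])
    slow.cancel()  # the caller can cancel through the handle it holds
    try:
        slow.result(timeout=5)
    except JobCancelled as exc:
        print("cancelled:", exc)
```

The point of the sketch is the shape of the API: the caller holds a handle
and cancels through it directly, instead of having to find the running job
by some other route.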



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34078) Provide async variants for Dataset APIs

2021-01-18 Thread Wenchen Fan (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17267260#comment-17267260
 ] 

Wenchen Fan commented on SPARK-34078:
-

I think it's a good idea and we can probably follow some streaming APIs. It 
will also be interesting to think about how this can be done in SQL.




[jira] [Commented] (SPARK-34078) Provide async variants for Dataset APIs

2021-01-11 Thread Xiao Li (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17263044#comment-17263044
 ] 

Xiao Li commented on SPARK-34078:
-

A sample PR? Or a lightweight design doc?




[jira] [Commented] (SPARK-34078) Provide async variants for Dataset APIs

2021-01-11 Thread Yesheng Ma (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17262984#comment-17262984
 ] 

Yesheng Ma commented on SPARK-34078:


[~cloud_fan] [~smilegator] Could you shed some light on this as I'm preparing a 
draft diff?
