[jira] [Commented] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448539#comment-17448539 ] Carlos Gameiro commented on SPARK-37449: Sometimes there is no natural way to group a dataframe

[jira] [Commented] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448500#comment-17448500 ] Carlos Gameiro commented on SPARK-37449: You are right. I'm selecting the first 4 indexes of

[jira] [Commented] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-23 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448321#comment-17448321 ] Hyukjin Kwon commented on SPARK-37449: {{applyInPandas}} does not maintain its index. Each pandas