[ https://issues.apache.org/jira/browse/SPARK-3292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14114829#comment-14114829 ]
Sean Owen commented on SPARK-3292: ---------------------------------- Can you elaborate this? it's not clear whether you're reporting that the process hangs, runs slowly, or creates too many files. > Shuffle Tasks run indefinitely even though there's no inputs > ------------------------------------------------------------ > > Key: SPARK-3292 > URL: https://issues.apache.org/jira/browse/SPARK-3292 > Project: Spark > Issue Type: Improvement > Components: Streaming > Affects Versions: 1.0.2 > Reporter: guowei > > such as repartition groupby join and cogroup > it's too expensive , for example if i want outputs save as hadoop file ,then > many emtpy file generate. -- This message was sent by Atlassian JIRA (v6.2#6252) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org