[ https://issues.apache.org/jira/browse/SPARK-18791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981383#comment-15981383 ]
Saul Shanabrook commented on SPARK-18791:
-----------------------------------------

I am using Spark to process the results from genetic programming experiments. One dataframe (from a directory of Parquet files) has a row for each experiment, holding the configuration. Another dataframe has one row for each "generation" of each experiment. I want to join these together and write out a dataframe that has one row per experiment, where one column contains an array of all the generations for each experiment.

> Stream-Stream Joins
> -------------------
>
>                 Key: SPARK-18791
>                 URL: https://issues.apache.org/jira/browse/SPARK-18791
>             Project: Spark
>          Issue Type: New Feature
>          Components: Structured Streaming
>            Reporter: Michael Armbrust
>
> Just a placeholder for now. Please comment with your requirements.
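
The output shape requested in the comment (one row per experiment, with one column holding an array of that experiment's generations) can be illustrated with a minimal sketch. This is plain Python over lists of dicts, not Spark code, and it says nothing about how a stream-stream join would be implemented. The field names (experiment_id, population_size, generation, best_fitness) and the helper nest_generations are hypothetical and chosen only for illustration. In batch Spark the same result would roughly correspond to a join followed by a group-by with collect_list.

```python
from collections import defaultdict


def nest_generations(experiments, generations, key="experiment_id"):
    """Return one record per experiment, with its generations nested in a list.

    `experiments` and `generations` are lists of dicts that share `key`.
    All field names here are hypothetical, not taken from the original data.
    """
    # Group the generation rows by the experiment they belong to.
    by_experiment = defaultdict(list)
    for gen in generations:
        by_experiment[gen[key]].append(gen)

    # Attach each group to its experiment row. An experiment with no
    # generations gets an empty list, as in a left outer join.
    return [
        {**exp, "generations": by_experiment.get(exp[key], [])}
        for exp in experiments
    ]


# Hypothetical sample data: two experiments, three generation rows.
experiments = [
    {"experiment_id": 1, "population_size": 100},
    {"experiment_id": 2, "population_size": 500},
]
generations = [
    {"experiment_id": 1, "generation": 0, "best_fitness": 0.25},
    {"experiment_id": 1, "generation": 1, "best_fitness": 0.5},
    {"experiment_id": 2, "generation": 0, "best_fitness": 0.125},
]

result = nest_generations(experiments, generations)
```

Keeping experiments that have no generations yet is a design choice in this sketch. In a streaming setting, deciding when an experiment's array is complete is the harder part of the requirement, and the sketch does not address it.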