Hi Dhruv, One option is the join order benchmark <https://github.com/gregrahn/join-order-benchmark> ; it has become very popular in DB research over the past couple years and features many-many joins. Another option is crafting many-many queries from graph datasets like social media or travel networks.
Walter -- Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ --------------------------------------------------------------------- To unsubscribe e-mail: dev-unsubscr...@spark.apache.org