Hi Dhruv,

One option is the  join order benchmark
<https://github.com/gregrahn/join-order-benchmark>  ; it has become very
popular in DB research over the past couple years and features many-many
joins. Another option is crafting many-many queries from graph datasets like
social media or travel networks.

Walter



--
Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Reply via email to