Re: Benchmarks for Many-to-Many Joins

2021-04-21 Thread waltercai
Hi Dhruv,

One option is the  join order benchmark
  ; it has become very
popular in DB research over the past couple years and features many-many
joins. Another option is crafting many-many queries from graph datasets like
social media or travel networks.

Walter



--
Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

-
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org



Benchmarks for Many-to-Many Joins

2021-04-21 Thread Dhruv Kumar
Hi

I wanted to ask if anyone knows any datasets or benchmarks which I can use for 
evaluating many-to-many joins (as depicted in the attached snapshot). I looked 
at TPC-H  and TPC-DS  
benchmarks but surprisingly, they mostly have one-to-many joins and I could not 
get much help there.





Thanks
Dhruv

--
Dhruv Kumar
PhD Candidate
Computer Science and Engineering
University of Minnesota
www.dhruvkumar.me