alippai opened a new issue #451:
URL: https://github.com/apache/arrow-datafusion/issues/451


   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   Recently I came across the LDBC benchmarks, which focus on graph-like 
workloads. I'm wondering whether DataFusion already covers the features the 
queries need. While I don't think this is as important as TPC-H, it would 
increase coverage and help identify performance regressions during DataFusion 
development. It would be an extra tool for getting a broader picture in a 
structured way (at least more structured than ad-hoc queries).
   
   **Describe the solution you'd like**
   Supporting the queries written for PostgreSQL: 
https://github.com/ldbc/ldbc_snb_bi/tree/main/postgres/queries .
   
   **Describe alternatives you've considered**
   Not implementing it. Optimizing DataFusion to perform well on this 
particular benchmark is out of scope as well. My assumption is that OLAP 
should be the first-class target and this a second-class one.
   
   **Additional context**
   While it's not an OLAP workload, I believe DataFusion would perform 
reasonably well, possibly even extremely well.
   
   Cc @Dandandan: IIRC you contributed the most (CTE + UNION ALL) in this area.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

