This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/datafusion.git
The following commit(s) were added to refs/heads/main by this push:
new 0b160c567f Cleanup user guide known users section (#17834)
0b160c567f is described below
commit 0b160c567ffdc6fca6d99822305fdb7976d0cc00
Author: Dmitrii Blaginin <[email protected]>
AuthorDate: Tue Sep 30 22:31:28 2025 +0100
Cleanup user guide known users section (#17834)
* Cleanup usages + Add Vortex
* Move to `less active projects`
* Sort
* fmt
---
docs/source/user-guide/introduction.md | 14 +++++++++-----
1 file changed, 9 insertions(+), 5 deletions(-)
diff --git a/docs/source/user-guide/introduction.md
b/docs/source/user-guide/introduction.md
index ef82de9a24..51f025d279 100644
--- a/docs/source/user-guide/introduction.md
+++ b/docs/source/user-guide/introduction.md
@@ -78,7 +78,7 @@ Here are some example systems built using DataFusion:
- Specialized Analytical Database systems such as [HoraeDB] and more general
Apache Spark like system such as [Ballista]
- New query language engines such as [prql-query] and accelerators such as
[VegaFusion]
- Research platform for new Database Systems, such as [Flock]
-- SQL support to another library, such as [dask sql]
+- SQL support to another library, such as [Vortex]
- Streaming data platforms such as [Synnada]
- Tools for reading / sorting / transcoding Parquet, CSV, AVRO, and JSON files
such as [qv]
- Native Spark runtime replacement such as [Auron]
@@ -101,11 +101,10 @@ Here are some active projects using DataFusion:
- [CnosDB] Open Source Distributed Time Series Database
- [Comet](https://github.com/apache/datafusion-comet) Apache Spark native
query execution plugin
- [Cube Store] Cube’s universal semantic layer platform is the next evolution
of OLAP technology for AI, BI, spreadsheets, and embedded analytics
-- [Dask SQL] Distributed SQL query engine in Python
- [datafusion-dft](https://github.com/datafusion-contrib/datafusion-dft)
Batteries included CLI, TUI, and server implementations for DataFusion.
- [dbt Fusion engine](https://github.com/dbt-labs/dbt-fusion) The dbt Fusion
engine, written in Rust, designed for speed and correctness with a native SQL
understanding across DWH SQL dialects.
- [delta-rs] Native Rust implementation of Delta Lake
-- [Exon](https://github.com/wheretrue/exon) Analysis toolkit for life-science
applications
+- [EDB Postgres Lakehouse] built with [Seafowl]
- [Feldera](https://github.com/feldera/feldera) Fast query engine for
incremental computation
- [Funnel](https://funnel.io/) Data Platform powering Marketing Intelligence
applications.
- [GlareDB](https://github.com/GlareDB/glaredb) Fast SQL database for querying
and analyzing distributed data.
@@ -125,12 +124,12 @@ Here are some active projects using DataFusion:
- [Restate](https://github.com/restatedev) Easily build resilient applications
using distributed durable async/await
- [ROAPI] Create full-fledged APIs for slowly moving datasets without writing
a single line of code
- [Sail](https://github.com/lakehq/sail) Unifying stream, batch and AI
workloads with Apache Spark compatibility
-- [Seafowl] CDN-friendly analytical database
- [SedonaDB](https://github.com/apache/sedona-db) A single-node analytical
database engine with geospatial as a first-class citizen
- [Sleeper](https://github.com/gchq/sleeper) Serverless, cloud-native,
log-structured merge tree based, scalable key-value store
- [Spice.ai] Building blocks for data-driven AI applications
- [Synnada] Streaming-first framework for data products
- [VegaFusion] Server-side acceleration for the
[Vega](https://vega.github.io/) visualization grammar
+- [Vortex] An extensible, state of the art columnar file format
- [Telemetry](https://telemetry.sh/) Structured logging made easy
- [Xorq](https://github.com/xorq-labs/xorq/) Xorq is a multi-engine batch
transformation framework built on Ibis, DataFusion and Arrow
@@ -138,6 +137,8 @@ Here are some less active projects that used DataFusion:
- [bdt](https://github.com/datafusion-contrib/bdt) Boring Data Tool
- [Cloudfuse Buzz]
+- [Dask SQL] Distributed SQL query engine in Python
+- [Exon] Analysis toolkit for life-science applications
- [Flock]
- [Tensorbase]
@@ -149,6 +150,8 @@ Here are some less active projects that used DataFusion:
[dask sql]: https://github.com/dask-contrib/dask-sql
[datafusion-tui]: https://github.com/datafusion-contrib/datafusion-tui
[delta-rs]: https://github.com/delta-io/delta-rs
+[edb postgres lakehouse]: https://www.enterprisedb.com/products/analytics
+[exon]: https://github.com/wheretrue/exon
[flock]: https://github.com/flock-lab/flock
[kamu]: https://github.com/kamu-data/kamu-cli
[greptimedb]: https://github.com/GreptimeTeam/greptimedb
@@ -163,7 +166,8 @@ Here are some less active projects that used DataFusion:
[spice.ai]: https://github.com/spiceai/spiceai
[synnada]: https://synnada.ai/
[tensorbase]: https://github.com/tensorbase/tensorbase
-[vegafusion]: https://vegafusion.io/ "if you know of another project, please
submit a PR to add a link!"
+[vegafusion]: https://vegafusion.io/
+[vortex]: https://vortex.dev/ "if you know of another project, please submit a
PR to add a link!"
## Integrations and Extensions
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]