This is an automated email from the ASF dual-hosted git repository.
github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datafusion.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 02974b65b3 Publish built docs triggered by
087174102593d20f8eeac425704529f5ba4e0b06
02974b65b3 is described below
commit 02974b65b39cc022dff60e772448fdfef9964487
Author: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
AuthorDate: Tue Nov 25 21:33:28 2025 +0000
Publish built docs triggered by 087174102593d20f8eeac425704529f5ba4e0b06
---
_sources/user-guide/introduction.md.txt | 3 +++
searchindex.js | 2 +-
user-guide/introduction.html | 3 +++
3 files changed, 7 insertions(+), 1 deletion(-)
diff --git a/_sources/user-guide/introduction.md.txt
b/_sources/user-guide/introduction.md.txt
index 778562d55f..66076e6b73 100644
--- a/_sources/user-guide/introduction.md.txt
+++ b/_sources/user-guide/introduction.md.txt
@@ -82,6 +82,7 @@ Here are some example systems built using DataFusion:
- Streaming data platforms such as [Synnada]
- Tools for reading / sorting / transcoding Parquet, CSV, AVRO, and JSON files
such as [qv]
- Native Spark runtime replacement such as [Auron]
+- Distributed data cache to boost GPU utilization of AI workloads with
[Kubeflow
Trainer](https://www.kubeflow.org/docs/components/trainer/user-guides/data-cache/)
By using DataFusion, projects are freed to focus on their specific
features, and avoid reimplementing general (but still necessary)
@@ -114,6 +115,8 @@ Here are some active projects using DataFusion:
- [Iceberg-rust](https://github.com/apache/iceberg-rust) Rust implementation
of Apache Iceberg
- [InfluxDB] Time Series Database
- [Kamu] Planet-scale streaming data pipeline
+- [Kubeflow Trainer](https://github.com/kubeflow/trainer) Kubernetes-native
project designed for
+ scalable LLMs fine-tuning and distributed AI model training.
- [LakeSoul](https://github.com/lakesoul-io/LakeSoul) Open source LakeHouse
framework with native IO in Rust.
- [Lance](https://github.com/lancedb/lance) Modern columnar data format for ML
- [OpenObserve] Distributed cloud native observability platform
diff --git a/searchindex.js b/searchindex.js
index f030421317..39b17f64a6 100644
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"alltitles":{"!=":[[60,"op-neq"]],"!~":[[60,"op-re-not-match"]],"!~*":[[60,"op-re-not-match-i"]],"!~~":[[60,"id19"]],"!~~*":[[60,"id20"]],"#":[[60,"op-bit-xor"]],"%":[[60,"op-modulo"]],"&":[[60,"op-bit-and"]],"(relation,
name) tuples in logical fields and logical columns are
unique":[[13,"relation-name-tuples-in-logical-fields-and-logical-columns-are-unique"]],"*":[[60,"op-multiply"]],"+":[[60,"op-plus"]],"-":[[60,"op-minus"]],"/":[[60,"op-divide"]],"<":[[60,"op-lt"]],"<
[...]
\ No newline at end of file
+Search.setIndex({"alltitles":{"!=":[[60,"op-neq"]],"!~":[[60,"op-re-not-match"]],"!~*":[[60,"op-re-not-match-i"]],"!~~":[[60,"id19"]],"!~~*":[[60,"id20"]],"#":[[60,"op-bit-xor"]],"%":[[60,"op-modulo"]],"&":[[60,"op-bit-and"]],"(relation,
name) tuples in logical fields and logical columns are
unique":[[13,"relation-name-tuples-in-logical-fields-and-logical-columns-are-unique"]],"*":[[60,"op-multiply"]],"+":[[60,"op-plus"]],"-":[[60,"op-minus"]],"/":[[60,"op-divide"]],"<":[[60,"op-lt"]],"<
[...]
\ No newline at end of file
diff --git a/user-guide/introduction.html b/user-guide/introduction.html
index c3876cc079..5bcf1693c2 100644
--- a/user-guide/introduction.html
+++ b/user-guide/introduction.html
@@ -462,6 +462,7 @@ latency).</p>
<li><p>Streaming data platforms such as <a class="reference external"
href="https://synnada.ai/">Synnada</a></p></li>
<li><p>Tools for reading / sorting / transcoding Parquet, CSV, AVRO, and JSON
files such as <a class="reference external"
href="https://github.com/timvw/qv">qv</a></p></li>
<li><p>Native Spark runtime replacement such as <a class="reference external"
href="https://github.com/apache/auron">Auron</a></p></li>
+<li><p>Distributed data cache to boost GPU utilization of AI workloads with <a
class="reference external"
href="https://www.kubeflow.org/docs/components/trainer/user-guides/data-cache/">Kubeflow
Trainer</a></p></li>
</ul>
<p>By using DataFusion, projects are freed to focus on their specific
features, and avoid reimplementing general (but still necessary)
@@ -493,6 +494,8 @@ parallelized streaming execution plans, file format
support, etc.</p>
<li><p><a class="reference external"
href="https://github.com/apache/iceberg-rust">Iceberg-rust</a> Rust
implementation of Apache Iceberg</p></li>
<li><p><a class="reference external"
href="https://github.com/influxdata/influxdb">InfluxDB</a> Time Series
Database</p></li>
<li><p><a class="reference external"
href="https://github.com/kamu-data/kamu-cli">Kamu</a> Planet-scale streaming
data pipeline</p></li>
+<li><p><a class="reference external"
href="https://github.com/kubeflow/trainer">Kubeflow Trainer</a>
Kubernetes-native project designed for
+scalable LLMs fine-tuning and distributed AI model training.</p></li>
<li><p><a class="reference external"
href="https://github.com/lakesoul-io/LakeSoul">LakeSoul</a> Open source
LakeHouse framework with native IO in Rust.</p></li>
<li><p><a class="reference external"
href="https://github.com/lancedb/lance">Lance</a> Modern columnar data format
for ML</p></li>
<li><p><a class="reference external"
href="https://github.com/openobserve/openobserve">OpenObserve</a> Distributed
cloud native observability platform</p></li>
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]