(datafusion) branch asf-site updated: Publish built docs triggered by 087174102593d20f8eeac425704529f5ba4e0b06

github-bot Tue, 25 Nov 2025 13:33:40 -0800

This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datafusion.git



The following commit(s) were added to refs/heads/asf-site by this push:
     new 02974b65b3 Publish built docs triggered by 
087174102593d20f8eeac425704529f5ba4e0b06
02974b65b3 is described below

commit 02974b65b39cc022dff60e772448fdfef9964487
Author: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
AuthorDate: Tue Nov 25 21:33:28 2025 +0000

    Publish built docs triggered by 087174102593d20f8eeac425704529f5ba4e0b06
---
 _sources/user-guide/introduction.md.txt | 3 +++
 searchindex.js                          | 2 +-
 user-guide/introduction.html            | 3 +++
 3 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/_sources/user-guide/introduction.md.txt 
b/_sources/user-guide/introduction.md.txt
index 778562d55f..66076e6b73 100644
--- a/_sources/user-guide/introduction.md.txt
+++ b/_sources/user-guide/introduction.md.txt
@@ -82,6 +82,7 @@ Here are some example systems built using DataFusion:
 - Streaming data platforms such as [Synnada]
 - Tools for reading / sorting / transcoding Parquet, CSV, AVRO, and JSON files 
such as [qv]
 - Native Spark runtime replacement such as [Auron]
+- Distributed data cache to boost GPU utilization of AI workloads with 
[Kubeflow 
Trainer](https://www.kubeflow.org/docs/components/trainer/user-guides/data-cache/)
 
 By using DataFusion, projects are freed to focus on their specific
 features, and avoid reimplementing general (but still necessary)
@@ -114,6 +115,8 @@ Here are some active projects using DataFusion:
 - [Iceberg-rust](https://github.com/apache/iceberg-rust) Rust implementation 
of Apache Iceberg
 - [InfluxDB] Time Series Database
 - [Kamu] Planet-scale streaming data pipeline
+- [Kubeflow Trainer](https://github.com/kubeflow/trainer) Kubernetes-native 
project designed for
+  scalable LLMs fine-tuning and distributed AI model training.
 - [LakeSoul](https://github.com/lakesoul-io/LakeSoul) Open source LakeHouse 
framework with native IO in Rust.
 - [Lance](https://github.com/lancedb/lance) Modern columnar data format for ML
 - [OpenObserve] Distributed cloud native observability platform
diff --git a/searchindex.js b/searchindex.js
index f030421317..39b17f64a6 100644
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"alltitles":{"!=":[[60,"op-neq"]],"!~":[[60,"op-re-not-match"]],"!~*":[[60,"op-re-not-match-i"]],"!~~":[[60,"id19"]],"!~~*":[[60,"id20"]],"#":[[60,"op-bit-xor"]],"%":[[60,"op-modulo"]],"&":[[60,"op-bit-and"]],"(relation,
 name) tuples in logical fields and logical columns are 
unique":[[13,"relation-name-tuples-in-logical-fields-and-logical-columns-are-unique"]],"*":[[60,"op-multiply"]],"+":[[60,"op-plus"]],"-":[[60,"op-minus"]],"/":[[60,"op-divide"]],"<":[[60,"op-lt"]],"<
 [...]
\ No newline at end of file
+Search.setIndex({"alltitles":{"!=":[[60,"op-neq"]],"!~":[[60,"op-re-not-match"]],"!~*":[[60,"op-re-not-match-i"]],"!~~":[[60,"id19"]],"!~~*":[[60,"id20"]],"#":[[60,"op-bit-xor"]],"%":[[60,"op-modulo"]],"&":[[60,"op-bit-and"]],"(relation,
 name) tuples in logical fields and logical columns are 
unique":[[13,"relation-name-tuples-in-logical-fields-and-logical-columns-are-unique"]],"*":[[60,"op-multiply"]],"+":[[60,"op-plus"]],"-":[[60,"op-minus"]],"/":[[60,"op-divide"]],"<":[[60,"op-lt"]],"<
 [...]
\ No newline at end of file
diff --git a/user-guide/introduction.html b/user-guide/introduction.html
index c3876cc079..5bcf1693c2 100644
--- a/user-guide/introduction.html
+++ b/user-guide/introduction.html
@@ -462,6 +462,7 @@ latency).</p>
 <li><p>Streaming data platforms such as <a class="reference external" 
href="https://synnada.ai/">Synnada</a></p></li>
 <li><p>Tools for reading / sorting / transcoding Parquet, CSV, AVRO, and JSON 
files such as <a class="reference external" 
href="https://github.com/timvw/qv">qv</a></p></li>
 <li><p>Native Spark runtime replacement such as <a class="reference external" 
href="https://github.com/apache/auron">Auron</a></p></li>
+<li><p>Distributed data cache to boost GPU utilization of AI workloads with <a 
class="reference external" 
href="https://www.kubeflow.org/docs/components/trainer/user-guides/data-cache/">Kubeflow
 Trainer</a></p></li>
 </ul>
 <p>By using DataFusion, projects are freed to focus on their specific
 features, and avoid reimplementing general (but still necessary)
@@ -493,6 +494,8 @@ parallelized streaming execution plans, file format 
support, etc.</p>
 <li><p><a class="reference external" 
href="https://github.com/apache/iceberg-rust">Iceberg-rust</a> Rust 
implementation of Apache Iceberg</p></li>
 <li><p><a class="reference external" 
href="https://github.com/influxdata/influxdb">InfluxDB</a> Time Series 
Database</p></li>
 <li><p><a class="reference external" 
href="https://github.com/kamu-data/kamu-cli">Kamu</a> Planet-scale streaming 
data pipeline</p></li>
+<li><p><a class="reference external" 
href="https://github.com/kubeflow/trainer">Kubeflow Trainer</a> 
Kubernetes-native project designed for
+scalable LLMs fine-tuning and distributed AI model training.</p></li>
 <li><p><a class="reference external" 
href="https://github.com/lakesoul-io/LakeSoul">LakeSoul</a> Open source 
LakeHouse framework with native IO in Rust.</p></li>
 <li><p><a class="reference external" 
href="https://github.com/lancedb/lance">Lance</a> Modern columnar data format 
for ML</p></li>
 <li><p><a class="reference external" 
href="https://github.com/openobserve/openobserve">OpenObserve</a> Distributed 
cloud native observability platform</p></li>


