This is an automated email from the ASF dual-hosted git repository.

github-bot pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/datafusion-site.git


The following commit(s) were added to refs/heads/asf-site by this push:
     new 57cde16  Commit build products
57cde16 is described below

commit 57cde16983d8a7c31b46375ff5163f2e6a232060
Author: Build Pelican (action) <[email protected]>
AuthorDate: Thu Oct 16 20:42:54 2025 +0000

    Commit build products
---
 .../2024/09/13/string-view-german-style-strings-part-1/index.html   | 4 ++--
 output/author/xiangpeng-hao-andrew-lamb.html                        | 2 +-
 output/category/blog.html                                           | 2 +-
 output/feed.xml                                                     | 2 +-
 output/feeds/all-en.atom.xml                                        | 6 +++---
 output/feeds/blog.atom.xml                                          | 6 +++---
 output/feeds/xiangpeng-hao-andrew-lamb.atom.xml                     | 6 +++---
 output/feeds/xiangpeng-hao-andrew-lamb.rss.xml                      | 2 +-
 output/index.html                                                   | 2 +-
 9 files changed, 16 insertions(+), 16 deletions(-)

diff --git 
a/output/2024/09/13/string-view-german-style-strings-part-1/index.html 
b/output/2024/09/13/string-view-german-style-strings-part-1/index.html
index 90389bc..3c1d770 100644
--- a/output/2024/09/13/string-view-german-style-strings-part-1/index.html
+++ b/output/2024/09/13/string-view-german-style-strings-part-1/index.html
@@ -79,7 +79,7 @@ limitations under the License.
 {% endcomment %}
 -->
 
-<p><em>Editor's Note: This is the first of a <a 
href="../2024/09/13/string-view-german-style-strings-part-2/">two part</a> blog 
series that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
+<p><em>Editor's Note: This is the first of a <a 
href="../string-view-german-style-strings-part-2/">two part</a> blog series 
that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
 <p>This blog describes our experience implementing <a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout">StringView</a>
 in the <a href="https://github.com/apache/arrow-rs">Rust implementation</a> of 
<a href="https://arrow.apache.org/">Apache Arrow</a>, and integrating it into 
<a href="https://datafusion.apache.org/">Apache DataFusion</a>, significantly 
accelerating string-intensive queries in the <a 
href="https://benchmark.clickhouse.com/">ClickBen [...]
 <p>Getting significant end-to-end performance improvements was non-trivial. 
Implementing StringView itself was only a fraction of the effort required. 
Among other things, we had to optimize UTF-8 validation, implement unintuitive 
compiler optimizations, tune block sizes, and time GC to realize the <a 
href="https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/">FDAP
 ecosystem</a>’s benefit. With other members of the open source community, we 
were able [...]
 <p>StringView is based on a simple idea: avoid some string copies and 
accelerate comparisons with inlined prefixes. Like most great ideas, it is 
“obvious” only after <a 
href="https://db.in.tum.de/~freitag/papers/p29-neumann-cidr20.pdf">someone 
describes it clearly</a>. Although simple, straightforward implementation 
actually <em>slows down performance for almost every query</em>. We must, 
therefore, apply astute observations and diligent engineering to realize the 
actual benefits from St [...]
@@ -164,7 +164,7 @@ StringViewArray accelerates string-intensive operations via 
prefix inlining and
 <h2 id="conclusion">Conclusion<a class="headerlink" href="#conclusion" 
title="Permanent link">¶</a></h2>
 <p>In this first blog post, we have described what it took to improve the
 performance of simply reading strings from Parquet files using StringView. 
While
-this resulted in real end-to-end query performance improvements, in our <a 
href="https://datafusion.apache.org/blog/2024/09/13/using-stringview-to-make-queries-faster-part-2.html">next
+this resulted in real end-to-end query performance improvements, in our <a 
href="https://datafusion.apache.org/blog/2024/09/13//string-view-german-style-strings-part-2">next
 post</a>, we explore additional optimizations enabled by StringView in 
DataFusion,
 along with some of the pitfalls we encountered while implementing them.</p>
 <h1 id="footnotes">Footnotes<a class="headerlink" href="#footnotes" 
title="Permanent link">¶</a></h1>
diff --git a/output/author/xiangpeng-hao-andrew-lamb.html 
b/output/author/xiangpeng-hao-andrew-lamb.html
index 1a7c429..e45bc05 100644
--- a/output/author/xiangpeng-hao-andrew-lamb.html
+++ b/output/author/xiangpeng-hao-andrew-lamb.html
@@ -47,7 +47,7 @@ limitations under the License.
 {% endcomment %}
 -->
 
-<p><em>Editor's Note: This is the first of a <a 
href="../2024/09/13/string-view-german-style-strings-part-2/">two part</a> blog 
series that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
+<p><em>Editor's Note: This is the first of a <a 
href="../string-view-german-style-strings-part-2/">two part</a> blog series 
that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
 <p>This blog describes our experience implementing <a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout">StringView</a>
 in the <a href="https://github.com/apache/arrow-rs">Rust implementation</a> of 
<a href="https://arrow.apache.org/">Apache Arrow</a>, and integrating …</p> 
</div><!-- /.entry-content -->
         </article></li>
         <li><article class="hentry">
diff --git a/output/category/blog.html b/output/category/blog.html
index ae06c8c..45fc70a 100644
--- a/output/category/blog.html
+++ b/output/category/blog.html
@@ -1098,7 +1098,7 @@ limitations under the License.
 {% endcomment %}
 -->
 
-<p><em>Editor's Note: This is the first of a <a 
href="../2024/09/13/string-view-german-style-strings-part-2/">two part</a> blog 
series that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
+<p><em>Editor's Note: This is the first of a <a 
href="../string-view-german-style-strings-part-2/">two part</a> blog series 
that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
 <p>This blog describes our experience implementing <a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout">StringView</a>
 in the <a href="https://github.com/apache/arrow-rs">Rust implementation</a> of 
<a href="https://arrow.apache.org/">Apache Arrow</a>, and integrating …</p> 
</div><!-- /.entry-content -->
         </article></li>
         <li><article class="hentry">
diff --git a/output/feed.xml b/output/feed.xml
index 79c26fa..43b2d08 100644
--- a/output/feed.xml
+++ b/output/feed.xml
@@ -758,7 +758,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating …&lt;/p&gt;</description><dc:creator 
xmlns:dc="http://purl.org/dc/elements/1.1/">Xiangpeng Hao, Andrew 
Lamb</dc:creator><pubDate>Fri, 13 Sep  [...]
 {% comment %}
 Licensed to the Apache Software Foundation (ASF) under one or more
diff --git a/output/feeds/all-en.atom.xml b/output/feeds/all-en.atom.xml
index 1596292..5e381b8 100644
--- a/output/feeds/all-en.atom.xml
+++ b/output/feeds/all-en.atom.xml
@@ -7983,7 +7983,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating …&lt;/p&gt;</summary><content 
type="html">&lt;!--
 {% comment %}
 Licensed to the Apache Software Foundation (ASF) under one or more
@@ -8003,7 +8003,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating it into &lt;a 
href="https://datafusion.apache.org/"&gt;Apache DataFusion&lt;/a&gt;, 
significantly accelerating string-intensive queries in th [...]
 &lt;p&gt;Getting significant end-to-end performance improvements was 
non-trivial. Implementing StringView itself was only a fraction of the effort 
required. Among other things, we had to optimize UTF-8 validation, implement 
unintuitive compiler optimizations, tune block sizes, and time GC to realize 
the &lt;a 
href="https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/"&gt;FDAP
 ecosystem&lt;/a&gt;’s benefit. With other members of the open source commu 
[...]
 &lt;p&gt;StringView is based on a simple idea: avoid some string copies and 
accelerate comparisons with inlined prefixes. Like most great ideas, it is 
“obvious” only after &lt;a 
href="https://db.in.tum.de/~freitag/papers/p29-neumann-cidr20.pdf"&gt;someone 
describes it clearly&lt;/a&gt;. Although simple, straightforward implementation 
actually &lt;em&gt;slows down performance for almost every query&lt;/em&gt;. We 
must, therefore, apply astute observations and diligent engineering to reali 
[...]
@@ -8088,7 +8088,7 @@ StringViewArray accelerates string-intensive operations 
via prefix inlining and
 &lt;h2 id="conclusion"&gt;Conclusion&lt;a class="headerlink" 
href="#conclusion" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;p&gt;In this first blog post, we have described what it took to improve the
 performance of simply reading strings from Parquet files using StringView. 
While
-this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13/using-stringview-to-make-queries-faster-part-2.html"&gt;next
+this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13//string-view-german-style-strings-part-2"&gt;next
 post&lt;/a&gt;, we explore additional optimizations enabled by StringView in 
DataFusion,
 along with some of the pitfalls we encountered while implementing 
them.&lt;/p&gt;
 &lt;h1 id="footnotes"&gt;Footnotes&lt;a class="headerlink" href="#footnotes" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h1&gt;
diff --git a/output/feeds/blog.atom.xml b/output/feeds/blog.atom.xml
index b30e9f3..e54b3e1 100644
--- a/output/feeds/blog.atom.xml
+++ b/output/feeds/blog.atom.xml
@@ -7983,7 +7983,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating …&lt;/p&gt;</summary><content 
type="html">&lt;!--
 {% comment %}
 Licensed to the Apache Software Foundation (ASF) under one or more
@@ -8003,7 +8003,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating it into &lt;a 
href="https://datafusion.apache.org/"&gt;Apache DataFusion&lt;/a&gt;, 
significantly accelerating string-intensive queries in th [...]
 &lt;p&gt;Getting significant end-to-end performance improvements was 
non-trivial. Implementing StringView itself was only a fraction of the effort 
required. Among other things, we had to optimize UTF-8 validation, implement 
unintuitive compiler optimizations, tune block sizes, and time GC to realize 
the &lt;a 
href="https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/"&gt;FDAP
 ecosystem&lt;/a&gt;’s benefit. With other members of the open source commu 
[...]
 &lt;p&gt;StringView is based on a simple idea: avoid some string copies and 
accelerate comparisons with inlined prefixes. Like most great ideas, it is 
“obvious” only after &lt;a 
href="https://db.in.tum.de/~freitag/papers/p29-neumann-cidr20.pdf"&gt;someone 
describes it clearly&lt;/a&gt;. Although simple, straightforward implementation 
actually &lt;em&gt;slows down performance for almost every query&lt;/em&gt;. We 
must, therefore, apply astute observations and diligent engineering to reali 
[...]
@@ -8088,7 +8088,7 @@ StringViewArray accelerates string-intensive operations 
via prefix inlining and
 &lt;h2 id="conclusion"&gt;Conclusion&lt;a class="headerlink" 
href="#conclusion" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;p&gt;In this first blog post, we have described what it took to improve the
 performance of simply reading strings from Parquet files using StringView. 
While
-this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13/using-stringview-to-make-queries-faster-part-2.html"&gt;next
+this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13//string-view-german-style-strings-part-2"&gt;next
 post&lt;/a&gt;, we explore additional optimizations enabled by StringView in 
DataFusion,
 along with some of the pitfalls we encountered while implementing 
them.&lt;/p&gt;
 &lt;h1 id="footnotes"&gt;Footnotes&lt;a class="headerlink" href="#footnotes" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h1&gt;
diff --git a/output/feeds/xiangpeng-hao-andrew-lamb.atom.xml 
b/output/feeds/xiangpeng-hao-andrew-lamb.atom.xml
index 77ec248..155166a 100644
--- a/output/feeds/xiangpeng-hao-andrew-lamb.atom.xml
+++ b/output/feeds/xiangpeng-hao-andrew-lamb.atom.xml
@@ -18,7 +18,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating …&lt;/p&gt;</summary><content 
type="html">&lt;!--
 {% comment %}
 Licensed to the Apache Software Foundation (ASF) under one or more
@@ -38,7 +38,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating it into &lt;a 
href="https://datafusion.apache.org/"&gt;Apache DataFusion&lt;/a&gt;, 
significantly accelerating string-intensive queries in th [...]
 &lt;p&gt;Getting significant end-to-end performance improvements was 
non-trivial. Implementing StringView itself was only a fraction of the effort 
required. Among other things, we had to optimize UTF-8 validation, implement 
unintuitive compiler optimizations, tune block sizes, and time GC to realize 
the &lt;a 
href="https://www.influxdata.com/blog/flight-datafusion-arrow-parquet-fdap-architecture-influxdb/"&gt;FDAP
 ecosystem&lt;/a&gt;’s benefit. With other members of the open source commu 
[...]
 &lt;p&gt;StringView is based on a simple idea: avoid some string copies and 
accelerate comparisons with inlined prefixes. Like most great ideas, it is 
“obvious” only after &lt;a 
href="https://db.in.tum.de/~freitag/papers/p29-neumann-cidr20.pdf"&gt;someone 
describes it clearly&lt;/a&gt;. Although simple, straightforward implementation 
actually &lt;em&gt;slows down performance for almost every query&lt;/em&gt;. We 
must, therefore, apply astute observations and diligent engineering to reali 
[...]
@@ -123,7 +123,7 @@ StringViewArray accelerates string-intensive operations via 
prefix inlining and
 &lt;h2 id="conclusion"&gt;Conclusion&lt;a class="headerlink" 
href="#conclusion" title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h2&gt;
 &lt;p&gt;In this first blog post, we have described what it took to improve the
 performance of simply reading strings from Parquet files using StringView. 
While
-this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13/using-stringview-to-make-queries-faster-part-2.html"&gt;next
+this resulted in real end-to-end query performance improvements, in our &lt;a 
href="https://datafusion.apache.org/blog/2024/09/13//string-view-german-style-strings-part-2"&gt;next
 post&lt;/a&gt;, we explore additional optimizations enabled by StringView in 
DataFusion,
 along with some of the pitfalls we encountered while implementing 
them.&lt;/p&gt;
 &lt;h1 id="footnotes"&gt;Footnotes&lt;a class="headerlink" href="#footnotes" 
title="Permanent link"&gt;¶&lt;/a&gt;&lt;/h1&gt;
diff --git a/output/feeds/xiangpeng-hao-andrew-lamb.rss.xml 
b/output/feeds/xiangpeng-hao-andrew-lamb.rss.xml
index 5df1c3f..c7bd8ba 100644
--- a/output/feeds/xiangpeng-hao-andrew-lamb.rss.xml
+++ b/output/feeds/xiangpeng-hao-andrew-lamb.rss.xml
@@ -18,7 +18,7 @@ limitations under the License.
 {% endcomment %}
 --&gt;
 
-&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../2024/09/13/string-view-german-style-strings-part-2/"&gt;two 
part&lt;/a&gt; blog series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
+&lt;p&gt;&lt;em&gt;Editor's Note: This is the first of a &lt;a 
href="../string-view-german-style-strings-part-2/"&gt;two part&lt;/a&gt; blog 
series that was first published on the &lt;a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/"&gt;InfluxData
 blog&lt;/a&gt;. Thanks to InfluxData for sponsoring this work as &lt;a 
href="https://haoxp.xyz/"&gt;Xiangpeng Hao&lt;/a&gt;'s summer intern 
project&lt;/em&gt;&lt;/p&gt;
 &lt;p&gt;This blog describes our experience implementing &lt;a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout"&gt;StringView&lt;/a&gt;
 in the &lt;a href="https://github.com/apache/arrow-rs"&gt;Rust 
implementation&lt;/a&gt; of &lt;a href="https://arrow.apache.org/"&gt;Apache 
Arrow&lt;/a&gt;, and integrating …&lt;/p&gt;</description><dc:creator 
xmlns:dc="http://purl.org/dc/elements/1.1/">Xiangpeng Hao, Andrew 
Lamb</dc:creator><pubDate>Fri, 13 Sep  [...]
 {% comment %}
 Licensed to the Apache Software Foundation (ASF) under one or more
diff --git a/output/index.html b/output/index.html
index ae29fcc..b38853e 100644
--- a/output/index.html
+++ b/output/index.html
@@ -1401,7 +1401,7 @@ limitations under the License.
 {% endcomment %}
 -->
 
-<p><em>Editor's Note: This is the first of a <a 
href="../2024/09/13/string-view-german-style-strings-part-2/">two part</a> blog 
series that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
+<p><em>Editor's Note: This is the first of a <a 
href="../string-view-german-style-strings-part-2/">two part</a> blog series 
that was first published on the <a 
href="https://www.influxdata.com/blog/faster-queries-with-stringview-part-one-influxdb/">InfluxData
 blog</a>. Thanks to InfluxData for sponsoring this work as <a 
href="https://haoxp.xyz/">Xiangpeng Hao</a>'s summer intern project</em></p>
 <p>This blog describes our experience implementing <a 
href="https://arrow.apache.org/docs/format/Columnar.html#variable-size-binary-view-layout">StringView</a>
 in the <a href="https://github.com/apache/arrow-rs">Rust implementation</a> of 
<a href="https://arrow.apache.org/">Apache Arrow</a>, and integrating …</p></p>
                         <footer>
                             <ul class="actions">

