This is an automated email from the ASF dual-hosted git repository.
xqhu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git
The following commit(s) were added to refs/heads/master by this push:
new 0eb3626b3a2 Add Albertsons case study (#37201)
0eb3626b3a2 is described below
commit 0eb3626b3a21a3911c14cf7efdc106c24c5cc8fa
Author: Danny McCormick <[email protected]>
AuthorDate: Tue Dec 30 19:44:01 2025 -0500
Add Albertsons case study (#37201)
* Add Albertsons case study
* fix image ref
* Fix table formatting
* early tuesday
---
.../www/site/content/en/case-studies/albertsons.md | 203 +++++++++++++++++++++
website/www/site/data/en/quotes.yaml | 5 +
.../images/case-study/albertsons/ankurraj.jpeg | Bin 0 -> 27941 bytes
.../case-study/albertsons/mohammedjawedkhan.jpeg | Bin 0 -> 5515 bytes
.../images/case-study/albertsons/utkarshparekh.png | Bin 0 -> 1239692 bytes
.../images/case-study/albertsons/vinaydesai.jpeg | Bin 0 -> 42897 bytes
.../static/images/logos/powered-by/albertsons.jpg | Bin 0 -> 192436 bytes
7 files changed, 208 insertions(+)
diff --git a/website/www/site/content/en/case-studies/albertsons.md
b/website/www/site/content/en/case-studies/albertsons.md
new file mode 100644
index 00000000000..167275b10c8
--- /dev/null
+++ b/website/www/site/content/en/case-studies/albertsons.md
@@ -0,0 +1,203 @@
+---
+title: "Albertsons: Using Apache Beam for Unified Analytics Ingestion"
+name: "Albertsons: Beam for Analytics Ingestion"
+icon: /images/logos/powered-by/albertsons.jpg
+hasNav: true
+category: study
+cardTitle: "Albertsons: Using Apache Beam for Unified Analytics Ingestion"
+cardDescription: "Apache Beam enabled Albertsons to standardize ingestion into
a resilient and portable framework, delivering 99.9% reliability at enterprise
scale across both real-time signals and core business data."
+authorName: "Utkarsh Parekh"
+authorPosition: "Staff Engineer, Data @ Albertsons"
+authorImg: /images/case-study/albertsons/utkarshparekh.png
+publishDate: 2026-01-06T00:04:00+00:00
+---
+<!--
+Licensed under the Apache License, Version 2.0 (the "License");
+you may not use this file except in compliance with the License.
+You may obtain a copy of the License at
+
+http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+-->
+<!-- div with class case-study-opinion is displayed at the top left area of
the case study page -->
+<div class="case-study-opinion">
+ <div class="case-study-opinion-img">
+ <img src="/images/logos/powered-by/albertsons.jpg"/>
+ </div>
+ <blockquote class="case-study-quote-block">
+ <p class="case-study-quote-text">
+ “Apache Beam enabled Albertsons to standardize ingestion into a
resilient and portable framework, delivering 99.9% reliability at enterprise
scale across both real-time signals and core business data.”
+ </p>
+ <div class="case-study-quote-author">
+ <div class="case-study-quote-author-img">
+ <img src="/images/case-study/albertsons/utkarshparekh.png">
+ </div>
+ <div class="case-study-quote-author-info">
+ <div class="case-study-quote-author-name">
+ Utkarsh Parekh
+ </div>
+ <div class="case-study-quote-author-position">
+ Staff Engineer, Data @ Albertsons
+ </div>
+ </div>
+ </div>
+ </blockquote>
+</div>
+
+<!-- div with class case-study-post is the case study page main content -->
+<div class="case-study-post">
+
+# Albertsons: Using Apache Beam for Unified Analytics Ingestion
+
+## Context
+
+Albertsons Companies is one of the largest retail grocery organizations in
North America, operating over 2,200 stores and serving millions of customers
across physical and digital channels.
+
+Apache Beam is the foundation of the **internal Unified Data Ingestion
framework**, a standardized enterprise ELT platform that delivers both
streaming and batch data into modern cloud analytics systems. The framework
uses **both Java and Python Beam SDKs, Dataflow Flex Templates, enabling
flexibility across workloads. When a capability is not yet supported in the
Python SDK but is available in the Java SDK, we can seamlessly leverage
Java-based implementations to deliver the required fu [...]
+
+This unified architecture reduces duplicated logic, standardizes governance,
and accelerates data enablement across business domains.
+
+## Challenges and Use Cases
+
+Before Apache Beam, ingestion patterns were fragmented across streaming and
batch pipelines. This led to longer development cycles, inconsistent data
quality, and increased operational overhead.
+
+The framework’s architecture emphasizes object-oriented principles including
single responsibility, modularity, and separation of concerns. This enables
reusable Beam transforms, configurable IO connectors, and clean abstractions
between orchestration and execution layers.
+
+Beam enabled:
+
+- Unified development for real-time and scheduled ingestion
+- Standardized connectivity to enterprise systems
+- Reliable governance and observability baked into pipelines
+
+
+The framework supports:
+
+- **Real-time streaming analytics** from operational and digital signals
+- **Batch ingestion** from mission-critical enterprise systems
+- **File-based ingestion** for vendor and financial datasets
+- **Legacy MQ ingestion** using JMSIO-based connectors
+
+To scale efficiently, the framework features **Apache Airflow dynamic DAG
creation.**
+
+Metadata-driven ingestion jobs generate DAGs automatically at runtime, and
**BashOperator** is used to submit **Dataflow** jobs for consistent execution,
security, and monitoring.
+
+Common Beam transforms include Impulse, windowing, grouping, and batching
optimizations.
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+ <p class="case-study-quote-text">
+ In Albertsons we utilized Apache Beam to write an in-house framework that
enabled our data engineering teams to create robust data pipelines through a
consistent - single interface. The framework helped reduce the overall
development cycle since we templatized the various data integration patterns.
Having a custom framework gave us flexibility to prioritize and configure
multiple technologies/integration points like Kafka, Files, Managed Queues,
Databases (Oracle, DB2, Azure SQL etc. [...]
+ </p>
+ <div class="case-study-quote-author">
+ <div class="case-study-quote-author-img">
+ <img src="/images/case-study/albertsons/mohammedjawedkhan.jpeg">
+ </div>
+ <div class="case-study-quote-author-info">
+ <div class="case-study-quote-author-name">
+ Mohammed Jawed Khan
+ </div>
+ <div class="case-study-quote-author-position">
+ Principal Data Engineer @ Albertsons
+ </div>
+ </div>
+ </div>
+</blockquote>
+
+## Technical Data
+
+Apache Beam pipelines operate at enterprise scale:
+
+- Hundreds of production pipelines
+- Terabytes of data processed weekly, including thousands of streaming events
per second.
+
+All ingestion paths adhere to internal security controls and support
**tokenization** for PII and sensitive data protection using Protegrity.
+
+## Results
+
+Apache Beam has significantly improved the reliability, reusability, and speed
of Albertsons’ data platforms:
+
+{{< table >}}
+| Area | Outcome
|
+| ---------------------- | ---------------------------------------------------
|
+| Reliability | **99.9%+ uptime** for data ingestion
|
+| Developer Productivity | Pipelines created faster via standardized templates
|
+| Operational Efficiency | **Autoscaling** optimizes resource utilization
|
+| Business Enablement | Enables **real-time decisioning**
|
+{{< /table >}}
+
+### Business Impact
+
+Beam enabled one unified ingestion framework that supports both streaming and
batch workloads - eliminating fragmentation and delivering trusted signals to
analytics.
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+ <p class="case-study-quote-text">
+ Integrating Apache Beam into our in-house ELT platform has reduced
engineering effort and operational overhead, while improving efficiency at
scale. Teams can now focus more on delivering business outcomes instead of
managing infrastructure.
+ </p>
+ <div class="case-study-quote-author">
+ <div class="case-study-quote-author-img">
+ <img src="/images/case-study/albertsons/vinaydesai.jpeg">
+ </div>
+ <div class="case-study-quote-author-info">
+ <div class="case-study-quote-author-name">
+ Vinay Desai
+ </div>
+ <div class="case-study-quote-author-position">
+ Director Engineering @ Albertsons
+ </div>
+ </div>
+ </div>
+</blockquote>
+
+<blockquote class="case-study-quote-block case-study-quote-wrapped">
+ <p class="case-study-quote-text">
+ By leveraging Apache Beam into the ACI platform, we achieved a significant
reduction in downtime. The adoption of reusable features further minimized the
risk of production issues.
+ </p>
+ <div class="case-study-quote-author">
+ <div class="case-study-quote-author-img">
+ <img src="/images/case-study/albertsons/ankurraj.jpeg">
+ </div>
+ <div class="case-study-quote-author-info">
+ <div class="case-study-quote-author-name">
+ Ankur Raj
+ </div>
+ <div class="case-study-quote-author-position">
+ Director , Data Engineering Operations @ Albertsons
+ </div>
+ </div>
+ </div>
+</blockquote>
+
+## Infrastructure
+
+{{< table >}}
+| Component | Detail |
+| ---------------------- | --------------------------------------------- |
+| Cloud | Google Cloud Platform |
+| Runner | DataflowRunner |
+| Beam SDKs | Java & Python |
+| Workflow Orchestration | Apache Airflow with dynamic DAG creation |
+| Deployment | BashOperator submits Dataflow jobs |
+| Sources | Kafka, JDBC systems, files, MQ, APIs |
+| Targets | BigQuery, GCS, Kafka |
+| Observability | Centralized logging, alerting, retry patterns |
+{{< /table >}}
+
+Deployment is portable across Dev, QA, and Prod environments.
+
+## Beam Community & Evolution
+
+Beam community resources supported the framework’s growth through:
+
+- Slack & developer channels
+- Documentation
+- Beam Summit participation
+
+<!-- case_study_feedback adds feedback buttons -->
+{{< case_study_feedback "AlbertsonsCompanies" >}}
+
+</div>
+<div class="clear-nav"></div>
\ No newline at end of file
diff --git a/website/www/site/data/en/quotes.yaml
b/website/www/site/data/en/quotes.yaml
index 4ae6cca442a..2c0dbef443f 100644
--- a/website/www/site/data/en/quotes.yaml
+++ b/website/www/site/data/en/quotes.yaml
@@ -41,6 +41,11 @@
logoUrl: images/logos/powered-by/credit-karma.png
linkUrl: case-studies/creditkarma/index.html
linkText: Learn more
+- text: Apache Beam enabled Albertsons to standardize ingestion into a
resilient and portable framework, delivering 99.9% reliability at enterprise
scale across both real-time signals and core business data.
+ icon: icons/quote-icon.svg
+ logoUrl: images/logos/powered-by/albertsons.jpg
+ linkUrl: case-studies/creditkarma/index.html
+ linkText: Learn more
- text: Apache Beam is a central component to Intuit's Stream Processing
Platform, which has driven 3x faster time-to-production for authoring a stream
processing pipeline.
icon: icons/quote-icon.svg
logoUrl: images/case-study/intuit/intuit-quote.png
diff --git a/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg
b/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg
new file mode 100644
index 00000000000..47a5f83b75c
Binary files /dev/null and
b/website/www/site/static/images/case-study/albertsons/ankurraj.jpeg differ
diff --git
a/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg
b/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg
new file mode 100644
index 00000000000..579bc885e63
Binary files /dev/null and
b/website/www/site/static/images/case-study/albertsons/mohammedjawedkhan.jpeg
differ
diff --git
a/website/www/site/static/images/case-study/albertsons/utkarshparekh.png
b/website/www/site/static/images/case-study/albertsons/utkarshparekh.png
new file mode 100644
index 00000000000..9383e0be40b
Binary files /dev/null and
b/website/www/site/static/images/case-study/albertsons/utkarshparekh.png differ
diff --git
a/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg
b/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg
new file mode 100644
index 00000000000..6ff6938a2f7
Binary files /dev/null and
b/website/www/site/static/images/case-study/albertsons/vinaydesai.jpeg differ
diff --git a/website/www/site/static/images/logos/powered-by/albertsons.jpg
b/website/www/site/static/images/logos/powered-by/albertsons.jpg
new file mode 100644
index 00000000000..35d5e40f96e
Binary files /dev/null and
b/website/www/site/static/images/logos/powered-by/albertsons.jpg differ