Commit 7afeb30

[Website] add akvelon case study (#34943)
* feat: add akvelon logo
* feat: add akvelon case study
* fix: remove white space
* feat: add akvelon to main page
* feat: use new images
* fix: typos
* fix: change order of akvelon case-study
* fix: update text
* fix: update mainPage text
* fix: update images
* fix: about akvelon section update
* fix: update akvelon card
* fix: update akvelon header
* fix: update code tag
* fix: update about akvelon
* fix: update date and order
* fix: add link and change img
* fix: change CDAP text
* fix: add bold weight
* fix: solve conflicts
* fix: remove unused code
* fix: delete whitespace
* fix: indents format
* fix: add bold text

---------

Co-authored-by: Bulat Safiullin <v-safiullinb@microsoft.com>
1 parent 7eb7b9b commit 7afeb30

7 files changed, +173 -4 lines changed

website/www/site/content/en/case-studies/akvelon.md

Lines changed: 169 additions & 4 deletions
@@ -1,8 +1,17 @@
11
---
2-
title: "Akvelon"
3-
icon: /images/logos/powered-by/akvelon.png
4-
hasNav: true
5-
cardDescription: "<p><a href='https://akvelon.com/' target='_blank' rel='noopener noreferrer'>Akvelon</a> is a software engineering company that helps start-ups, SMBs, and Fortune 500 companies unlock the full potential of cloud, data, and AI/ML to empower their strategic advantage. Akvelon team has deep expertise in integrating Apache Beam with diverse data processing ecosystems and is an enthusiastic Apache Beam community contributor.</p>"
2+
title: "Secure and Interoperable Apache Beam Pipelines by Akvelon"
3+
name: "Akvelon"
4+
icon: "/images/logos/powered-by/akvelon.png"
5+
category: "study"
6+
cardTitle: "Secure and Interoperable Apache Beam Pipelines by Akvelon"
7+
cardDescription: "To support data privacy and pipeline reusability at scale, Akvelon developed Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates. Akvelon also built a CDAP Connector to integrate CDAP plugins with Apache Beam, enabling plugin reuse and multi-runtime compatibility."
8+
authorName: "Vitaly Terentyev"
9+
coauthorName: "Ashley Pikle"
10+
authorPosition: "Software Engineer @Akvelon"
11+
coauthorPosition: "Director of AI Business Development @Akvelon"
12+
authorImg: /images/case-study/akvelon/terentyev.png
13+
coauthorImg: /images/case-study/akvelon/pikle.png
14+
publishDate: 2025-05-25T00:12:00+00:00
615
---
716
<!--
817
Licensed under the Apache License, Version 2.0 (the "License");
@@ -17,3 +26,159 @@ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
1726
See the License for the specific language governing permissions and
1827
limitations under the License.
1928
-->
29+
<div class="case-study-opinion">
30+
<div class="case-study-opinion-img">
31+
<img src="/images/logos/powered-by/akvelon.png"/>
32+
</div>
33+
<blockquote class="case-study-quote-block">
34+
<p class="case-study-quote-text">
35+
“To support data privacy and pipeline reusability at scale, Akvelon developed Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates. Akvelon also built a CDAP Connector to integrate CDAP plugins with Apache Beam, enabling plugin reuse and multi-runtime compatibility.”
36+
</p>
37+
<div class="case-study-quote-author">
38+
<div class="case-study-quote-author-img">
39+
<img src="/images/case-study/akvelon/pikle.png">
40+
</div>
41+
<div class="case-study-quote-author-info">
42+
<div class="case-study-quote-author-name">
43+
Ashley Pikle
44+
</div>
45+
<div class="case-study-quote-author-position">
46+
Director of AI Business Development @Akvelon
47+
</div>
48+
</div>
49+
</div>
50+
</blockquote>
51+
</div>
52+
<div class="case-study-post">
53+
54+
# Secure and Interoperable Apache Beam Pipelines by Akvelon
55+
56+
## Background
57+
58+
To meet growing enterprise needs for secure, scalable, and interoperable data processing pipelines, **Akvelon** developed multiple Apache Beam-powered solutions tailored for real-world production environments:
59+
- Data tokenization and detokenization capabilities for **Protegrity** and a leading North American credit reporting company
60+
- A connector layer to integrate **CDAP** plugins into Apache Beam pipelines
61+
62+
By leveraging [Apache Beam](https://beam.apache.org/) and [Google Cloud Dataflow](https://cloud.google.com/products/dataflow?hl=en), Akvelon enabled its clients to achieve scalable data protection, regulatory compliance, and platform interoperability through reusable, open-source pipeline components.
63+
64+
## Use Case 1: Data Tokenization for Protegrity and a Leading Credit Reporting Company
65+
66+
### The Challenge
67+
68+
**Protegrity**, a leading enterprise data-security vendor, sought to enhance its data protection platform with scalable tokenization support for batch and streaming data. Its goal was to let customers, such as a major North American credit reporting company, tokenize sensitive data using Google Cloud Dataflow. The solution needed to be fast, secure, reusable, and compliant with privacy regulations (e.g., HIPAA, GDPR).
69+
70+
### The Solution
71+
72+
Akvelon designed and implemented a **Dataflow Flex Template** using Apache Beam that allows users to tokenize and detokenize sensitive data within both batch and streaming pipelines.
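
To make the tokenize/detokenize round trip concrete, the following is a minimal Python sketch. It is not the Flex Template's code and not Protegrity's API: `InMemoryTokenVault`, `protect`, and `unprotect` are hypothetical names, and the in-memory lookup table only stands in for whatever protection service a real pipeline would call.

```python
import secrets


class InMemoryTokenVault:
    """Hypothetical stand-in for a tokenization service (illustration only)."""

    def __init__(self):
        self._token_by_value = {}
        self._value_by_token = {}

    def tokenize(self, value: str) -> str:
        # Reuse an existing token so the same input always maps to the same token.
        if value not in self._token_by_value:
            token = "tok_" + secrets.token_hex(8)
            self._token_by_value[value] = token
            self._value_by_token[token] = value
        return self._token_by_value[value]

    def detokenize(self, token: str) -> str:
        # Raises KeyError for tokens this vault never issued.
        return self._value_by_token[token]


def protect(record: dict, sensitive_fields: set, vault: InMemoryTokenVault) -> dict:
    """Return a copy of the record with the sensitive fields replaced by tokens."""
    return {
        key: vault.tokenize(value) if key in sensitive_fields else value
        for key, value in record.items()
    }


def unprotect(record: dict, sensitive_fields: set, vault: InMemoryTokenVault) -> dict:
    """Reverse protect() for the same set of sensitive fields."""
    return {
        key: vault.detokenize(value) if key in sensitive_fields else value
        for key, value in record.items()
    }
```

In a pipeline, a function like `protect` would typically be applied per record inside a transform, so the same logic can serve bounded (batch) and unbounded (streaming) inputs.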
73+
74+
<div class="post-scheme">
75+
<a href="/images/case-study/akvelon/diagram-01.png" target="_blank" title="Click to enlarge">
76+
<img src="/images/case-study/akvelon/diagram-01.png" alt="Protegrity & Equifax Tokenization Pipeline">
77+
</a>
78+
</div>
79+
80+
### Key features
81+
- **Seamless integration with Protegrity UDFs**, enabling native tokenization directly within Beam transforms without requiring external service orchestration
82+
- **Support for multiple data formats**, such as CSV, JSON, and Parquet, allowing flexible deployment across diverse data pipelines
83+
- **Stateful processing with `DoFn` and timers**, which improves streaming reliability and reduces overall pipeline latency
84+
- **Full compatibility with Google Cloud Dataflow**, ensuring autoscaling, fault tolerance, and operational simplicity through managed Apache Beam execution
85+
86+
This design provided both Protegrity and its enterprise clients with a reusable, open-source architecture for scalable data privacy and processing.
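
One common reason to pair state with timers in a streaming transform is to buffer records and flush them either when a batch fills up or when a deadline passes. The sketch below simulates that pattern in plain Python under stated assumptions: `BatchingBuffer` and its parameters are hypothetical, time is passed in explicitly instead of coming from a runner, and none of this is the Beam API or the template's actual implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class BatchingBuffer:
    """Hypothetical buffer that flushes on batch size or on an expiry deadline."""

    max_size: int
    max_wait_seconds: float
    flush: Callable[[List[dict]], None]
    _items: List[dict] = field(default_factory=list)
    _deadline: Optional[float] = None

    def add(self, record: dict, now: float) -> None:
        # Fire the "timer" first if the deadline for the current batch has passed.
        self.on_time(now)
        if not self._items:
            # The first record of a new batch sets the expiry deadline.
            self._deadline = now + self.max_wait_seconds
        self._items.append(record)
        if len(self._items) >= self.max_size:
            self._emit()

    def on_time(self, now: float) -> None:
        # Emit a partial batch once the deadline is reached.
        if self._items and self._deadline is not None and now >= self._deadline:
            self._emit()

    def _emit(self) -> None:
        batch, self._items = self._items, []
        self._deadline = None
        self.flush(batch)


# Usage: four records with a batch size of three; the last one is flushed by the deadline.
batches: List[List[dict]] = []
buffer = BatchingBuffer(max_size=3, max_wait_seconds=5.0, flush=batches.append)
for i in range(4):
    buffer.add({"id": i}, now=float(i))
buffer.on_time(now=10.0)
```

Batching like this lets a downstream protection call handle several records at once, while the deadline bounds how long any single record can wait in the buffer.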
87+
88+
### The Results
89+
- **Enabled data tokenization at scale** for regulated industries
90+
- **Accelerated adoption of Dataflow templates** across Protegrity’s customer base
91+
- **Delivered an [open-source Flex Template](https://github.com/apache/beam/blob/master/examples/java/src/main/java/org/apache/beam/examples/complete/datatokenization/README.md)** that benefits the entire Apache Beam community
92+
93+
<blockquote class="case-study-quote-block case-study-quote-wrapped">
94+
<p class="case-study-quote-text">
95+
In collaboration with Akvelon, Protegrity utilized a Dataflow Flex template that helps us enable customers to tokenize and detokenize streaming and batch data from a fully managed Google Cloud Dataflow service. We appreciate Akvelon’s support as a trusted partner with Google Cloud expertise.
96+
</p>
97+
<div class="case-study-quote-author">
98+
<div class="case-study-quote-author-img">
99+
<img src="/images/case-study/akvelon/chitnis.png">
100+
</div>
101+
<div class="case-study-quote-author-info">
102+
<div class="case-study-quote-author-name">
103+
Jay Chitnis
104+
</div>
105+
<div class="case-study-quote-author-position">
106+
VP of Partners and Business Development @Protegrity
107+
</div>
108+
</div>
109+
</div>
110+
</blockquote>
111+
112+
## Use Case 2: CDAP Connector for Apache Beam
113+
114+
### The Challenge
115+
116+
**CDAP** had extensive plugin support for Spark but lacked native compatibility with Apache Beam. This limitation prevented organizations from reusing CDAP's rich ecosystem of data connectors (e.g., Salesforce, HubSpot, ServiceNow) within Beam-based pipelines, constraining cross-platform integration.
117+
118+
### The Solution
119+
120+
Akvelon engineered a **shim layer** (CDAP Connector) that bridges CDAP plugins with Apache Beam. This innovation enables CDAP source and sink plugins to operate seamlessly within Beam pipelines.
121+
122+
<div class="post-scheme">
123+
<a href="/images/case-study/akvelon/diagram-02.png" target="_blank" title="Click to enlarge">
124+
<img src="/images/case-study/akvelon/diagram-02.png" alt="CDAP Connector Integration with Apache Beam">
125+
</a>
126+
</div>
127+
128+
### Highlights
129+
130+
- Supports `StructuredRecord` format conversion to Beam schema (`BeamRow`)
131+
- Enables CDAP plugins to run seamlessly in both Spark and Beam pipelines
132+
- Facilitates integration testing across third-party data sources (e.g., Salesforce, Zendesk)
133+
- Complies with Beam’s development and style guide for open-source contributions
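
The schema conversion in the first highlight can be pictured as mapping each field of a CDAP-style record onto a typed row. The following Python sketch is illustrative only: `TYPE_MAP`, `Row`, and `to_row` are hypothetical names, the type names are simplified, and the connector's real conversion logic is not shown in this document.

```python
from typing import Any, Dict, List, NamedTuple, Tuple

# Assumed, simplified mapping from CDAP-style type names to Beam-style type names.
TYPE_MAP = {
    "STRING": "STRING",
    "INT": "INT32",
    "LONG": "INT64",
    "DOUBLE": "DOUBLE",
    "BOOLEAN": "BOOLEAN",
}


class Row(NamedTuple):
    """Hypothetical schema-carrying row: field (name, type) pairs plus values."""

    schema: Tuple[Tuple[str, str], ...]
    values: Tuple[Any, ...]


def to_row(record_schema: List[Tuple[str, str]], record: Dict[str, Any]) -> Row:
    """Convert a record described by (name, type) pairs into a typed row."""
    schema = []
    values = []
    for name, source_type in record_schema:
        if source_type not in TYPE_MAP:
            raise ValueError(
                f"unsupported field type {source_type!r} for field {name!r}"
            )
        schema.append((name, TYPE_MAP[source_type]))
        # Missing fields become None, preserving the schema's field order.
        values.append(record.get(name))
    return Row(tuple(schema), tuple(values))
```

Failing fast on an unmapped type, instead of silently dropping the field, keeps schema mismatches visible when a new plugin is wired in.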
134+
135+
The project included prototyping, test infrastructure, and Salesforce plugin pipelines to ensure robustness.
136+
137+
### The Results
138+
139+
- **Enabled seamless reuse of CDAP plugins in Beam**
140+
  - **30+ CDAP plugins** now work seamlessly with Beam pipelines
141+
  - **Integration time** reduced **from hours to just a few minutes**
142+
- **Simplified execution and migration of CDAP pipelines to Beam**
143+
  - Enabled **seamless execution** of CDAP pipelines on the Beam runtime
144+
  - Simplified **migration of existing CDAP pipelines** to Beam with minimal changes
145+
- **Accelerated delivery and validated performance for Google Cloud customers**
146+
  - Delivered **rapid development cycles** with standardized plugin configurations
147+
  - Successfully processed **5 million records** in end-to-end tests for **batch and streaming**
148+
149+
## Technology Stack
150+
151+
- Apache Beam
152+
- Google Cloud Dataflow
153+
- Protegrity Data Protection Platform
154+
- CDAP (Cloud Data Fusion)
155+
- BigQuery
156+
- Salesforce, Zendesk, HubSpot, ServiceNow plugins
157+
158+
## Final words
159+
160+
Akvelon’s contributions to Apache Beam-based solutions, from advanced tokenization for Protegrity and its enterprise customers to plugin interoperability through the CDAP Connector, demonstrate the value of open-source, cloud-native data engineering. By delivering reusable and secure components, Akvelon supports enterprises in modernizing and unifying their data infrastructure.
161+
162+
## Watch the Solution in Action
163+
164+
[Architecture Walkthrough Video](https://www.youtube.com/watch?v=IQIzdfNIAHk)
165+
166+
## About Akvelon, Inc.
167+
168+
Akvelon guides enterprises through digital transformation on Google Cloud - applying deep expertise in data engineering, AI/ML, cloud infrastructure, and custom application development to design, deploy, and scale modern workloads.
169+
170+
At Akvelon, we’ve built a long-standing partnership with Google Cloud—helping software-driven organizations implement, migrate, modernize, automate, and optimize their systems while making the most of cloud technologies.
171+
172+
As a **Google Cloud Service** and **Build Partner**, we contribute actively to the ecosystem:
173+
- Contributing code and guidance to **Apache Beam**—including Playground, Tour of Beam, and the Duet AI training set
174+
- Improving project infrastructure and supporting the Apache Beam community—now with an official Apache Beam Committer on our team
175+
176+
Backed by deep expertise in data engineering, AI/ML, cloud architecture, and application development, our engineers deliver reusable, secure, and production-ready solutions on Google Cloud for enterprises worldwide.
177+
178+
- [Akvelon on Google Cloud](https://cloud.google.com/find-a-partner/partner/akvelon)
179+
- [Akvelon Data and Analytics Accelerators](https://github.com/akvelon/DnA_accelerators)
180+
181+
{{< case_study_feedback "Akvelon" >}}
182+
183+
</div>
184+
<div class="clear-nav"></div>

website/www/site/data/en/quotes.yaml

Lines changed: 4 additions & 0 deletions
@@ -81,6 +81,10 @@
8181
logoUrl: /images/logos/powered-by/accenture.png
8282
linkUrl: case-studies/accenture_baltics/index.html
8383
linkText: Learn more
84+
- text: Akvelon built Beam-based solutions for Protegrity and a major North American credit reporting company, enabling tokenization with Dataflow Flex Templates and reducing infrastructure and deployment complexity.
85+
icon: icons/quote-icon.svg
86+
logoUrl: /images/logos/powered-by/akvelon.png
87+
linkUrl: case-studies/akvelon/index.html
8488
- text: With Apache Beam and Dataflow, Credit Karma achieved a 99% uptime for critical data pipelines, a significant jump from 80%. This reliability, coupled with faster development (1 engineer vs. 3 estimated), has been crucial for enabling real-time financial insights for our more than 140 million members.
8589
icon: icons/quote-icon.svg
8690
logoUrl: images/logos/powered-by/credit-karma.png
5 binary image files changed (137 KB, 103 KB, 72.1 KB, 491 KB, 796 KB)
