Commit 40745ee

site update Wed Nov 5 07:26:15 PST 2025
1 parent 1b49d73 commit 40745ee

16 files changed: +21 -21 lines changed

declarative-kubernetes-cluster-emulation-with-kemu/index.html

Lines changed: 6 additions & 6 deletions
@@ -70,7 +70,7 @@
 name="description"
 content="

-Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
+Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.

 "
 />
@@ -97,7 +97,7 @@
 <meta property="og:url" content="https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/">
 <meta property="og:site_name" content="datastrophic">
 <meta property="og:title" content="KEMU: A Declarative Approach to Emulating Kubernetes Clusters at Scale">
-<meta property="og:description" content="Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.">
+<meta property="og:description" content="Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.">
 <meta property="og:locale" content="en">
 <meta property="og:type" content="article">
 <meta property="article:section" content="posts">
@@ -112,7 +112,7 @@

 <meta name="twitter:card" content="summary">
 <meta name="twitter:title" content="KEMU: A Declarative Approach to Emulating Kubernetes Clusters at Scale">
-<meta name="twitter:description" content="Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.">
+<meta name="twitter:description" content="Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.">


 <script type="application/ld+json">
@@ -123,7 +123,7 @@
 "name": "KEMU: A Declarative Approach to Emulating Kubernetes Clusters at Scale",
 "headline": "KEMU: A Declarative Approach to Emulating Kubernetes Clusters at Scale",

-"abstract": "Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.",
+"abstract": "Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.",
 "inLanguage": "en",
 "url" : "https:\/\/datastrophic.io\/declarative-kubernetes-cluster-emulation-with-kemu\/",
 "author" : {
@@ -449,7 +449,7 @@ <h1 class="mb-8 mt-0 text-4xl font-extrabold text-neutral-900 dark:text-neutral"

 <div class="min-h-0 min-w-0 max-w-prose grow">
 <p>Optimizing scheduling efficiency for AI workloads requires extensive experimentation and observation.
-Extended GPU procurement lead timesoften spanning months mean existing infrastructure must be
+Extended GPU procurement lead times, often spanning months, mean existing infrastructure must be
 maximized for utilization to avoid capacity bottlenecks. For high-end GPUs, supply constraints
 eliminate cloud autoscaling advantages, making both cloud and on-premises environments equally
 constrained in their ability to rapidly expand capacity on demand.</p>
@@ -467,7 +467,7 @@ <h1 class="mb-8 mt-0 text-4xl font-extrabold text-neutral-900 dark:text-neutral"
 <h2 id="requirements" class="relative group">Requirements <span class="absolute top-0 w-6 transition-opacity opacity-0 -start-6 not-prose group-hover:opacity-100"><a class="group-hover:text-primary-300 dark:group-hover:text-neutral-700" style="text-decoration-line: none !important;" href="#requirements" aria-label="Anchor">#</a></span></h2><p>Let&rsquo;s consider the following cluster setup to provide background for the functionality of the emulated cluster:</p>
 <ul>
 <li>A Kubernetes cluster with 1,000+ GPU nodes of different types;</li>
-<li>The nodes are spread across several data centers/availability zones;</li>
+<li>The nodes are spread across multiple topology domains (availability zones, racks);</li>
 <li>Specialized scheduling and training operators are running on the cluster;</li>
 <li>Observability is provided via the Prometheus stack.</li>
 </ul>

index.html

Lines changed: 1 addition & 1 deletion
@@ -492,7 +492,7 @@ <h3 class="flex items-center text-xl font-semibold">
 </div>

 <div class="prose py-1 dark:prose-invert">
-Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
+Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
 </div>

 </div>

index.json

Lines changed: 1 addition & 1 deletion
Large diffs are not rendered by default.

index.xml

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@
 <link>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</link>
 <pubDate>Tue, 04 Nov 2025 00:00:00 +0000</pubDate>
 <guid>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</guid>
-<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
+<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
 </item>
 <item>
 <title>Secure Kubeflow Ingress and Authentication with Istio External Auth, Dex, and OAuth2 Proxy</title>

posts/index.html

Lines changed: 1 addition & 1 deletion
@@ -481,7 +481,7 @@ <h3 class="flex items-center text-xl font-semibold">
 </div>

 <div class="prose py-1 dark:prose-invert">
-Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
+Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
 </div>

 </div>

posts/index.xml

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@
 <link>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</link>
 <pubDate>Tue, 04 Nov 2025 00:00:00 +0000</pubDate>
 <guid>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</guid>
-<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
+<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
 </item>
 <item>
 <title>Secure Kubeflow Ingress and Authentication with Istio External Auth, Dex, and OAuth2 Proxy</title>

tags/emulation/index.html

Lines changed: 1 addition & 1 deletion
@@ -405,7 +405,7 @@ <h3 class="flex items-center text-xl font-semibold">
 </div>

 <div class="prose py-1 dark:prose-invert">
-Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
+Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
 </div>

 </div>

tags/emulation/index.xml

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@
 <link>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</link>
 <pubDate>Tue, 04 Nov 2025 00:00:00 +0000</pubDate>
 <guid>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</guid>
-<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
+<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
 </item>
 </channel>
 </rss>

tags/kemu/index.html

Lines changed: 1 addition & 1 deletion
@@ -405,7 +405,7 @@ <h3 class="flex items-center text-xl font-semibold">
 </div>

 <div class="prose py-1 dark:prose-invert">
-Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
+Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.
 </div>

 </div>

tags/kemu/index.xml

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@
 <link>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</link>
 <pubDate>Tue, 04 Nov 2025 00:00:00 +0000</pubDate>
 <guid>https://datastrophic.io/declarative-kubernetes-cluster-emulation-with-kemu/</guid>
-<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
+<description>Optimizing AI workload scheduling requires extensive experimentation and observation, but testing scheduler modifications in production is risky - configuration errors can cause multi-day delays and wasted capacity. This post introduces KEMU, a declarative Kubernetes Emulator Utility that replaces fragmented multi-tool cluster setups with a single configuration specification, enabling safe experimentation with large-scale GPU clusters on minimal resources.</description>
 </item>
 </channel>
 </rss>
