<h2 id="deploying-sglang-with-pd-disaggregation-via-shepherd-model-gateway"><a class="toclink" href="../smg/">Deploying SGLang with PD disaggregation via Shepherd Model Gateway</a></h2>
+<p><code>dstack</code> is an open-source control plane that simplifies GPU orchestration for both training and inference across cloud providers, hardware vendors, and frameworks. Over the past year, we've been steadily making inference a first-class citizen in dstack.</p>
<h2 id="supporting-hot-aisle-amd-ai-developer-cloud"><a class="toclink" href="../hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
-<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads without manual infrastructure setup.</p>
-<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
<p>Today, we’re excited to announce native integration with <a href="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing.</p>
<h2 id="supporting-hot-aisle-amd-ai-developer-cloud"><a class="toclink" href="../../../hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
+<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads without manual infrastructure setup.</p>
+<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
<p>Today, we’re excited to announce native integration with <a href="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing.</p>
<h2 id="introducing-gpu-blocks-and-proxy-jump-for-ssh-fleets"><a class="toclink" href="../../../gpu-blocks-and-proxy-jump/">Introducing GPU blocks and proxy jump for SSH fleets</a></h2>
-<p>Recent breakthroughs in open-source AI have made AI infrastructure accessible beyond public clouds, driving demand for
-running AI workloads in on-premises data centers and private clouds.
-This shift offers organizations high-performance clusters along with flexibility and control.</p>
-<p>However, Kubernetes, while a popular choice for traditional deployments, is often too complex and low-level to address
-the needs of AI teams.</p>
-<p>Originally, <code>dstack</code> was focused on public clouds. With the new release, <code>dstack</code>
-extends support to data centers and private clouds, offering a simpler, AI-native solution that replaces Kubernetes and