dstackai
diff --git a/‎assets/images/social/blog/smg.png‎
46.3 KB b/‎assets/images/social/blog/smg.png‎
46.3 KB
diff --git a/‎blog/changelog/index.html‎
Lines changed: 65 additions & 67 deletions b/‎blog/changelog/index.html‎
Lines changed: 65 additions & 67 deletions
diff --git a/‎blog/changelog/page/2/index.html‎
Lines changed: 67 additions & 72 deletions b/‎blog/changelog/page/2/index.html‎
Lines changed: 67 additions & 72 deletions
@@ -3923,6 +3923,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#deploying-sglang-with-pd-disaggregation-via-shepherd-model-gateway" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        Deploying SGLang with PD disaggregation via Shepherd Model Gateway
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#infrastructure-orchestration-is-an-agent-skill" class="md-nav__link">
     <span class="md-ellipsis">
@@ -4020,17 +4031,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Supporting Hot Aisle AMD AI Developer Cloud
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -4236,6 +4236,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#deploying-sglang-with-pd-disaggregation-via-shepherd-model-gateway" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        Deploying SGLang with PD disaggregation via Shepherd Model Gateway
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#infrastructure-orchestration-is-an-agent-skill" class="md-nav__link">
     <span class="md-ellipsis">
@@ -4333,17 +4344,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Supporting Hot Aisle AMD AI Developer Cloud
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -4364,6 +4364,49 @@ <h1 id="changelog">Changelog<a class="headerlink" href="#changelog" title="Perma
         <article class="md-post md-post--excerpt">
   <header class="md-post__header">
 
+    <div class="md-post__meta md-meta">
+      <ul class="md-meta__list">
+        <li class="md-meta__item">
+          <time datetime="2026-04-29 00:00:00+00:00">April 29, 2026</time></li>
+        
+          <li class="md-meta__item">
+            in
+            
+              <a href="./" class="md-meta__link">Changelog</a></li>
+        
+        
+          
+          <li class="md-meta__item">
+            
+              3 min read
+            
+          </li>
+        
+        
+      </ul>
+      
+    </div>
+  </header>
+  <div class="md-post__content md-typeset">
+    <h2 id="deploying-sglang-with-pd-disaggregation-via-shepherd-model-gateway"><a class="toclink" href="../smg/">Deploying SGLang with PD disaggregation via Shepherd Model Gateway</a></h2>
+<p><code>dstack</code> is an open-source control plane that simplifies GPU orchestration for both training and inference — across cloud providers, hardware vendors, and frameworks. Over the past year, we've been steadily making inference a first-class citizen in dstack.</p>
+<p><img src="https://dstack.ai/static-assets/static-assets/images/smg.png" width="630"/></p>
+
+    
+      <nav class="md-post__action">
+        <a href="../smg/">
+            <span>Continue reading</span>
+            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
+        </a>
+      </nav>
+    
+    
+  </div>
+</article>
+      
+        <article class="md-post md-post--excerpt">
+  <header class="md-post__header">
+    
     <div class="md-post__meta md-meta">
       <ul class="md-meta__list">
         <li class="md-meta__item">
@@ -4760,51 +4803,6 @@ <h2 id="introducing-passive-gpu-health-checks"><a class="toclink" href="../gpu-h
   </div>
 </article>
 
-        <article class="md-post md-post--excerpt">
-  <header class="md-post__header">
-    
-    <div class="md-post__meta md-meta">
-      <ul class="md-meta__list">
-        <li class="md-meta__item">
-          <time datetime="2025-08-11 00:00:00+00:00">August 11, 2025</time></li>
-        
-          <li class="md-meta__item">
-            in
-            
-              <a href="./" class="md-meta__link">Changelog</a></li>
-        
-        
-          
-          <li class="md-meta__item">
-            
-              3 min read
-            
-          </li>
-        
-        
-      </ul>
-      
-    </div>
-  </header>
-  <div class="md-post__content md-typeset">
-    <h2 id="supporting-hot-aisle-amd-ai-developer-cloud"><a class="toclink" href="../hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
-<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads—without manual infrastructure setup.  </p>
-<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
-<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-hotaisle.png" width="630"/></p>
-<p>Today, we’re excited to announce native integration with <a href="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing.  </p>
-
-    
-      <nav class="md-post__action">
-        <a href="../hotaisle/">
-            <span>Continue reading</span>
-            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
-        </a>
-      </nav>
-    
-    
-  </div>
-</article>
-      
 
 
 
 
@@ -3921,6 +3921,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        Supporting Hot Aisle AMD AI Developer Cloud
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#rolling-deployment-secrets-files-tenstorrent-and-more" class="md-nav__link">
     <span class="md-ellipsis">
@@ -4018,17 +4029,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#introducing-gpu-blocks-and-proxy-jump-for-ssh-fleets" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Introducing GPU blocks and proxy jump for SSH fleets
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -4234,6 +4234,17 @@
     </label>
     <ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
 
+        <li class="md-nav__item">
+  <a href="#supporting-hot-aisle-amd-ai-developer-cloud" class="md-nav__link">
+    <span class="md-ellipsis">
+      
+        Supporting Hot Aisle AMD AI Developer Cloud
+      
+    </span>
+  </a>
+  
+</li>
+      
         <li class="md-nav__item">
   <a href="#rolling-deployment-secrets-files-tenstorrent-and-more" class="md-nav__link">
     <span class="md-ellipsis">
@@ -4331,17 +4342,6 @@
     </span>
   </a>
 
-</li>
-      
-        <li class="md-nav__item">
-  <a href="#introducing-gpu-blocks-and-proxy-jump-for-ssh-fleets" class="md-nav__link">
-    <span class="md-ellipsis">
-      
-        Introducing GPU blocks and proxy jump for SSH fleets
-      
-    </span>
-  </a>
-  
 </li>
 
     </ul>
@@ -4362,6 +4362,51 @@ <h1 id="changelog">Changelog<a class="headerlink" href="#changelog" title="Perma
         <article class="md-post md-post--excerpt">
   <header class="md-post__header">
 
+    <div class="md-post__meta md-meta">
+      <ul class="md-meta__list">
+        <li class="md-meta__item">
+          <time datetime="2025-08-11 00:00:00+00:00">August 11, 2025</time></li>
+        
+          <li class="md-meta__item">
+            in
+            
+              <a href="../../" class="md-meta__link">Changelog</a></li>
+        
+        
+          
+          <li class="md-meta__item">
+            
+              3 min read
+            
+          </li>
+        
+        
+      </ul>
+      
+    </div>
+  </header>
+  <div class="md-post__content md-typeset">
+    <h2 id="supporting-hot-aisle-amd-ai-developer-cloud"><a class="toclink" href="../../../hotaisle/">Supporting Hot Aisle AMD AI Developer Cloud</a></h2>
+<p>As the ecosystem around AMD GPUs matures, developers are looking for easier ways to experiment with ROCm, benchmark new architectures, and run cost-effective workloads—without manual infrastructure setup.  </p>
+<p><code>dstack</code> is an open-source orchestrator designed for AI workloads, providing a lightweight, container-native alternative to Kubernetes and Slurm.</p>
+<p><img src="https://dstack.ai/static-assets/static-assets/images/dstack-hotaisle.png" width="630"/></p>
+<p>Today, we’re excited to announce native integration with <a href="https://www.hotaisle.io/">Hot Aisle</a>, an AMD-only GPU neocloud offering VMs and clusters at highly competitive on-demand pricing.  </p>
+
+    
+      <nav class="md-post__action">
+        <a href="../../../hotaisle/">
+            <span>Continue reading</span>
+            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
+        </a>
+      </nav>
+    
+    
+  </div>
+</article>
+      
+        <article class="md-post md-post--excerpt">
+  <header class="md-post__header">
+    
     <div class="md-post__meta md-meta">
       <ul class="md-meta__list">
         <li class="md-meta__item">
@@ -4808,56 +4853,6 @@ <h2 id="auto-shutdown-for-inactive-dev-environmentsno-idle-gpus"><a class="tocli
   </div>
 </article>
 
-        <article class="md-post md-post--excerpt">
-  <header class="md-post__header">
-    
-    <div class="md-post__meta md-meta">
-      <ul class="md-meta__list">
-        <li class="md-meta__item">
-          <time datetime="2025-02-18 00:00:00+00:00">February 18, 2025</time></li>
-        
-          <li class="md-meta__item">
-            in
-            
-              <a href="../../" class="md-meta__link">Changelog</a></li>
-        
-        
-          
-          <li class="md-meta__item">
-            
-              4 min read
-            
-          </li>
-        
-        
-      </ul>
-      
-    </div>
-  </header>
-  <div class="md-post__content md-typeset">
-    <h2 id="introducing-gpu-blocks-and-proxy-jump-for-ssh-fleets"><a class="toclink" href="../../../gpu-blocks-and-proxy-jump/">Introducing GPU blocks and proxy jump for SSH fleets</a></h2>
-<p>Recent breakthroughs in open-source AI have made AI infrastructure accessible beyond public clouds, driving demand for
-running AI workloads in on-premises data centers and private clouds. 
-This shift offers organizations both high-performant clusters and flexibility and control.</p>
-<p>However, Kubernetes, while a popular choice for traditional deployments, is often too complex and low-level to address
-the needs of AI teams.</p>
-<p>Originally, <code>dstack</code> was focused on public clouds. With the new release, <code>dstack</code>
-extends support to data centers and private clouds, offering a simpler, AI-native solution that replaces Kubernetes and
-Slurm.</p>
-<p><img src="https://dstack.ai/static-assets/static-assets/images/data-centers-and-private-clouds.png" width="630"/></p>
-
-    
-      <nav class="md-post__action">
-        <a href="../../../gpu-blocks-and-proxy-jump/">
-            <span>Continue reading</span>
-            <span class="icon"><svg viewBox="0 0 13 10" xmlns="http://www.w3.org/2000/svg"><path d="M12.823 4.164L8.954.182a.592.592 0 0 0-.854 0 .635.635 0 0 0 0 .88l2.836 2.92H.604A.614.614 0 0 0 0 4.604c0 .344.27.622.604.622h10.332L8.1 8.146a.635.635 0 0 0 0 .88.594.594 0 0 0 .854 0l3.869-3.982a.635.635 0 0 0 0-.88z" fill-rule="nonzero" fill="currentColor" class="fill-main"></path></svg></span>
-        </a>
-      </nav>
-    
-    
-  </div>
-</article>
-